Generative AI Hub
Generative AI hub covers the AI lifecycle efforts end to end and enables customers to develop, deploy, and manage custom-built AI solutions and AI-powered extensions of SAP applications.
Access: Enables access to frontier AI models, out-of-the-box selection of compute resources, and orchestration modules (like content filtering, data masking, and more).
Exploration and Development: Provides a comprehensive toolset for building of custom AI solutions and model exploration, including a prompt editor, prompt management, prompt registry, libraries, SDKs, and a fine-tuning service. This also includes an AI playground, chat, and prompt management to explore different models, meta data and parameter changes to find the best fitting technology for your needs. All in a secure and safe environment to interact with cutting-edge technology.
Deployment and Delivery: Software deployment includes all the steps, processes, and activities that are required to make a software system or update available to its intended users. Today, most IT organizations and software developers deploy software updates, patches, and new applications with a combination of manual and automated processes.
Support for Bring Your Own Model (BYOM) as well as allocation of compute resources, (re) training, and serving template workflows. This facilitates efficient deployment and delivery of AI models and applications to ensure they are operational and accessible to users.
Orchestration: AI orchestration refers to the process of coordinating and managing the deployment, integration, and interaction of various AI components within a system or workflow. This includes orchestrating the execution of multiple AI models, managing data flow, and optimizing the utilization of computational resources.
AI orchestration aims to streamline and automate the end-to-end life cycle of AI applications. It ensures the efficient collaboration of different AI models, services, and infrastructure components, leading to improved overall performance, scalability, and responsiveness of AI systems.
Coordinating and managing AI compute workflows and scheduling, content moderation, data masking, agent deployment, grounding capabilities, and inference engines to ensure seamless and efficient operation of AI systems.
Governance: Implementing policies and procedures to manage the development, deployment, and operation of AI systems in compliance with regulatory and organizational standards. This includes logs for tracking and auditing purposes, metering, monitoring, multi-tenancy, CaaS flow, as well as roles and responsibilities.
Adaptability (Adaptation): With custom AI model it's crucial to constantly adapt, whether you need new better models, different model architecture, or simply exchange the underlying dataset.
Benefiting from easier interaction with LLMs for grounding, fine-tuning, custom AI models, and AI Agents, you can drive your AI adaptation at your pace without the need to move to other services. Adaptability allows you to:
- Adapt easily by switching models when needed or change orchestration configuration on the fly. Adapt your prompts to be more agnostic by saving different variants that call different LLMs.
- Switch models, configure orchestration, add further modules, and register prompt variants for different LLMs.
- Benefit from configurations, lifecycle, and change of custom models and content packages.
Trust & Security: Ensuring data privacy, data isolation, and robust security measures to protect AI systems and their data:
- No automatic saving of prompts or data
- Ensuring data masking and content filtering (prompt injection, jailbreak)
- SOC 2, NIST, ISO certifications
Vector Engine
With our grounding capability, we've integrated an SAP managed vector engine (powered by SAP HANA cloud) to simplify retrieving your business documents relevant to a question or task and providing them as context for the LLM.
In summary, the generative AI hub offers a comprehensive suite of tools and services to integrate AI into your applications, exploring generative AI capabilities in a safe environment, ensuring trust, control, and seamless access to foundation models and business data.