Introducing to Process Data Pipelines

Objective

After completing this lesson, you will be able to explain the different options for loading data into the system

Introducing to Process Data Pipelines

Process Data Pipelines.

Process Data Pipeline

A process data pipeline defines how data is extracted from a source system, transformed according to business rules, and loaded into a process for further use.

A process data pipeline defines how data is extracted from a source system, transformed using business rules, and then loaded into a designated process:

  • Connection – Establishes access to the source system
  • Process – The destination or dashboard where the transformed data is loaded
  • Source Data – Specifies what data should be extracted
  • Transformation Configuration – Includes the business logic and rules used to transform the extracted data

By understanding and setting up these components, you’ll ensure your pipeline runs efficiently and delivers reliable, usable data to your processes.

Extract: setup source system connections, configure data integrations, and schedule recurring data extractions. Transform: Define business objects, compose SQL scripts, and create data model tables. Load: link a process, execute transformations, and create data model.

Process Data Pipeline: Data Integration

Process Intelligence ETL consists of three components.

Connections

  • To set up a new data pipeline, the first step is to establish a connection to the source system.
  • Depending on your source system, an enterprise system, database, cloud, or API - choose the appropriate connection type and enter your connection details and make your connection valid.

On-Premises Extractors

  • Using on-premises extractor, you can connect your on-premises source systems to Process Data Management in SAP Signavio Process Intelligence
  • An on-premises extractor is installed and operated within company's own physical infrastructure. The primary function of an on-premises extractor is to collect and extract data from various sources hosted in your company's environment.

Source Data

  • In source data, you define which tables and columns you want to extract from source systems.
  • You can manage data extraction filters, partition strategies in the source data

Process data Pipelines link all three components to create an end-to-end pipeline for your process analysis.

Data Transformation Templates

Data transformation templates—part of the Value Accelerators for SAP Signavio Process Intelligence—streamline the creation of process data pipelines.

For different source systems, preconfigured templates are offered that include the business process, data extraction, and transformation logic. These templates are built around common business scenarios like Lead-to-Opportunity, Lead-to-Quote, and Incident-to-Resolution. Explore the full list of available templates to quickly build your process data models. The templates include the following:

  • A common business process with activities and business objects
  • A definition of which data to extract along with the list of all tables and columns
  • Scripts to transform the extracted data

Note

For a detailed description of each template visit Data transformation templates.

To use the templates, you need the SAP Signavio Process Intelligence – Data Modelling feature set. Your workspace administrator can enable this for you.

Please note: Template documentation is accessible only to SAP Signavio users with an active SAP Signavio Process Intelligence license.

To learn more about how to create a process data pipeline, visit our SAP Signavio Process Intelligence User Guide.

Access Requirements for Data Pipelines

Let's have an overview of ETL prerequisites, covering feature sets, connectors, transformation templates, and source system compatibility.

You can access the data pipelines only if the following requirements are met:

Data Source Management

The data source management is the framework to manage online data sources. It includes credential management and scheduling.

Screenshot shows the Source Data within SAP Signavio Process Intelligence.

Integration Management

The integration management is the framework to define what, how, and when to extract data. It includes pseudonymisation and partitioning schemas.

Screenshot shows the Integration Management within SAP Signavio Process Intelligence