Sharing SAP Data to SAP Databricks

Objective

After completing this lesson, you will be able to share SAP Data to SAP Databricks

Sharing an SAP Business Data Cloud Data Product

This lesson guides you through the process of sharing a data product from SAP Business Cloud (BDC) with SAP Databricks.

Sharing a Data Product to SAP Databricks

Before you are able to share a data product, there are some tasks that must be completed:

  • Add SAP Databricks to your SAP BDC formation.
  • Activate one or more data packages that provide the data products that you wish to share.
  • Ensure you have been assigned the Data Catalog Administrator role in SAP BDC.
  • Ensure you have been assigned the USE PROVIDER and CREATE_CATALOG roles in SAP Databricks.

Share an SAP Data Product with SAP Databricks

Here are the steps to share an SAP-delivered data product with SAP Databricks:

  1. Launch the SAP BDC cockpit.
  2. Choose Search and locate the data product you want to share.
  3. Select the data product and choose Share to open the Manage Share Access dialog.
  4. Choose Add Target.
  5. Choose your target system (SAP Databricks).
  6. Enter the name to use for the generated Delta Share catalog.

    Note

    The Delta Share catalog is the SAP Databricks component where your shared data will become available.
  7. Select one or more SAP Databricks workspaces in which the shared data will be available.

    Note

    In SAP Databricks, a workspace is like a custom area in the cloud where your team can access and work with SAP Databricks tools and resources.

    Your organization can choose to define either one workspace for everyone to use or several workspaces for individual teams.

SAP Company Code data product card, shared status, active, with details.

Once your data product is shared, you'll see the Shared status appear in the SAP BDC catalog.

SAP Databricks catalog menu with companycode_share highlighted under Delta Shares Received.

In SAP Databricks, you'll see your shared data product in the Delta Shares Received section of the Unity Catalog.

Unity Catalog

Unity Catalog is a tool for managing data and AI resources in SAP Databricks. It serves as the central library that helps you:

  • Control who can access specific resources.
  • Track changes in data and resources.
  • Identify the origin of data.
  • Locate the data you need across all your SAP Databricks workspaces.

In the Unity Catalog, data objects are organized in a three-level hierarchy:

  • Catalogs are the main components for organizing data in SAP Databricks. You can share catalogs across workspaces within the same region and account.
  • Catalogs are subdivided into schemas.
  • Schemas contain tables that organize and govern access to structured data. In SAP Databricks, most tables are actually Delta tables (also known as Delta Lake tables).

    Note

    A Delta table is a specific file format used for volume-optimized and access-efficient storage.

Data Product and Delta Share

In SAP BDC, the business data (for company code in our example) is packaged into a data product. The data product also includes an ORD file that is required for discoverability. Open Resource Discovery (ORD) is an open, industry standard protocol that is widely used for metadata discovery. ORD makes it possible for any compliant consumer to discover data products in SAP BDC.

In SAP Databricks, the data is packaged into a catalog of type DELTASHARING_CATALOG (companycode_share in our example). By default, the catalog name is the name you defined in the Manage Share Access dialog. You need to have the MANAGE permission on the shared catalog to rename it.

This Delta Share catalog contains a Schema (companycode). A schema groups of all the delta tables that correspond to your data product.

SAP Databricks catalog explorer showing details of the companycode schema from the companycode_share delta share, with catalog type DELTASHARING_CATALOG highlighted.

You are now ready to query the delta tables.

Share SAP Data with SAP Databricks

In this practice exercise, you will learn how to share an SAP data product with SAP Databricks.

Removing Shared Product

Once the data product has been shared, it will appear in SAP Databricks, but you cannot remove it from there.

To remove the shared data product, you need to return to the SAP BDC cockpit and select the Share option again.

Manage Access dialog lists one target system with options to edit or remove it. The delete icon is highlighted.

In the Manage Access dialog, choose the Delete (cross) icon to remove the data product from the selected target system.

Summary

  • Active SAP data products can be shared from SAP BDC cockpit.
  • When shared in an SAP Databricks workspace, they become Delta Share catalogs.
  • Data is available through delta tables presented in the Schema of this catalog.
  • To remove a shared data product from SAP Databricks you need to delete the target system in the Manage Access dialog of this data product in SAP BDC cockpit.