Introduction to multi-source data landscape

Introduction to multi-source data landscape

Multi-Source data landscape

SAP HANA Cloud is designed to be the single gateway to all your data – no matter where data is stored, be it on a local on-premise database, on an SAP HANA Cloud instance or in non-SAP cloud databases. You can connect any data storage to SAP HANA Cloud and integrate all the data to then manage it in one place.

In this scenario, connections have been pre-configured for data stored in non-SAP and SAP cloud data sources:

  • Importing and Exporting data from Amazon S3.
  • Importing and Exporting data from SAP HANA Cloud, Data Lake Files

Multi-source landscape

Work with an example of a hybrid data landscape where multiple data sources such as an SAP HANA Cloud data lake or Amazon S3 cloud storage are configured.

  • Amazon Simple Storage Service (Amazon S3) An object storage service offering industry-leading scalability, data availability, security, and performance. S3 is built to store and retrieve any amount of data from anywhere. Data is stored as objects within resources called buckets, and a single object can be up to 5 terabytes in size.

  • SAP HANA Cloud, Data Lake Files Use SQL commands to import and export data directly from SAP HANA Cloud data lake file storage by specifying the path to the file store and creating the required credentials for users who need to access data lake.

    • Provides a convenient, efficient storage layer for structured, semi-structured, and unstructured data in SAP HANA Cloud
    • Automatically enabled and fully managed when you provision a data lake relational engine.
    • Certificate based access, authorization management and user-role definition.
    • Provides a consistent API and file naming across hyperscalers (WebHDFS based).
    • Serve as landing zone for data from external sources, e.g. Kafka.
    • Data lake files can be shared with other tools, e.g. Spark.