Adding a Data Lake to your SAP HANA Cloud Database

Objectives
After completing this lesson, you will be able to:

  • Add an Integrated Data Lake to the SAP HANA Cloud Database

Addition of an Integrated Data Lake to SAP HANA Cloud

Lesson Overview

In this lesson, you'll learn how to attach an SAP HANA data lake instance to the SAP HANA database in your SAP BTP account.

Business Case

Watch the following video to learn about the business case for adding an integrated data lake.

The ability to add a data lake to an SAP HANA Cloud database instance depends on the entitlements and the region where you've deployed your SAP HANA Cloud database.

Add an Integrated Data Lake

Use the following path to navigate from the SAP BTP Cockpit to the SAP HANA Cloud area:

  • On the SAP BTP Cockpit – Global Account page, select <your subaccount> and then <your Cloud Foundry space>. In the screenshot, that is 1234abcdtrailtrialdev. The breadcrumbs area (1) shows your location in SAP BTP.
  • On the Space: dev - Applications page, in the menu panel, select the SAP HANA Cloud (2) button.
  • To manage your SAP HANA database and data lake instances, select the Manage SAP HANA Cloud (3) button.

The SAP HANA Cloud Central app gives you an overview of all your databases. You can see the status of each database and perform database administration tasks such as starting, stopping, or upgrading a database, or adding an integrated data lake. Because of the security settings, you see only the databases that you have permission to access.

SAP HANA Cloud Central opens in a new browser tab, which makes it easier to switch between SAP HANA Cloud Central and the SAP BTP Cockpit.

Select the °°° (2) button to open the Actions Menu for the selected cloud database. From the Actions Menu, select the Add Data Lake (3) option.

In the Create Data Lake Instance wizard, you need to specify the Data Lake instance name and description in the Basics area. You must also enter and confirm a strong password.

Note
The user HDLADMIN is automatically created and assigned to you for data lake administration tasks.

In the Size area of the input screen, you can specify the number of vCPUs for the Compute and Storage nodes. The minimum required number of vCPUs is calculated from the storage size. In trial and free tier accounts, the storage size is fixed, so you have to accept the given number of vCPUs.

Note
In an enterprise SAP BTP account, you can also select the Storage Service Type. You can choose between SAP Native Data Lake files and AWS Elastic File System (EFS). In free tier and trial accounts, this is fixed to AWS Elastic File System (EFS) and can't be changed.

In the Connections area, you specify which IP addresses are allowed to connect to your data lake instance. Depending on your needs, you can keep connections local to SAP BTP, allow only specific IP addresses and ranges of your company, or allow all IP addresses.

Caution
Allowing all IP addresses means that your data lake is exposed to the whole internet!
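
To make the difference between the connection options more concrete, the following minimal Python sketch shows how an IP allowlist behaves in general. It is not how SAP HANA Cloud implements the check. The function name is_allowed, the example ranges (taken from the address blocks reserved for documentation), and the representation of "all IP addresses" as 0.0.0.0/0 are assumptions made for illustration only.

```python
import ipaddress
from typing import Sequence


def is_allowed(client_ip: str, allowed_ranges: Sequence[str]) -> bool:
    """Return True if client_ip falls inside any of the allowed CIDR ranges."""
    address = ipaddress.ip_address(client_ip)
    for cidr in allowed_ranges:
        # strict=False tolerates entries such as "203.0.113.5/24"
        network = ipaddress.ip_network(cidr, strict=False)
        # Compare only addresses of the same IP version (IPv4 vs. IPv6)
        if address.version == network.version and address in network:
            return True
    return False


# Hypothetical company ranges: one subnet and one single address
COMPANY_RANGES = ["203.0.113.0/24", "198.51.100.17/32"]

# Assumption: "allow all IP addresses" modeled as the widest IPv4 range
ALLOW_ALL = ["0.0.0.0/0"]

print(is_allowed("203.0.113.42", COMPANY_RANGES))  # inside the company subnet
print(is_allowed("192.0.2.10", COMPANY_RANGES))    # outside every listed range
print(is_allowed("192.0.2.10", ALLOW_ALL))         # any IPv4 address matches
```

With the specific ranges, only clients from the listed subnet or address are accepted. With the catch-all range, every IPv4 client is accepted, which is why that option calls for caution.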

By default, the provisioning wizard configures and schedules data lake backups. You can manually disable this feature if no backups are required.

Customer Controlled Key Management

The Encryption Key Management Service is also available for SAP HANA data lake. This allows you to use the customer-controlled encryption key (CCEK) feature, which integrates with the SAP Data Custodian Key Management Service (KMS). Follow the links to find more information on how to use the Customer Controlled Key Management Services and SAP Data Custodian.
