The Python machine learning client for SAP HANA (hana-ml) is a Python package that enables data scientists to access SAP HANA data. Practitioners can build machine learning models using the data directly in SAP HANA [2].
The Python machine learning client for SAP HANA consists of two main components:
- SAP HANA DataFrame, which provides a set of methods for accessing and querying data in SAP HANA without bringing the data to the client.
- A set of machine learning APIs for developing machine learning models.
The machine learning APIs are composed of two packages:
PAL package
PAL package consists of a set of Python algorithms and functions which provides access to machine learning capabilities in the SAP HANA Predictive Analysis Library (PAL). SAP HANA PAL functions cover a variety of machine learning algorithms for training a model and then the trained model is used for scoring.
APL package
Automated Predictive Library (APL) package exposes the data mining capabilities of the automated analytics engine in SAP HANA through a set of functions. These functions develop a predictive modeling process that analysts can use to answer simple questions on their customer datasets stored in SAP HANA.
HANA-ML uses the SAP HANA Python driver (hdbcli) to connect to and access SAP HANA. For further information, please refer to [2].