Introducing Rule Mining

Objective

After completing this lesson, you will be able to describe how machine learning has been applied in data quality management.

Rule Mining

Apply Machine Learning to Discover Rules in Existing Master Data

Innovations

With SAP S/4HANA you can use rule mining to discover rules in existing master data:

  • Use mining runs to analyze existing master data.
  • Collaboratively decide on business relevancy of proposed rules from rule mining.
  • Create and link data quality rule from accepted rules.
  • Information from rule mining is used in implementation of data quality rule.
Business value
  • Ease and shorten the discovery of rules with machine learning.
  • Efficiently qualify and implement discovered rules as data quality rules.

Master Data Rule Mining Process

The figure explains the main steps of master data rule mining, which are: Define, Start, Find & Accept, and Implement rules. Once that is done, evaluate your data quality. This feature is enabled for Product, Business Partner including Customer and Supplier and Custom Objects. Use the Apps Manage Rule Mining and Process Rules from Rule Mining.

The preceding figure illustrates the typical process in master data rule mining.

Mining Run

The screenshot shows the initial screen of the app Manage Rule Mining for Products with the basic data.

Mining Run guides the system on how to find interesting rules:

Goal
Set your expectation.
Tables

A list of tables to be mined at the same time.

  • Focus Area: The data set you want to use for mining. For example, Product Type = Finished Goods (FERT).

  • Fields: Selected fields to find potential rules for:

    • Checked by Rule: the THEN part of the rule.
    • Condition of Rule: the IF part of the rule.
Parameters

Maximum number of rules: top N best rules proposed by the system, by default 100

The screenshot show a successful mining run for Lot sizing.

To start the mining run:

  • You are informed about the volume of data for mining before you confirm the mining run start.
  • A mining operation that is running can be stopped from the UI.
  • The system executes a machine learning algorithm in the background job (asynchronous). The end user is free to leave the screen.
  • To check the mining run status and the progress, check the message strip details in the header.
  • A restart or edit of the mining run is possible when it is set to Error by the program or it is manually stopped.

Managing the Mining Run

  • Filter by Status, Description, Focus Area Fields, or administration data, and so on.
  • Navigate to the rules from rule mining.
  • Create a new mining run by copying an old one.
  • To delete the proposed rules with Initial status together, choose Delete Mining Run.
The screenshot shows the app Process Rules from Master Data Rule Mining for Product, showing different rules and the corresponding Data Evaluation.

To explore a mining run:

  • Navigate from a completed Mining Run to the rules of this run.
  • Access the app from the launchpad to freely filter and search for rules from different mining runs.
  • Search rules by Condition Field name and Condition Field Value, Status, and so on.
  • Technical Description and Business View are both visible.

Review and Accept Rules

The figure illustrates the main facts about review an accepting rules, like: Review the rule from business meaning, bring expertise from others, data evaluation indicates how good the rule complies with data, accept the candidates for next step, and reject or delete those that don't make sense.

The figure illustrates the main facts about review and accepting rules.

Link to a Data Quality Rule

The screenshot shows the how to link to a new rule.

The figure above shows the possibility to create a link to a new quality rule. You can also:

  • Enhance an existing Data Quality Rule
  • Merge multiple accepted rules into one data quality rule
  • Navigate to the linked data quality rule
The screenshot shows the rules with a supported automatic implementation.

Facts about the illustration in the figure:

  • For many rules from rule mining, an automatic rule implementation in BRFplus is supported.

  • By adding a usage and preparing it, an active BRFplus rule implementation will be generated.

  • Information on rules from rule mining is provided in separate sections.

  • Approving the data quality rule and enabling the usage works as for any other data quality rule.

Domain Coverage

Master Data Governance apps for Data Quality Management are now not only available for product mass data, but also business partner data. These apps can be used directly without much configuration, without any coding or data modeling. You can also include the extensions you have made to the product master, customer master, vendor master or business partner master data model, into these apps.

The screenshot shows the Data Quality Management Apps for Products, Business Partner and Custom Objects.

Business partner and product master data are covered as packaged applications. This provides a platform for custom objects:

Generic apps with adaptation concept
  • Master Data Quality Rules, Export, Import
  • Configure Master Data Quality Scores
  • Evaluate Master Data Quality
  • Rule Mining
Basis for your own apps for data quality overview and analysis
  • Evaluation Results
  • Data Quality Evaluation Overview

The data quality apps reuse the same model information and are based on the flexibility of the MDG Consolidation and Mass Processing framework.

Data Quality Management with SAP Master Data Governance: Summary

DQM provides unique benefits:

  • Central repository for data quality rules, striving to cover every SAP Master Data Governance process

  • Out-of-the box applications for product and business partner master data

  • One platform covering multiple domains: product, business partner, and custom objects

  • Integrated rule implementation with predefined data model, value helps, …

  • Easy access using SAP Fiori launchpad, like all SAP S/4HANA functions

  • Reuse of application authorizations when analyzing evaluation results

  • Drill-down from KPIs down to direct access to active master data in one single place

  • Insight to action: Start correction from the detected error