After completing these steps, you will have fixed data quality issues using the data remediation preparation.
Click ‘View Preparations’.

Click ‘Preparation for PHARMA_CLAIMS_ENRICHED_.csv’

Click the header for the column named ‘DRUG_NAME’.

Click ‘Replace’.

Type ‘Nystatine’ for the search string and type ‘Nystatin’ for the ‘Replace by’. Click ‘Apply’.

The value ‘Nystatine’ was replaced by ‘Nystatin’. Click ‘Replace’.

Type ‘Acetaminofen’ for the search string and type ‘Acetaminophen’ for the ‘Replace by’. Click ‘Apply’.

The value ‘Acetaminofen’ was replaced by ‘Acetaminophen’. Click ‘<’ to go back to the ‘Rulebook Dataset Results’.

Click ‘<’ to go back to the ‘Run Preparation’.

Type ‘REMEDIATED_DATA_’ for the ‘Dataset Name’. Click ‘Apply’.

Click ‘Data Intelligence Metadata Explorer’. Click ‘Home’.

You fixed data quality issues using the data preparation recipe that was automatically generated with the execution of the rulebook. In a normal process, the output dataset resulting from that data remediation process would be used in order to update the original table that contains the data quality issues using workflow and approvals. For this hands-on, you can use the data preparation recipe you created to simulate the update process of the data and see the results in the rulebook.
Click ‘View Preparations’.

Click ‘DRUG_’.

Click ‘Recipe’. Edit the ‘Enrich’ action.

Click ‘+’ to add a new data source.

Click ‘Browse’. Select the connection ‘DI_DATA_LAKE’.

Click on the ‘shared’ -> ‘Pharma’ folders.

Search for the ‘’ folder. Click your folder.

Click ‘REMEDIATED_DATA_’.

Click ‘OK’.

The remediated dataset was added to the list of sources.

Click ‘X’ to delete the join with the dataset ‘D1’.

Drag and drop the ‘REMEDIATED_DATA_’ data source to the left side of ‘P1’ .

Select ‘Left Join’.

Scroll down and uncheck ‘VALID_DRUG’ and ‘POTENCY’ under the ‘left source’ list of columns.

Scroll down and uncheck ‘ORIG_PRODUCT’, ‘DOSAGE’, ‘ROUTE_ADMINISTERED’, and ‘NOTES’ from the right source list of columns. Click ‘Apply’.

Click ‘Apply Enrichment’.

Click ‘POTENCY_0’. Click ‘Rename’.

Type ‘POTENCY’ for the ‘New Column Name’. Click ‘Apply’.

Click ‘<’ to come back to the ‘Run Preparation’ menu.

You can now run your preparation to create a new curated dataset:

Click ‘Data Intelligence Metadata Explorer’. Click ‘Home’.

Click ‘rulebooks’.

Type ‘PHARMA_CLAIMS_’ in the ‘Filter rulebook names’ text field. Click on your rulebook.

Expand both ‘Accuracy’ and ‘Completeness’ set of rules.

Edit the rules binding associated to the first rule: ‘DRUG_NAME_VALID_’.

Delete the rule binding and click ‘Save’.

Click ‘Close’.

Expand ‘Accuracy’. Click ‘Create Rule Binding’.

Click ‘Step 2’.

Click ‘Browse’. Select ‘DI_DATA_LAKE’ for the ‘Connection’.

Click ‘shared’. Click ‘Pharma’. If you get a ‘Container is not supported by rules’ message then click directly on the folder name.

Filter ‘’ and select your folder. If you get a ‘Container is not supported by rules’ message then click directly on the your folder name.

Select ‘PHARMA_CLAIMS_CURATED_’. Then click ‘Step 3’.

Select ‘VALID_DRUG’ for the mapping. Then click ‘Create Binding’.

Click ‘Close’.

Expand ‘Completeness’, then edit the rule binding.

Delete the column mapping for ‘DRUG_NAME’. Then click ‘Save’.

Click ‘Close’.

Expand ‘Completeness’, then click ‘Create Rule Binding’.

Click ‘Step 2’.

Select the existing ‘Connection ID’ for ‘PHARMA_CLAIMS_CURATED_’, then click ‘Step 3’.

Click ‘Create Binding’.

Click ‘Close’.

Click ‘Run All’.

Click ‘Run’.

Wait for the rulebook execution to finish.

Click ‘View Results’.

Expand the dataset.

The quality of the data improved.

Well done! You have now fixed data quality issues using the data remediation preparation.