It’s time to put what you’ve learned to the test, get 8 questions right to pass this week.
Q1.
When you use a train-and-test regime, which subset is used to develop the model?
Choose the correct answer.
A
Train
B
Validation
C
Test
D
Complete data set
Q2.
In which task of the business understanding phase of the cross-industry standard process for data mining (CRISP-DM) methodology do you list the risks and contingencies?
Choose the correct answer.
A
Determine business objectives
B
Determine data science goals
C
Produce project plan
D
Assess situation
Q3.
Which phase of the cross-industry standard process for data mining (CRISP-DM) methodology immediately precedes the deployment phase?
Choose the correct answer.
A
Monitoring
B
Evaluation
C
Modeling
D
Business understanding
Q4.
What are the capabilities provided by SAP HANA Automated Predictive Library (APL)?
There are 3 correct answers.
A
Business Intelligence (BI) dashboards
B
Recommendations
C
Spreadsheet analysis
D
Time series analysis
E
Classification models
Q5.
Which statement regarding the cross-industry standard process for data mining (CRISP-DM) methodology is true?
Choose the correct answer.
A
It was developed to be used only with IBM SPSS software.
B
It is only used in financial services industries.
C
It has five different generic phases.
D
It focuses on business issues and technical analysis.
Q6.
Which of the following functions are integrated in SAP Analytics Cloud?
There are 3 correct answers.
A
Business Intelligence (BI)
B
Planning
C
TensorFlow
D
Predictive
E
Robotic Process Automation (RPA)
Q7.
In which report generated in the data understanding phase of the cross-industry standard process for data mining (CRISP-DM) methodology do you list the quantity of data?
Choose the correct answer.
A
Initial data quality report
B
Data description report
C
Data exploration report
D
Data quality report
Q8.
Why do you sometimes need to add a monitoring phase into the cross-industry standard process for data mining (CRISP-DM) process?
There are 3 correct answers.
A
Because using a new type of modeling algorithm might improve accuracy.
B
Because there is a change to the business question that the model is designed to answer.
C
Because the model's performance degrades in time.
D
Because changes to the general business environment might mean the existing model needs updating.
E
Because the data that we apply the model onto has changed in some way.
Q9.
Which of the following are results of an Initial Data Analysis (IDA)?
There are 3 correct answers.
A
The analysis of descriptive statistic.
B
An analysis of the quality of the data.
C
An assessment of the assumptions on which the analysis will be based.
D
The proposal of hypotheses about the causes of observed phenomena.