It’s time to put what you’ve learned to the test, get 8 questions right to pass this week.
Q1.
What influence does a reward typically have on the learning process in reinforcement learning?
Choose the correct answer.
A
All actions that contributed to the reward are shown more frequently in the future.
B
Only the action that directly led to the reward will be shown more often in the future.
C
None, as long as the agent has not also experienced a punishment.
D
All actions that did not contribute to obtaining the reward will be shown less frequently in the future.
Q2.
In the banana hunting game, why does the monkey jump even if there is no barrel nearby?
Choose the correct answer.
A
Unnecessary jumping does not lead to any punishment.
B
The monkey has a very high learning rate.
C
Not jumping was punished.
D
The agent alternates between the jumping and not jumping actions.
Q3.
What is weak AI?
Choose the correct answer.
A
Software that can solve a specific problem
B
Software that solves problems slower than humans
C
Computers with artificial intelligence and very high power consumption
D
Software that does not require training data to solve specific problems
Q4.
Which pseudocode can be used to describe the reward and punishment for the agent in the banana hunt game?
Choose the correct answer.
A
if touching barrel then -10 else 1
B
if touching mouse pointer then -10 else 1
C
wait until touching barrel
D
if -10 > 1 then touching barrel else touching floor
Q5.
Which of the following applications is considered an example of AI?
There are 2 correct answers.
A
Style change filters for a photo, which for example transfer the art style of a well-known painter.
B
Music recommendations based on one's musical taste.
C
A huge database that contains many millions of videos and enables their playback.
D
Calculating the mean value of a series of numbers.
Q6.
What is the second step in reinforcement learning?
Choose the correct answer.
A
Try action
B
Capture state
C
Receive reward (or punishment)
D
Adjust strategy
Q7.
What is the relationship between machine learning and artificial intelligence?
Choose the correct answer.
A
Machine learning is a subfield of artificial intelligence.
B
Artificial intelligence is a subfield of machine learning.
C
Machine learning is another term for artificial intelligence.
D
Machine learning describes the opposite of artificial intelligence.
Q8.
Which of the following statements about the model in the Banana Hunt example are true?
There are 3 correct answers.
A
The values for a new state are initialized in the model whenever this state occurs for the first time.
B
A positive value for a specific action in a specific state in the model means that the monkey has received more rewards than punishments for this action in this state.
C
The model is updated after each action.
D
The model table consists of 2 columns, the values for the "jump" and the "do nothing" action.
E
The model is initialized at the beginning of the program and updated once after the program is stopped.
Q9.
Which principle is essential to reinforcement learning?
Choose the correct answer.
A
The agent receives a reward or punishment at certain points in time and thus learns to assess what value an action has in a certain state.
B
The agent receives instructions on how to behave in a particular state.
C
A programmer has specified the best action for the agent in a given state. If it follows their recommendation, it is rewarded.
D
Reward and punishment are balanced in their frequency of occurrence.
Q10.
What is the name of a system that can help people solve more complex problems by deriving recommended actions from a knowledge base?