Exam clarifications/corrections
Hi everyone!
Here we will post clarifications/corrections to the exam text.
Problem 2 - Fitness Sharing: There seems to have gotten in 2 small errors here. 1) The value alpha (=1) does not belong in the formula. You can ignore that value. 2) The problem refers to the fitness value you calculated in a. That is incorrect, what is meant is the fitness value that is listed in the table on the left. In a, you actually calculated selection probabilities, not fitness values.
Problem 3: " 'dollar' occurs. number" is the number of occurrences of the word 'dollar' in the document.
Problem 5: The learning rate is given in (5a).
Problem 7b: If you're wondering what policy the agent used here, the exact policy doesn't really matter - but the actions chosen by its policy are the actions illustrated in the figure: Going right, followed by down.
Problem 9: In case it wasn't clear, in task c) you should deliver the drawing from one iteration through the full string F[+F][-F]
Good luck!!
-Kai and Jan Tore