I managed a glitch at …

  • I made a slip at the end of Monday's lecture: the 'policy update' step is not about updating the value function, but about finding the new control. (I left the formula from the value iteration on the board, but I should of course have removed the right-hand side and replaced 'sup' by 'argsup' or 'argmax'.)
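
To illustrate the distinction, here is a minimal sketch in Python. The toy problem is entirely hypothetical and not taken from the lecture: two states, two controls, a discount factor `BETA`, and made-up tables `REWARD` and `NEXT_STATE`. Both steps maximise the same expression; the value update keeps the maximum, while the policy update keeps the control that attains it.

```python
# Hypothetical toy problem (not from the lecture): 2 states, 2 controls.
BETA = 0.9                          # assumed discount factor
REWARD = [[1.0, 0.0], [0.0, 2.0]]   # REWARD[x][u]: reward in state x under control u
NEXT_STATE = [[0, 1], [0, 1]]       # NEXT_STATE[x][u]: deterministic successor state
CONTROLS = range(2)
STATES = range(2)


def q_value(value, x, u):
    """Reward now plus discounted value of the successor state."""
    return REWARD[x][u] + BETA * value[NEXT_STATE[x][u]]


def value_update(value):
    """Value iteration step: take the max over controls (the 'sup')."""
    return [max(q_value(value, x, u) for u in CONTROLS) for x in STATES]


def policy_update(value):
    """Policy update step: take the maximising control (the 'argmax')."""
    return [max(CONTROLS, key=lambda u: q_value(value, x, u)) for x in STATES]


value = [0.0, 0.0]
print(value_update(value))   # [1.0, 2.0]  (new values)
print(policy_update(value))  # [0, 1]      (new controls)
```

Note that `policy_update` returns controls, not values, which is exactly the point of the correction above.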

- Nils

Published Sep. 6, 2011 7:54 PM - Last modified Nov. 11, 2011 8:02 PM