Donnerstag, 28. April 2011

1. Reinforcement Learning

Markov Decision Processes, Bellman-Gleichungen, Temporal Differences Learning. Folien.
Monte-Carlo Sampling, Diskretisierung, Approximate Policy Iteration. Folien.