Football Table RL

From Control Systems Technology Group
Jump to navigation Jump to search

Reinforcement Learning

The football table employs on-line value iteration, namely Greedy-GQ[math]\displaystyle{ (\lambda) }[/math] and Approximate-Q[math]\displaystyle{ (\lambda) }[/math]. This page does not explain Reinforcement learning theory, it just touches on the usage and implementation of the provided library (libvfa located on the SVN). [ http://webdocs.cs.ualberta.ca/~sutton/book/the-book.html Reinforcement Learning: an introduction]


Value Function Approximation