Football Table RL

From Control Systems Technology Group
Revision as of 16:21, 11 September 2013 by S041723 (talk | contribs)
Jump to navigation Jump to search

Reinforcement Learning

The football table employs on-line value iteration, namely Greedy-GQ[math]\displaystyle{ (\lambda) }[/math] and Approximate-Q[math]\displaystyle{ (\lambda) }[/math].

Value Function Approximation