連続な空間における強化学習

説明用OHP資料

[Baird 95b] Baird, L.:
Residual Algorithms: Reinforcement Learning with Function Approximation,
Proceedings of the 12th International Conference on Machine Learning, pp.30--37 (1995).
[Bertsekas et al. 96] Bertsekas, D.P. and Tsitsiklis, J.N.:
Neuro-Dynamic Programming, Athena Scientific (1996).
[Santamaria et al.98] Santamaria, J. C., Sutton, R. S. and Ram, A.:
Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces,
Adaptive Behavior 6 (2), pp.163--218 (1998).
[Sutton et al. 98] Sutton, R. S. and Barto, A.:
Reinforcement Learning: An Introduction,
A Bradford Book, The MIT Press (1998).