Reinforcement Learning in Continuous Spaces
OHP slides for the explanation
References
- [Baird 95b] Baird, L.:
Residual Algorithms: Reinforcement Learning with Function Approximation,
Proceedings of the 12th International Conference on Machine Learning,
pp.30--37 (1995).
- [Bertsekas et al. 96] Bertsekas, D.P. and Tsitsiklis, J.N.:
Neuro-Dynamic Programming, Athena Scientific (1996).
- [Santamaria et al. 98] Santamaria, J. C., Sutton, R. S. and Ram, A.:
Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces,
Adaptive Behavior 6 (2), pp.163--218 (1998).
- [Sutton et al. 98] Sutton, R. S. and Barto, A. G.:
Reinforcement Learning: An Introduction,
A Bradford Book, The MIT Press (1998).
Last updated: April 2, 2002