連続な空間における強化学習

説明用OHP資料

参考文献

  1. [Baird 95b] Baird, L.:
    Residual Algorithms: Reinforcement Learning with Function Approximation,
    Proceedings of the 12th International Conference on Machine Learning, pp.30--37 (1995).
  2. [Bertsekas et al. 96] Bertsekas, D.P. and Tsitsiklis, J.N.:
    Neuro-Dynamic Programming, Athena Scientific (1996).
  3. [Santamaria et al.98] Santamaria, J. C., Sutton, R. S. and Ram, A.:
    Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces,
    Adaptive Behavior 6 (2), pp.163--218 (1998).
  4. [Sutton et al. 98] Sutton, R. S. and Barto, A.:
    Reinforcement Learning: An Introduction,
    A Bradford Book, The MIT Press (1998).

「強化学習の概要」へもどる
2002年4月2日更新