Convergence of Model-Based Temporal Difference Learning for Control

Hasselt, H. van; Wiering, M.A.

Convergence of Model-Based Temporal Difference Learning for Control

DSpace/Manakin Repository

Convergence of Model-Based Temporal Difference Learning for Control

Hasselt, H. van; Wiering, M.A.

(2007) Proceedings of IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL)

(Article in proceedings)

Abstract

A theoretical analysis of Model-Based Temporal Difference Learning for Control is given, leading to a proof of convergence. This work differs from earlier work on the convergence of Temporal Difference Learning by proving convergence to the optimal value function. This means that not the values of the current policy are found, but instead the policy ... read more

Download/Full Text

Open Access version via Utrecht University Repository

See more statistics about this item