Speeding up Q (λ)- learning

Wiering, M.A.; Schmidhuber, J.

Speeding up Q (λ)- learning

DSpace/Manakin Repository

Speeding up Q (λ)- learning

Wiering, M.A.; Schmidhuber, J.

(1998) Lecture notes in computer science, volume 1398, pp. 352 - 363

(Article in proceedings)

Abstract

Q(λ)learning uses TD(λ)methods to accelerate Q-learning. The worst case complexity for a single update step of previous online Q(λ) implementations based on lookup tables is bounded by the size of the state action space.Our faster algorithm's worst case complexity is bounded by the number of actions. The algorithm is based on the ... read more

Download/Full Text

Open Access version via Utrecht University Repository

Keywords: Reinforcement learning, Q-learning, TD (λ), online Q (λ), lazy learning

Publisher: Springer

See more statistics about this item