Paper published in a book (Scientific congresses and symposiums)
Online least-squares policy iteration for reinforcement learning control
Busoniu, Lucian; Ernst, Damien; De Schutter, Bart et al.
2010In Proceedings of the 2010 American Control Conference
Peer reviewed
 

Files


Full Text
cdversion_0383.pdf
Publisher postprint (209.65 kB)
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
reinforcement learning; online LSPI
Abstract :
[en] Reinforcement learning is a promising paradigm for learning optimal control. We consider policy iteration (PI) algorithms for reinforcement learning, which iteratively evaluate and improve control policies. State-of-the-art, least-squares techniques for policy evaluation are sample-efficient and have relaxed convergence requirements. However, they are typically used in offline PI, whereas a central goal of reinforcement learning is to develop online algorithms. Therefore, we propose an online PI algorithm that evaluates policies with the so-called least-squares temporal difference for Q-functions (LSTD-Q). The crucial difference between this online least-squares policy iteration (LSPI) algorithm and its offline counterpart is that, in the online case, policy improvements must be performed once every few state transitions, using only an incomplete evaluation of the current policy. In an extensive experimental evaluation, online LSPI is found to work well for a wide range of its parameters, and to learn successfully in a real-time example. Online LSPI also compares favorably with offline LSPI and with a different flavor of online PI, which instead of LSTD-Q employs another least-squares method for policy evaluation.
Disciplines :
Computer science
Author, co-author :
Busoniu, Lucian
Ernst, Damien  ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
De Schutter, Bart
Babuska, Robert
Language :
English
Title :
Online least-squares policy iteration for reinforcement learning control
Publication date :
2010
Event name :
2010 American Control Conference
Event place :
Baltimore, United States
Event date :
June 30-July 2, 2010
Audience :
International
Main work title :
Proceedings of the 2010 American Control Conference
ISBN/EAN :
978-1-4244-7426-4
Pages :
486-491
Peer reviewed :
Peer reviewed
Funders :
F.R.S.-FNRS - Fonds de la Recherche Scientifique [BE]
Available on ORBi :
since 07 July 2010

Statistics


Number of views
171 (2 by ULiège)
Number of downloads
730 (1 by ULiège)

Scopus citations®
 
59
Scopus citations®
without self-citations
54
OpenCitations
 
28

Bibliography


Similar publications



Contact ORBi