Article (Scientific journals)
Cross-entropy optimization of control policies with adaptive basis functions
Busoniu, Lucian; Ernst, Damien; Babuska, Robert et al.
2011In IEEE Transactions on Systems, Man and Cybernetics. Part B, Cybernetics, 41 (1), p. 196-209
Peer Reviewed verified by ORBi
 

Files


Full Text
finalieeeversion_05491120.pdf
Publisher postprint (1.21 MB)
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
Markov decision processes; direct policy search; adaptive basis functions; cross-entropy optimization
Abstract :
[en] This paper introduces an algorithm for direct search of control policies in continuous-state, discrete-action Markov decision processes. The algorithm looks for the best closed-loop policy that can be represented using a given number of basis functions (BFs), where a discrete action is assigned to each BF. The type of the BFs and their number are specified in advance and determine the complexity of the representation. Considerable flexibility is achieved by optimizing the locations and shapes of the BFs, together with the action assignments. The optimization is carried out with the cross-entropy method and evaluates the policies by their empirical return from a representative set of initial states. The return for each representative state is estimated using Monte Carlo simulations. The resulting algorithm for crossentropy policy search with adaptive BFs is extensively evaluated in problems with two to six state variables, for which it reliably obtains good policies with only a small number of BFs. In these experiments, cross-entropy policy search requires vastly fewer BFs than value-function techniques with equidistant BFs, and outperforms policy search with a competing optimization algorithm called DIRECT.
Disciplines :
Computer science
Author, co-author :
Busoniu, Lucian
Ernst, Damien  ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Babuska, Robert
De Schutter, Bart
Language :
English
Title :
Cross-entropy optimization of control policies with adaptive basis functions
Publication date :
February 2011
Journal title :
IEEE Transactions on Systems, Man and Cybernetics. Part B, Cybernetics
ISSN :
1083-4419
eISSN :
1941-0492
Publisher :
Institute of Electrical and Electronics Engineers, New-York, United States - New York
Volume :
41
Issue :
1
Pages :
196-209
Peer reviewed :
Peer Reviewed verified by ORBi
Funders :
F.R.S.-FNRS - Fonds de la Recherche Scientifique [BE]
Available on ORBi :
since 09 February 2011

Statistics


Number of views
78 (2 by ULiège)
Number of downloads
494 (2 by ULiège)

Scopus citations®
 
60
Scopus citations®
without self-citations
49
OpenCitations
 
46

Bibliography


Similar publications



Contact ORBi