Article (Scientific journals)
Approximate dynamic programming with a fuzzy parameterization
Busoniu, Lucian; Ernst, Damien; De Schutter, Bart et al.
2010In Automatica, 46 (5), p. 804-814
Peer Reviewed verified by ORBi
 

Files


Full Text
aut10.pdf
Author postprint (1.52 MB)
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
approximate dynamic programming; fuzzy approximation; value iteration; convergence analysis
Abstract :
[en] Dynamic programming (DP) is a powerful paradigm for general, nonlinear optimal control. Computing exact DP solutions is in general only possible when the process states and the control actions take values in a small discrete set. In practice, it is necessary to approximate the solutions. Therefore, we propose an algorithm for approximate DP that relies on a fuzzy partition of the state space, and on a discretization of the action space. This fuzzy Q-iteration algorithm works for deterministic processes, under the discounted return criterion. We prove that fuzzy Q-iteration asymptotically converges to a solution that lies within a bound of the optimal solution. A bound on the suboptimality of the solution obtained in a finite number of iterations is also derived. Under continuity assumptions on the dynamics and on the reward function, we show that fuzzy Q-iteration is consistent, i.e., that it asymptotically obtains the optimal solution as the approximation accuracy increases. These properties hold both when the parameters of the approximator are updated in a synchronous fashion, and when they are updated asynchronously. The asynchronous algorithm is proven to converge at least as fast as the synchronous one. The performance of fuzzy Q-iteration is illustrated in a two-link manipulator control problem.
Disciplines :
Computer science
Author, co-author :
Busoniu, Lucian
Ernst, Damien  ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
De Schutter, Bart
Robert, Babuska
Language :
English
Title :
Approximate dynamic programming with a fuzzy parameterization
Publication date :
May 2010
Journal title :
Automatica
ISSN :
0005-1098
Publisher :
Pergamon Press - An Imprint of Elsevier Science, Oxford, United Kingdom
Volume :
46
Issue :
5
Pages :
804-814
Peer reviewed :
Peer Reviewed verified by ORBi
Funders :
F.R.S.-FNRS - Fonds de la Recherche Scientifique [BE]
Available on ORBi :
since 16 April 2010

Statistics


Number of views
108 (12 by ULiège)
Number of downloads
298 (3 by ULiège)

Scopus citations®
 
49
Scopus citations®
without self-citations
24
OpenCitations
 
47

Bibliography


Similar publications



Contact ORBi