Reference : Approximate dynamic programming with a fuzzy parameterization
Scientific journals : Article
Engineering, computing & technology : Computer science
http://hdl.handle.net/2268/2769
Approximate dynamic programming with a fuzzy parameterization
English
Busoniu, Lucian [ > > ]
Ernst, Damien mailto [Université de Liège - ULg > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation >]
De Schutter, Bart [ > > ]
Robert, Babuska [ > > ]
May-2010
Automatica
Pergamon Press - An Imprint of Elsevier Science
46
5
804-814
Yes (verified by ORBi)
International
0005-1098
Oxford
United Kingdom
[en] approximate dynamic programming ; fuzzy approximation ; value iteration ; convergence analysis
[en] Dynamic programming (DP) is a powerful paradigm for general, nonlinear optimal control. Computing exact DP solutions is in general only possible when the process states and the control actions take values in a small discrete set. In practice, it is necessary to approximate the solutions. Therefore, we propose an algorithm for approximate DP that relies on a fuzzy partition of the state space, and on a discretization of the action space. This fuzzy Q-iteration algorithm works for deterministic processes, under the discounted return criterion. We prove that fuzzy Q-iteration asymptotically converges to a solution that lies within a bound of the optimal solution. A bound on the suboptimality of the solution obtained in a finite number of iterations is also derived. Under continuity assumptions on the dynamics and on the reward function, we show that fuzzy Q-iteration is consistent, i.e., that it asymptotically obtains the optimal solution as the approximation accuracy increases. These properties hold both when the parameters of the approximator are updated in a synchronous fashion, and when they are updated asynchronously. The asynchronous algorithm is proven to converge at least as fast as the synchronous one. The performance of fuzzy Q-iteration is illustrated in a two-link manipulator control problem.
Fonds de la Recherche Scientifique (Communauté française de Belgique) - F.R.S.-FNRS
Researchers ; Professionals ; Students
http://hdl.handle.net/2268/2769
10.1016/j.automatica.2010.02.006
http://www.montefiore.ulg.ac.be/~ernst

File(s) associated to this reference

Fulltext file(s):

FileCommentaryVersionSizeAccess
Open access
aut10.pdfAuthor postprint1.48 MBView/Open

Bookmark and Share SFX Query

All documents in ORBi are protected by a user license.