Min Max Generalization for Deterministic Batch Mode Reinforcement Learning: Relaxation Schemes

Fonteneau, Raphaël

Speech/Talk (Diverse speeches and writings)

2013

Permalink
https://hdl.handle.net/2268/182290

Files

Full Text

29_11_2013_min_max@Maastricht.pdf

Author postprint (1.65 MB)

Details

Keywords :

Reinforcement Learning

Abstract :

We study the min max optimization problem introduced in [Fonteneau et al. (2011), "Towards min max reinforcement learning", Springer CCIS, vol. 129, pp. 61-77] for computing policies for batch mode reinforcement learning in a deterministic setting with a fixed, finite time horizon. First, we show that the min part of this problem is NP-hard. We then provide two relaxation schemes. The first works by dropping some constraints in order to obtain a problem that is solvable in polynomial time. The second, based on a Lagrangian relaxation in which all constraints are dualized, can also be solved in polynomial time. We also prove theoretically and illustrate empirically that both relaxation schemes provide better results than those given in [Fonteneau et al. (2011)].
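
The two relaxation ideas named in the abstract rest on a general property of constrained minimization: dropping constraints, or moving them into the objective with non-negative multipliers, can only lower the optimal value, so either relaxation yields a lower bound on the original minimum. The sketch below illustrates this on a toy problem. It is not the formulation from the talk: the grid `POINTS`, the objective `f`, the list `CONSTRAINTS`, and the multiplier grid are all hypothetical choices made for illustration, and brute-force enumeration stands in for the polynomial-time solvers the abstract refers to.

```python
from itertools import product

# Hypothetical toy problem (an assumption, not the problem from the talk):
# minimise f(x) over a small 2-D grid subject to g_i(x) <= 0 for every i.
GRID = [i / 4 for i in range(-8, 9)]      # coordinates in [-2, 2], step 0.25
POINTS = list(product(GRID, repeat=2))


def f(x):
    """Objective to minimise."""
    return x[0] + x[1]


CONSTRAINTS = [
    lambda x: 1.0 - x[0],                     # x0 >= 1
    lambda x: 0.5 - x[1],                     # x1 >= 0.5
    lambda x: x[0] ** 2 + x[1] ** 2 - 4.0,    # inside the disc of radius 2
]


def constrained_min(constraints):
    """Brute-force minimum of f over grid points satisfying the constraints."""
    feasible = [x for x in POINTS if all(g(x) <= 0 for g in constraints)]
    return min(f(x) for x in feasible)


def lagrangian_bound(multipliers):
    """Unconstrained minimum of f(x) + sum_i m_i * g_i(x) for fixed m_i >= 0."""
    return min(
        f(x) + sum(m * g(x) for m, g in zip(multipliers, CONSTRAINTS))
        for x in POINTS
    )


# Original problem: all constraints enforced.
exact = constrained_min(CONSTRAINTS)

# Relaxation 1: keep only the first constraint, drop the others.
dropped = constrained_min(CONSTRAINTS[:1])

# Relaxation 2: dualise all constraints, then take the best bound
# over a coarse, hypothetical grid of multipliers.
dual = max(
    lagrangian_bound(m) for m in product([0.0, 0.5, 1.0, 2.0], repeat=3)
)

print(exact, dropped, dual)
```

Both `dropped` and `dual` are guaranteed not to exceed `exact`; how tight each bound is depends on the problem and, for the Lagrangian bound, on how well the multipliers are chosen.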

Disciplines :

Computer science

Author, co-author :

Fonteneau, Raphaël ; Université de Liège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation

Language :

English

Title :

Min Max Generalization for Deterministic Batch Mode Reinforcement Learning: Relaxation Schemes

Publication date :

29 November 2013

Event name :

Dutch-Belgian Reinforcement Learning Workshop

Event place :

Maastricht, Netherlands

Event date :

29 November 2013

Audience :

International

Available on ORBi :

since 02 June 2015

Statistics

Number of views

22 (0 by ULiège)

Number of downloads

127 (0 by ULiège)