Online Sparse Bandit for Card Games

Lupien St-Pierre, David; Louveaux, Quentin; Teytaud, Olivier

Request a copy

Paper published in a journal (Scientific congresses and symposiums)

Online Sparse Bandit for Card Games

Lupien St-Pierre, David; Louveaux, Quentin; Teytaud, Olivier

2011 • In Advance in Computer Games

Peer reviewed

Permalink
https://hdl.handle.net/2268/120275

Files (1)Send to Details Statistics Bibliography Similar publications

Files

Full Text

acg2011.pdf

Publisher postprint (168.14 kB)

Request a copy

All documents in ORBi are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Abstract :

[en] Finding an approximation of a Nash equilibria in matrix games is an important topic that reaches beyond the strict application to matrix games. A bandit algorithm commonly used to approximate a Nash equilibrium is EXP3. However, the solution to many problems is often sparse, yet EXP3 inherently fails to exploit this property. To the knowledge of the authors, there exist only an offline truncation to tackle such issue. In this paper, we propose a variation of EXP3 to exploit the fact that solution is sparse by dynamically removing arms; the resulting algorithm empirically performs better than previous versions. We apply the resulting algorithm to a MCTS program for the Urban Rivals card game.

Disciplines :

Computer science

Author, co-author :

Lupien St-Pierre, David ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Dép. d'électric., électron. et informat. (Inst.Montefiore)

Louveaux, Quentin ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Système et modélisation : Optimisation discrète

Teytaud, Olivier

Language :

English

Title :

Online Sparse Bandit for Card Games

Publication date :

2011

Event name :

13th Conference in Advances in Computer Games

Event date :

September 2011

Audience :

International

Journal title :

Advance in Computer Games

Peer reviewed :

Peer reviewed

Available on ORBi :

since 09 May 2012

Statistics

Number of views

99 (15 by ULiège)

Number of downloads

2 (2 by ULiège)

More statistics