Paper published in a journal (Scientific congresses and symposiums)
Online Sparse Bandit for Card Games
Lupien St-Pierre, David; Louveaux, Quentin; Teytaud, Olivier
2011In Advance in Computer Games
Peer reviewed
 

Files


Full Text
acg2011.pdf
Publisher postprint (168.14 kB)
Request a copy

All documents in ORBi are protected by a user license.

Send to



Details



Abstract :
[en] Finding an approximation of a Nash equilibria in matrix games is an important topic that reaches beyond the strict application to matrix games. A bandit algorithm commonly used to approximate a Nash equilibrium is EXP3. However, the solution to many problems is often sparse, yet EXP3 inherently fails to exploit this property. To the knowledge of the authors, there exist only an offline truncation to tackle such issue. In this paper, we propose a variation of EXP3 to exploit the fact that solution is sparse by dynamically removing arms; the resulting algorithm empirically performs better than previous versions. We apply the resulting algorithm to a MCTS program for the Urban Rivals card game.
Disciplines :
Computer science
Author, co-author :
Lupien St-Pierre, David ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Dép. d'électric., électron. et informat. (Inst.Montefiore)
Louveaux, Quentin ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Système et modélisation : Optimisation discrète
Teytaud, Olivier
Language :
English
Title :
Online Sparse Bandit for Card Games
Publication date :
2011
Event name :
13th Conference in Advances in Computer Games
Event date :
September 2011
Audience :
International
Journal title :
Advance in Computer Games
Peer reviewed :
Peer reviewed
Available on ORBi :
since 09 May 2012

Statistics


Number of views
99 (15 by ULiège)
Number of downloads
2 (2 by ULiège)

Bibliography


Similar publications



Contact ORBi