Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

On a Dynamical Analysis of Reinforcement Learning in Games: Emergence of Occam’s Razor

Karl TuylsContact Information, Katja VerbeeckContact Information and Sam MaesContact Information

(3)  Department of Computer Science, CoMo, VUB, Belgium
Abstract
Modeling learning agents in the context of Multi-agent Systems requires an adequate understanding of their dynamic behaviour. Usually, these agents are modeled similar to the different players in a standard game theoretical model. Unfortunately traditional Game Theory is static and limited in its usefelness.
Evolutionary Game Theory improves on this by providing a dynamics which describes how strategies evolve over time. In this paper, we discuss three learning models whose dynamics are related to the Replicator Dynamics(RD). We show how a classical Reinforcement Learning(RL) technique, i.e. Q-learning relates to the RD. This allows to better understand the learning process and it allows to determine how complex a RL model should be. More precisely, Occam’s Razor applies in the framework of games, i.e. the simplest model (Cross) suffices for learning equilibria. An experimental verification in all three models is presented.
Author funded by a doctoral grant of the institute for advancement of scientific technological research in Flanders (IWT).

Contact Information Karl Tuyls
Email: ktuyls@vub.ac.be

Contact Information Katja Verbeeck
Email: kaverbee@vub.ac.be

Contact Information Sam Maes
Email: sammaes@vub.ac.be
Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.



Export this chapter
Export this chapter as RIS | Text
 
Remote Address: 38.107.191.107 • Server: MPWEB26
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)