Intranet Tools

nb. next round of REF2013 will NOT be using data from eprints.ecs, but the central university REF interface.

RSS 1.0 Feed
RSS 2.0 Feed
Atom Feed
 

Planning Against Fictitious Players in Repeated Normal Form Games

Munoz de Cote, E. and Jennings, N. (2010) Planning Against Fictitious Players in Repeated Normal Form Games. In: 9th International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS2010), Toronto, Canada. pp. 1073-1080.

Download

[img]
Preview
Published Version
PDF

179Kb

Abstract

Planning how to interact against bounded memory and unbounded memory learning opponents needs different treatment. Thus far, however, work in this area has shown how to design plans against bounded memory learning opponents, but no work has dealt with the unbounded memory case. This paper tackles this gap. In particular, we frame this as a planning problem using the framework of repeated matrix games, where the planner's objective is to compute the best exploiting sequence of actions against a learning opponent. The particular class of opponent we study uses a fictitious play process to update her beliefs, but the analysis generalizes to many forms of Bayesian learning agents.

Our analysis is inspired by Banerjee and Peng's AIM framework, which works for planning and learning against bounded memory opponents (e.g an adaptive player). Building on this, we show how an unbounded memory opponent (specifically a fictitious player) can also be modelled as a finite MDP and present a new efficient algorithm that can find a way to exploit the opponent by computing in polynomial time a sequence of play that can obtain a higher average reward than those obtained by playing a game theoretic (Nash or correlated) equilibrium.

Item Type:Conference or Workshop Item
Creator/Authors:
Enrique Munoz de Cote
Nick Jennings
Research Group:Old ECS Groups > Intelligence, Agents, Multimedia
Current ECS Groups > Agents, Interaction and Complexity
Date:2010
Information about this record:
Citations:Google Scholar: 3
Downloads (2010):75
ID Code:18481
Last Modified:23 Sep 2011 10:38
Deposited On:08 Feb 2010 18:16 by Munoz De Cote Flores Luna, Jose Enrique

Tools & Metadata

Download Statistics

Last month

Last year

Members of ECS may view the download statistics dashboard for this record.

Corrections

ECS staff and postgraduates may modify this record

  Welcome from Deputy Head of School (Research) Research Prospectus Industrial Partnerships New Research Students Notes for Guidance New Research Students Notes for Guidance
The ECS EPrints Repository supports OAI 2.0 with a base URL of http://eprints.ecs.soton.ac.uk/cgi/oai2

EPrints is free software developed by the University of Southampton to facilitate Open Access to research.
EPrints