Abstract
The analysis of the collective behavior of agents in a distributed
multi-agent environment received a lot of attention in the past decade. More
accurately, coordination was studied intensely because it enables agents to
converge to Pareto optimal solutions and Nash equilibria. Most of these
studies focussed on team games. In this paper we report on a technique for
finding fair solutions in conflicting interest multi-stage games. Our
hierarchical periodic policies algorithm is based on the characteristics of a
homo egualis society in which the players also care about the proportional
distribution of the pay-off in relation to the pay-off of the other players.
This feature is built into a hierarchy of learning automata which is suited
for playing sequential decision problems.
Original language | English |
---|---|
Title of host publication | Knowledge-Based Intelligent Information and Engineering Systems |
Subtitle of host publication | 12th International Conference, KES 2008 |
Editors | Ignic Lovrek, Robert J. Howlett, Lakhmi C. Jain |
Place of Publication | Heidelberg |
Publisher | Springer |
Pages | 391-402 |
Volume | II |
ISBN (Print) | 978-3-540-85564-4 |
DOIs | |
Publication status | Published - 2008 |
MoE publication type | A4 Article in a conference publication |
Event | 12th International Conference on Knowledge-Based Intelligent Information and Engineering Systems, KES-2008 - Zagreb, Croatia Duration: 3 Sept 2008 → 5 Sept 2008 Conference number: 12 |
Publication series
Series | Lecture Notes in Computer Science |
---|---|
Volume | 5178 |
ISSN | 0302-9743 |
Conference
Conference | 12th International Conference on Knowledge-Based Intelligent Information and Engineering Systems, KES-2008 |
---|---|
Abbreviated title | KES-2008 |
Country/Territory | Croatia |
City | Zagreb |
Period | 3/09/08 → 5/09/08 |
Keywords
- Learning automata
- reinforcement learning
- policy gradient
- multi-agent systems
- exploration