SIMULTANEOUS STATE ESTIMATION AND LEARNING IN REPEATED COURNOT GAMES

Cited by: 6

Authors:

Kebriaei, Hamed [1]

Ahmadabadi, Majid Nili [1,2]

Rahimi-Kian, Ashkan [1]

Affiliations:

[1] Univ Tehran, Control & Intelligent Proc Ctr Excellence, Sch ECE, Tehran, Iran

[2] Inst Res Fundamental Sci, Sch Cognit Sci, Tehran, Iran

Source:

APPLIED ARTIFICIAL INTELLIGENCE | 2014 / Vol. 28 / Issue 01

Keywords:

SIMPLE DYNAMIC-MODEL; PEOPLE PLAY GAMES; BIDDING STRATEGIES; AGENTS;

DOI:

10.1080/08839514.2014.862774

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This article proposes an intelligent agent that can make sound decisions in a repeated Cournot game with incomplete information, in which neither the market model nor the competitors' decision models are known to the players. The proposed agent combines the k-nearest neighbor (KNN) method with a Bayes classifier to predict its rivals' next actions from the market decision history. The agent takes the predicted actions as an estimate of its next state and learns the expected payoff of its state-action pairs interactively using a reinforcement learning (RL) algorithm. Results are presented for the proposed agent's competition with two benchmark competitors in different simulated Cournot games. The simulation results show that the proposed agent earns significantly higher payoffs than the two benchmark agents.
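The predict-then-learn loop described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives no equations, so the linear demand model, the quantity grid, the myopic best-response rival, and all names (`payoff`, `rival_policy`, `knn_predict`, `run`) are assumptions made for illustration. A plain majority vote among the k nearest past contexts stands in for the paper's combination of KNN and a Bayes classifier, and a running-average update stands in for its RL algorithm.

```python
import heapq
import random
from collections import Counter, defaultdict

QUANTITIES = [0, 10, 20, 30, 40]  # assumed discrete output levels


def payoff(q_own, q_rival, a=100.0, b=1.0, c=10.0):
    """Profit under an assumed linear inverse demand P = a - b * (total output).
    The learning agent only observes this value, never a, b, or c."""
    price = max(0.0, a - b * (q_own + q_rival))
    return (price - c) * q_own


def rival_policy(last_agent_q, a=100.0, b=1.0, c=10.0):
    """Hypothetical rival: best response to the agent's previous quantity,
    snapped to the nearest grid point."""
    target = (a - c - b * last_agent_q) / (2 * b)
    return min(QUANTITIES, key=lambda q: abs(q - target))


def knn_predict(history, context, k=5):
    """Predict the rival's next quantity by majority vote among the k past
    contexts closest (squared Euclidean distance) to the current context."""
    if not history:
        return QUANTITIES[0]
    nearest = heapq.nsmallest(
        k,
        history,
        key=lambda rec: sum((x - y) ** 2 for x, y in zip(rec[0], context)),
    )
    return Counter(nxt for _, nxt in nearest).most_common(1)[0][0]


def run(rounds=500, alpha=0.1, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    q_table = defaultdict(float)  # (predicted rival q, own q) -> payoff estimate
    history = []                  # ((agent q, rival q) at t-1, rival q at t)
    context = (QUANTITIES[0], QUANTITIES[0])
    hits = 0
    for _ in range(rounds):
        # The predicted rival action serves as the agent's state estimate.
        state = knn_predict(history, context)
        if rng.random() < epsilon:  # epsilon-greedy exploration
            action = rng.choice(QUANTITIES)
        else:
            action = max(QUANTITIES, key=lambda q: q_table[(state, q)])
        rival_q = rival_policy(context[0])
        reward = payoff(action, rival_q)
        # Move the expected-payoff estimate of this state-action pair
        # toward the observed payoff.
        q_table[(state, action)] += alpha * (reward - q_table[(state, action)])
        hits += state == rival_q
        history.append((context, rival_q))
        context = (action, rival_q)
    return q_table, hits / rounds


if __name__ == "__main__":
    table, accuracy = run()
    print(f"rival-action prediction accuracy: {accuracy:.2f}")
```

Here `run` returns the learned payoff table together with the fraction of rounds in which the predicted rival quantity matched the realized one. Swapping `rival_policy` for a different hypothetical opponent shows how the state estimate and the learned payoffs shift with the rival's behavior.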


Pages: 66 - 89

Page count: 24

50 entries in total

[1] Cost Structures and Nash Play in Repeated Cournot Games
Douglas D. Davis
Robert J. Reilly
Bart J. Wilson
Experimental Economics, 2003, 6 (2) : 209 - 226
[2] Adaptation and convergence of behavior in repeated experimental Cournot games
Rassenti, S
Reynolds, SS
Smith, VL
Szidarovszky, F
JOURNAL OF ECONOMIC BEHAVIOR & ORGANIZATION, 2000, 41 (02) : 117 - 146
[3] Model-Based and Learning-Based Decision Making in Incomplete Information Cournot Games: A State Estimation Approach
Kebriaei, Hamed
Rahimi-Kian, Ashkan
Ahmadabadi, Majid Nili
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2015, 45 (04) : 713 - 718
[4] Social aspiration reinforcement learning in Cournot games
Fatas, Enrique
Morales, Antonio J.
Jaramillo-Gutierrez, Ainhoa
ECONOMIC THEORY, 2024,
[5] Repeated Stackelberg security games: Learning with incomplete state information
Alcantara-Jimenez, Guillermo
Clempner, Julio B.
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2020, 195
[6] Learning the state of nature in repeated games with incomplete information and signals
Renault, J
Tomala, T
GAMES AND ECONOMIC BEHAVIOR, 2004, 47 (01) : 124 - 156
[7] Multi-Agent Reinforcement Learning in Cournot Games
Shi, Yuanyuan
Zhang, Baosen
2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 3561 - 3566
[8] Learning the demand function in a repeated Cournot oligopoly game
Bischi, Gian-Italo
Sbragia, Lucia
Szidarovszky, Ferenc
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2008, 39 (04) : 403 - 419
[9] Learning aspiration in repeated games
Cho, IK
Matsui, A
JOURNAL OF ECONOMIC THEORY, 2005, 124 (02) : 171 - 201
[10] BAYESIAN LEARNING IN REPEATED GAMES
JORDAN, JS
GAMES AND ECONOMIC BEHAVIOR, 1995, 9 (01) : 8 - 20
