Welfare Maximization in Competitive Equilibrium: Reinforcement Learning for Markov Exchange Economy

Cited by: 0
Authors
Liu, Zhihan [1 ]
Lu, Miao [2 ]
Wang, Zhaoran [1 ]
Jordan, Michael I. [3 ]
Yang, Zhuoran [4 ]
Affiliations
[1] Northwestern Univ, Dept Ind Engn & Management Sci, Evanston, IL 60208 USA
[2] Univ Sci & Technol China, Sch Gifted Young, Hefei, Peoples R China
[3] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[4] Yale Univ, Dept Stat & Data Sci, New Haven, CT 06520 USA
Keywords
RESOURCE-ALLOCATION; STRATEGIES
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
We study a bilevel economic system, which we refer to as a Markov exchange economy (MEE), from the point of view of multi-agent reinforcement learning (MARL). An MEE involves a central planner and a group of self-interested agents. The goal of the agents is to form a competitive equilibrium (CE), where each agent myopically maximizes her own utility at each step. The goal of the central planner is to steer the system so as to maximize social welfare, defined as the sum of the utilities of all agents. Working in a setting where both the utility functions and the system dynamics are unknown, we propose to find the socially optimal policy and the CE from data via both online and offline variants of MARL. Concretely, we first devise a novel suboptimality metric tailored to the MEE setting, such that minimizing this metric certifies globally optimal policies for both the planner and the agents. Second, in the online setting, we propose an algorithm, dubbed MOLM, which combines the optimism principle for exploration with subgame CE seeking. The algorithm readily incorporates general function approximation tools for handling large state spaces and achieves sublinear regret. Finally, we adapt the algorithm to the offline setting based on the pessimism principle and establish an upper bound on the suboptimality.
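For concreteness, a minimal sketch of the planner's objective as described in the abstract (the horizon H, state s_h, and per-agent action a_{i,h} are illustrative notation assumed here, not taken from the paper): with n agents and per-step utilities u_{i,h}, the social welfare of a planner policy \pi can be written as

    W(\pi) = \mathbb{E}_{\pi}\Big[ \sum_{h=1}^{H} \sum_{i=1}^{n} u_{i,h}(s_h, a_{i,h}) \Big],

so the planner seeks a policy maximizing W(\pi), while at each step the agents' allocations form a competitive equilibrium given the current state.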
Pages: 42