Improved Bandits in Many-to-One Matching Markets with Incentive Compatibility

被引:0
|
作者
Kong, Fang [1 ]
Li, Shuai [1 ]
机构
[1] Shanghai Jiao Tong Univ, John Hopcroft Ctr Comp Sci, Shanghai, Peoples R China
来源
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 12 | 2024年
基金
中国国家自然科学基金;
关键词
COLLEGE ADMISSIONS; STABILITY; TUITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Two-sided matching markets have been widely studied in the literature due to their rich applications. Since participants are usually uncertain about their preferences, online algorithms have recently been adopted to learn them through iterative interactions. An existing work initiates the study of this problem in a many-to-one setting with responsiveness. However, their results are far from optimal and lack guarantees of incentive compatibility. We first extend an existing algorithm for the one-to-one setting to this more general setting and show it achieves a near-optimal bound for player-optimal regret. Nevertheless, due to the substantial requirement for collaboration, a single player's deviation could lead to a huge increase in its own cumulative rewards and a linear regret for others. In this paper, we aim to enhance the regret bound in many-to-one markets while ensuring incentive compatibility. We first propose the adaptively explore-then-deferred-acceptance (AETDA) algorithm for responsiveness setting and derive an upper bound for player-optimal stable regret while demonstrating its guarantee of incentive compatibility. This result is a significant improvement over existing works. And to the best of our knowledge, it constitutes the first player-optimal guarantee in matching markets that offers such robust assurances. We also consider broader substitutable preferences, one of the most general conditions to ensure the existence of a stable matching and cover responsiveness. We devise an online DA (ODA) algorithm and establish an upper bound for the player-pessimal stable regret for this setting.
引用
收藏
页码:13256 / 13264
页数:9
相关论文
共 50 条
  • [1] Sequential entry in many-to-one matching markets
    Elette Boyle
    Federico Echenique
    Social Choice and Welfare, 2009, 33 : 87 - 99
  • [2] Bandit Learning in Many-to-One Matching Markets
    Wang, Zilong
    Guo, Liya
    Yin, Junming
    Li, Shuai
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 2088 - 2097
  • [3] Sequential entry in many-to-one matching markets
    Boyle, Elette
    Echenique, Federico
    SOCIAL CHOICE AND WELFARE, 2009, 33 (01) : 87 - 99
  • [4] Many-to-one matching markets with externalities among firms
    Bando, Keisuke
    JOURNAL OF MATHEMATICAL ECONOMICS, 2012, 48 (01) : 14 - 20
  • [5] Core and stability notions in many-to-one matching markets with indifferences
    Bonifacio, Agustin G.
    Juarez, Noelia
    Neme, Pablo
    Oviedo, Jorge
    INTERNATIONAL JOURNAL OF GAME THEORY, 2024, 53 (01) : 143 - 157
  • [6] Core and stability notions in many-to-one matching markets with indifferences
    Agustín G. Bonifacio
    Noelia Juarez
    Pablo Neme
    Jorge Oviedo
    International Journal of Game Theory, 2024, 53 : 143 - 157
  • [7] Search Frictions in Many-to-one Markets
    Xu, Menghan
    JOURNAL OF INDUSTRIAL ECONOMICS, 2024, 72 (01): : 284 - 326
  • [8] Improved many-to-one sweeping
    Scott, MA
    Benzley, SE
    Owen, SJ
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 2006, 65 (03) : 332 - 348
  • [9] FAST MANY-TO-ONE MATCHING ALGORITHMS
    PURDOM, PW
    BROWN, CA
    LECTURE NOTES IN COMPUTER SCIENCE, 1985, 202 : 407 - 416
  • [10] Incentive Compatibility of Large Centralized Matching Markets
    Lee, Sangmok
    REVIEW OF ECONOMIC STUDIES, 2017, 84 (01): : 444 - 463