Improved Bandits in Many-to-One Matching Markets with Incentive Compatibility

被引：0

作者：

Kong, Fang ^{[1
]}

Li, Shuai ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, John Hopcroft Ctr Comp Sci, Shanghai, Peoples R China

来源：

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 12 | 2024年

基金：

中国国家自然科学基金;

关键词：

COLLEGE ADMISSIONS; STABILITY; TUITION;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Two-sided matching markets have been widely studied in the literature due to their rich applications. Since participants are usually uncertain about their preferences, online algorithms have recently been adopted to learn them through iterative interactions. An existing work initiates the study of this problem in a many-to-one setting with responsiveness. However, their results are far from optimal and lack guarantees of incentive compatibility. We first extend an existing algorithm for the one-to-one setting to this more general setting and show it achieves a near-optimal bound for player-optimal regret. Nevertheless, due to the substantial requirement for collaboration, a single player's deviation could lead to a huge increase in its own cumulative rewards and a linear regret for others. In this paper, we aim to enhance the regret bound in many-to-one markets while ensuring incentive compatibility. We first propose the adaptively explore-then-deferred-acceptance (AETDA) algorithm for responsiveness setting and derive an upper bound for player-optimal stable regret while demonstrating its guarantee of incentive compatibility. This result is a significant improvement over existing works. And to the best of our knowledge, it constitutes the first player-optimal guarantee in matching markets that offers such robust assurances. We also consider broader substitutable preferences, one of the most general conditions to ensure the existence of a stable matching and cover responsiveness. We devise an online DA (ODA) algorithm and establish an upper bound for the player-pessimal stable regret for this setting.

引用

页码：13256 / 13264

页数：9

共 50 条

[41] An analysis of confusion errors in many-to-one matching with temporal and nontemporal samples
Santi A.
Stanford L.
Symons J.
Animal Cognition, 1998, 1 (1) : 37 - 46
[42] Many-to-one boundary labeling
Department of Electrical Engineering, National Taiwan University, Taipei 106, Taiwan
不详
J. Graph Algorithms and Appl., 2008, 3 (319-356):
[43] Many-to-one boundary labeling
Kao, Hao-Jen
Lin, Chun-Cheng
Yen, Hsu-Chun
ASIA-PACIFIC SYMPOSIUM ON VISUALISATION 2007, PROCEEDINGS, 2007, : 65 - +
[44] Many-to-one stable matching for taxi-sharing service with selfish players
Peng, Zixuan
Shan, Wenxuan
Zhu, Xiaoning
Yu, Bin
TRANSPORTATION RESEARCH PART A-POLICY AND PRACTICE, 2022, 160 : 255 - 279
[45] EVIDENCE FOR COMMON CODING IN MANY-TO-ONE MATCHING - RETENTION, INTERTRIAL INTERFERENCE, AND TRANSFER
URCUIOLI, PJ
ZENTALL, TR
JACKSONSMITH, P
STEIRN, JN
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-ANIMAL BEHAVIOR PROCESSES, 1989, 15 (03): : 264 - 273
[46] Contractor Selection for Defense Acquisition with Advanced Many-to-One Stable Matching Model
Wei, Hechuan
Shi, Jianmai
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 1872 - 1878
[47] Asymmetrical sample training and asymmetrical retention functions in one-to-one and many-to-one matching in pigeons
Grant, Douglas S.
LEARNING AND MOTIVATION, 2006, 37 (03) : 209 - 229
[48] Group incentive compatibility for matching with contracts
Hatfield, John William
Kojima, Fuhito
GAMES AND ECONOMIC BEHAVIOR, 2009, 67 (02) : 745 - 749
[49] Emergent, untrained stimulus relations in many-to-one matching-to-sample discriminations in rats
Nakagawa, E
JOURNAL OF THE EXPERIMENTAL ANALYSIS OF BEHAVIOR, 2005, 83 (02) : 185 - 195
[50] Many-to-one matching based cooperative spectrum sharing in overlay cognitive radio networks
Sharma, Meenakshi
Sarma, Nityananda
PHYSICAL COMMUNICATION, 2024, 65

← 1 2 3 4 5 →