Improved Bandits in Many-to-One Matching Markets with Incentive Compatibility

被引:0
|
作者
Kong, Fang [1 ]
Li, Shuai [1 ]
机构
[1] Shanghai Jiao Tong Univ, John Hopcroft Ctr Comp Sci, Shanghai, Peoples R China
来源
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 12 | 2024年
基金
中国国家自然科学基金;
关键词
COLLEGE ADMISSIONS; STABILITY; TUITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Two-sided matching markets have been widely studied in the literature due to their rich applications. Since participants are usually uncertain about their preferences, online algorithms have recently been adopted to learn them through iterative interactions. An existing work initiates the study of this problem in a many-to-one setting with responsiveness. However, their results are far from optimal and lack guarantees of incentive compatibility. We first extend an existing algorithm for the one-to-one setting to this more general setting and show it achieves a near-optimal bound for player-optimal regret. Nevertheless, due to the substantial requirement for collaboration, a single player's deviation could lead to a huge increase in its own cumulative rewards and a linear regret for others. In this paper, we aim to enhance the regret bound in many-to-one markets while ensuring incentive compatibility. We first propose the adaptively explore-then-deferred-acceptance (AETDA) algorithm for responsiveness setting and derive an upper bound for player-optimal stable regret while demonstrating its guarantee of incentive compatibility. This result is a significant improvement over existing works. And to the best of our knowledge, it constitutes the first player-optimal guarantee in matching markets that offers such robust assurances. We also consider broader substitutable preferences, one of the most general conditions to ensure the existence of a stable matching and cover responsiveness. We devise an online DA (ODA) algorithm and establish an upper bound for the player-pessimal stable regret for this setting.
引用
收藏
页码:13256 / 13264
页数:9
相关论文
共 50 条
  • [41] An analysis of confusion errors in many-to-one matching with temporal and nontemporal samples
    Santi A.
    Stanford L.
    Symons J.
    Animal Cognition, 1998, 1 (1) : 37 - 46
  • [42] Many-to-one boundary labeling
    Department of Electrical Engineering, National Taiwan University, Taipei 106, Taiwan
    不详
    J. Graph Algorithms and Appl., 2008, 3 (319-356):
  • [43] Many-to-one boundary labeling
    Kao, Hao-Jen
    Lin, Chun-Cheng
    Yen, Hsu-Chun
    ASIA-PACIFIC SYMPOSIUM ON VISUALISATION 2007, PROCEEDINGS, 2007, : 65 - +
  • [44] Many-to-one stable matching for taxi-sharing service with selfish players
    Peng, Zixuan
    Shan, Wenxuan
    Zhu, Xiaoning
    Yu, Bin
    TRANSPORTATION RESEARCH PART A-POLICY AND PRACTICE, 2022, 160 : 255 - 279
  • [45] EVIDENCE FOR COMMON CODING IN MANY-TO-ONE MATCHING - RETENTION, INTERTRIAL INTERFERENCE, AND TRANSFER
    URCUIOLI, PJ
    ZENTALL, TR
    JACKSONSMITH, P
    STEIRN, JN
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-ANIMAL BEHAVIOR PROCESSES, 1989, 15 (03): : 264 - 273
  • [46] Contractor Selection for Defense Acquisition with Advanced Many-to-One Stable Matching Model
    Wei, Hechuan
    Shi, Jianmai
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 1872 - 1878
  • [47] Asymmetrical sample training and asymmetrical retention functions in one-to-one and many-to-one matching in pigeons
    Grant, Douglas S.
    LEARNING AND MOTIVATION, 2006, 37 (03) : 209 - 229
  • [48] Group incentive compatibility for matching with contracts
    Hatfield, John William
    Kojima, Fuhito
    GAMES AND ECONOMIC BEHAVIOR, 2009, 67 (02) : 745 - 749
  • [49] Emergent, untrained stimulus relations in many-to-one matching-to-sample discriminations in rats
    Nakagawa, E
    JOURNAL OF THE EXPERIMENTAL ANALYSIS OF BEHAVIOR, 2005, 83 (02) : 185 - 195
  • [50] Many-to-one matching based cooperative spectrum sharing in overlay cognitive radio networks
    Sharma, Meenakshi
    Sarma, Nityananda
    PHYSICAL COMMUNICATION, 2024, 65