Dynamic Matching: Characterizing and Achieving Constant Regret

被引:1
|
作者
Kerimov, Suleyman [1 ]
Ashlagi, Itai [2 ]
Gurvich, Itai [3 ]
机构
[1] Rice Univ, Jones Grad Sch Business, Houston, TX 77005 USA
[2] Stanford Univ, Dept Management Sci & Engn, Stanford, CA 94305 USA
[3] Northwestern Univ, Kellogg Sch Management, Evanston, IL 60208 USA
基金
美国国家科学基金会;
关键词
dynamic matching; queueing; optimal control; DONATION;
D O I
10.1287/mnsc.2021.01215
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
We study how to optimally match agents in a dynamic matching market with heterogeneous match cardinalities and values. A network topology determines the feasible matches in the market. In general, a fundamental tradeoff exists between short-term value-which calls for performing matches frequently-and long-term value-which calls, sometimes, for delaying match decisions in order to perform better matches. We find that in networks that satisfy a general position condition, the tension between short- and long-term value is limited, and a simple periodic clearing policy (nearly) maximizes the total match value simultaneously at all times. Central to our results is the general position gap.; a proxy for capacity slack in the market. With the exception of trivial cases, no policy can achieve an all-time regret that is smaller, in terms of order, than epsilon(-1). We achieve this lower bound with a policy, which periodically resolves a natural matching integer linear program, provided that the delay between resolving periods is of the order of epsilon(-1). Examples illustrate the necessity of some delay to alleviate the tension between short- and long-term value.
引用
收藏
页码:2799 / 2822
页数:25
相关论文
共 50 条
  • [41] Non-cooperative Target Assignment using Regret Matching
    Kalam, Shemin
    Gani, Mahbub
    11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2010), 2010, : 787 - 792
  • [42] Neural Regret-Matching for Distributed Constraint Optimization Problems
    Deng, Yanchen
    Yu, Runshen
    Wang, Xinrun
    An, Bo
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 146 - 153
  • [43] Online Allocation and Pricing: Constant Regret via Bellman Inequalities
    Vera, Alberto
    Banerjee, Siddhartha
    Gurvich, Itai
    OPERATIONS RESEARCH, 2021, 69 (03) : 821 - 840
  • [44] Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
    Papini, Matteo
    Tirinzoni, Andrea
    Pacchiano, Aldo
    Restilli, Marcello
    Lazaric, Alessandro
    Pirotta, Matteo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [45] Bandits with Stochastic Experts: Constant Regret, Empirical Experts and Episodes
    Sharma, Nihal
    Sen, Rajat
    Basu, Soumya
    Shanmugam, Karthikeyan
    Shakkottai, Sanjay
    ACM TRANSACTIONS ON MODELING AND PERFORMANCE EVALUATION OF COMPUTING SYSTEMS, 2024, 9 (03)
  • [46] Constant or Logarithmic Regret in Asynchronous Multiplayer Bandits with Limited Communication
    Richard, Hugo
    Boursier, Etienne
    Perchet, Vianney
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [47] Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret
    Huang, Jiawei
    Zhao, Li
    Qin, Tao
    Chen, Wei
    Jiang, Nan
    Liu, Tie-Yan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [48] Dynamic Regret of Online Markov Decision Processes
    Zhao, Peng
    Li, Long-Fei
    Zhou, Zhi-Hua
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [49] Distributed Estimation of Dynamic Parameters : Regret Analysis
    Shahrampour, Shahin
    Rakhlin, Alexander
    Jadbabaie, Ali
    2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 1066 - 1071
  • [50] Unconstrained Dynamic Regret via Sparse Coding
    Zhang, Zhiyu
    Cutkosky, Ashok
    Paschalidis, Ioannis Ch.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,