Dynamic Matching: Characterizing and Achieving Constant Regret

被引：1

作者：

Kerimov, Suleyman ^{[1
]}

Ashlagi, Itai ^{[2
]}

Gurvich, Itai ^{[3
]}

机构：

[1] Rice Univ, Jones Grad Sch Business, Houston, TX 77005 USA

[2] Stanford Univ, Dept Management Sci & Engn, Stanford, CA 94305 USA

[3] Northwestern Univ, Kellogg Sch Management, Evanston, IL 60208 USA

来源：

MANAGEMENT SCIENCE | 2024年 / 70卷 / 05期

基金：

美国国家科学基金会;

关键词：

dynamic matching; queueing; optimal control; DONATION;

D O I：

10.1287/mnsc.2021.01215

中图分类号：

C93 [管理学];

学科分类号：

12 ; 1201 ; 1202 ; 120202 ;

摘要：

We study how to optimally match agents in a dynamic matching market with heterogeneous match cardinalities and values. A network topology determines the feasible matches in the market. In general, a fundamental tradeoff exists between short-term value-which calls for performing matches frequently-and long-term value-which calls, sometimes, for delaying match decisions in order to perform better matches. We find that in networks that satisfy a general position condition, the tension between short- and long-term value is limited, and a simple periodic clearing policy (nearly) maximizes the total match value simultaneously at all times. Central to our results is the general position gap.; a proxy for capacity slack in the market. With the exception of trivial cases, no policy can achieve an all-time regret that is smaller, in terms of order, than epsilon(-1). We achieve this lower bound with a policy, which periodically resolves a natural matching integer linear program, provided that the delay between resolving periods is of the order of epsilon(-1). Examples illustrate the necessity of some delay to alleviate the tension between short- and long-term value.

引用

页码：2799 / 2822

页数：25

共 50 条

[41] Non-cooperative Target Assignment using Regret Matching
Kalam, Shemin
Gani, Mahbub
11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2010), 2010, : 787 - 792
[42] Neural Regret-Matching for Distributed Constraint Optimization Problems
Deng, Yanchen
Yu, Runshen
Wang, Xinrun
An, Bo
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 146 - 153
[43] Online Allocation and Pricing: Constant Regret via Bellman Inequalities
Vera, Alberto
Banerjee, Siddhartha
Gurvich, Itai
OPERATIONS RESEARCH, 2021, 69 (03) : 821 - 840
[44] Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
Papini, Matteo
Tirinzoni, Andrea
Pacchiano, Aldo
Restilli, Marcello
Lazaric, Alessandro
Pirotta, Matteo
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[45] Bandits with Stochastic Experts: Constant Regret, Empirical Experts and Episodes
Sharma, Nihal
Sen, Rajat
Basu, Soumya
Shanmugam, Karthikeyan
Shakkottai, Sanjay
ACM TRANSACTIONS ON MODELING AND PERFORMANCE EVALUATION OF COMPUTING SYSTEMS, 2024, 9 (03)
[46] Constant or Logarithmic Regret in Asynchronous Multiplayer Bandits with Limited Communication
Richard, Hugo
Boursier, Etienne
Perchet, Vianney
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
[47] Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret
Huang, Jiawei
Zhao, Li
Qin, Tao
Chen, Wei
Jiang, Nan
Liu, Tie-Yan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[48] Dynamic Regret of Online Markov Decision Processes
Zhao, Peng
Li, Long-Fei
Zhou, Zhi-Hua
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[49] Distributed Estimation of Dynamic Parameters : Regret Analysis
Shahrampour, Shahin
Rakhlin, Alexander
Jadbabaie, Ali
2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 1066 - 1071
[50] Unconstrained Dynamic Regret via Sparse Coding
Zhang, Zhiyu
Cutkosky, Ashok
Paschalidis, Ioannis Ch.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,

← 1 2 3 4 5 →