Optimal Screening for Hepatocellular Carcinoma: A Restless Bandit Model

Cited by: 25
Authors
Lee, Elliot [1 ]
Lavieri, Marie S. [2 ]
Volk, Michael [3 ]
Affiliations
[1] Ctr Naval Anal, Arlington, VA 22201 USA
[2] Univ Michigan, Dept Ind & Operat Engn, Ann Arbor, MI 48109 USA
[3] Loma Linda Univ, Gastroenterol, Loma Linda, CA 92354 USA
Funding
National Science Foundation (USA);
Keywords
dynamic programming; healthcare management; simulation; medical decision making; multiarmed bandits; OPTIMIZATION; MANAGEMENT; PATIENT;
DOI
10.1287/msom.2017.0697
Chinese Library Classification
C93 [Management Science];
Discipline Codes
12; 1201; 1202; 120202;
Abstract
This paper seeks an efficient way to screen a population of patients at risk for hepatocellular carcinoma when (1) each patient's disease evolves stochastically and (2) limited screening resources are shared across the population. Recent medical discoveries have shown that biological information can be learned at each screening to differentiate patients into varying levels of cancer risk. We investigate how to exploit this knowledge when choosing which patients to screen, so as to maximize early-stage cancer detections while limiting resource usage. We model the problem as a family of restless bandits in which each patient's disease progression evolves as a partially observable Markov decision process. We derive an optimal policy for this problem and discuss managerial insights into what characterizes more effective screening. For numerical evidence, we use two independent data sets of over 800 patients: one to train the optimal policy, and the other to build a computer simulation that serves as a test bed for the policy. We show that our policy detects 22% more early-stage cancers than current practice while using the same amount of screening resources. We provide insights into the structure underlying our policy and discuss the implications of our findings.
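The belief-state screening model described in the abstract can be illustrated with a minimal sketch. All parameters and the policy below are hypothetical illustrations, not the paper's calibrated model or its optimal policy: each unscreened patient's belief of harboring cancer drifts upward under a simple progression update, and a myopic index rule screens the budget-many patients with the highest current belief, a crude stand-in for a restless-bandit index policy.

```python
# Minimal sketch of belief-based screening under a resource budget.
# Hypothetical two-state disease model; parameters are invented for
# illustration and are NOT taken from the paper.

P_PROGRESS = 0.05  # assumed per-period P(healthy -> cancer)

def belief_update(b, progress=P_PROGRESS):
    """One-period update of a patient's P(cancer) when NOT screened:
    probability mass remaining in the healthy state may progress."""
    return b + (1.0 - b) * progress

def choose_screens(beliefs, budget):
    """Myopic index rule: screen the `budget` patients with the highest
    current cancer belief (ties broken by patient index)."""
    ranked = sorted(range(len(beliefs)), key=lambda i: (-beliefs[i], i))
    return ranked[:budget]

beliefs = [0.02, 0.30, 0.11, 0.07]
print(choose_screens(beliefs, 2))  # -> [1, 2]
# Unscreened patients' beliefs drift upward before the next period.
screened = set(choose_screens(beliefs, 2))
beliefs = [b if i in screened else belief_update(b)
           for i, b in enumerate(beliefs)]
```

The paper's actual policy is derived from the restless-bandit formulation rather than this one-step greedy rule; the sketch only conveys the interplay of per-patient belief dynamics and a shared screening budget.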
Pages: 198-212
Page count: 15
Related Papers
50 records in total
  • [1] Optimal selection of obsolescence mitigation strategies using a restless bandit model
    Kumar, U. Dinesh
    Saranga, Haritha
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2010, 200 (01) : 170 - 180
  • [2] Optimal Myopic Policy for Restless Bandit: A Perspective of Eigendecomposition
    Wang, Kehao
    Yu, Jihong
    Chen, Lin
    Zhou, Pan
    Win, Moe Z.
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (03) : 420 - 433
  • [3] A Restless Bandit Model for Resource Allocation, Competition, and Reservation
    Fu, Jing
    Moran, Bill
    Taylor, Peter G.
    OPERATIONS RESEARCH, 2022, 70 (01) : 416 - 431
  • [4] Optimal learning dynamics of multiagent system in restless multiarmed bandit game
    Nakayama, Kazuaki
    Nakamura, Ryuzo
    Hisakado, Masato
    Mori, Shintaro
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2020, 549
  • [5] Optimal Policies for Observing Time Series and Related Restless Bandit Problems
    Dance, Christopher R.
    Silander, Tomi
    JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [6] A restless bandit model for dynamic ride matching with reneging travelers
    Fu, Jing
    Zhang, Lele
    Liu, Zhiyuan
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2025, 320 (03) : 581 - 592
  • [7] Approximations of the Restless Bandit Problem
    Grunewalder, Steffen
    Khaleghi, Azadeh
    JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [8] Parameter and Model Recovery of Reinforcement Learning Models for Restless Bandit Problems
    Danwitz, L.
    Mathar, D.
    Smith, E.
    Tuzsus, D.
    Peters, J.
    COMPUTATIONAL BRAIN & BEHAVIOR, 2022, 5 (4) : 547 - 563