A unified framework for stochastic optimization

被引:152
|
作者
Powell, Warren B. [1 ]
机构
[1] Princeton Univ, Dept Operat Res & Financial Engn, Sherrerd Hall, Princeton, NJ 08544 USA
基金
美国国家科学基金会;
关键词
Dynamic programming; Stochastic programming; Bandit problems; Reinforcement learning; Robust optimization; Simulation optimization; OBSERVABLE MARKOV-PROCESSES; MODEL-PREDICTIVE CONTROL; OPTIMAL STOPPING-TIMES; ROBUST OPTIMIZATION; LINEAR-PROGRAMS; GLOBAL OPTIMIZATION; KNOWLEDGE-GRADIENT; BUDGET ALLOCATION; SIMULATION; RISK;
D O I
10.1016/j.ejor.2018.07.014
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Stochastic optimization is an umbrella term that includes over a dozen fragmented communities, using a patchwork of sometimes overlapping notational systems with algorithmic strategies that are suited to specific classes of problems. This paper reviews the canonical models of these communities, and proposes a universal modeling framework that encompasses all of these competing approaches. At the heart is an objective function that optimizes over policies that is standard in some approaches, but foreign to others. We then identify four meta-classes of policies that encompasses all of the approaches that we have identified in the research literature or industry practice. In the process, we observe that any adaptive learning algorithm, whether it is derivative-based or derivative-free, is a form of policy that can be tuned to optimize either the cumulative reward (similar to multi-armed bandit problems) or final reward (as is used in ranking and selection or stochastic search). We argue that the principles of bandit problems, long a niche community, should become a core dimension of mainstream stochastic optimization. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:795 / 821
页数:27
相关论文
共 50 条
  • [1] Reinforcement Learning and Stochastic Optimization: A Unified Framework for Sequential Decisions
    Halperin, Igor
    [J]. QUANTITATIVE FINANCE, 2022, 22 (12) : 2151 - 2154
  • [2] A Unified q-Memorization Framework for Asynchronous Stochastic Optimization
    Gu, Bin
    Xian, Wenhan
    Huo, Zhouyuan
    Deng, Cheng
    Huang, Heng
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [3] Unified Algorithm Framework for Nonconvex Stochastic Optimization in Deep Neural Networks
    Zhu, Yini
    Iiduka, Hideaki
    [J]. IEEE ACCESS, 2021, 9 : 143807 - 143823
  • [4] A unified stochastic framework for robust topology optimization of continuum and truss-like structures
    Richardson, J. N.
    Coelho, R. Filomeno
    Adriaenssens, S.
    [J]. ENGINEERING OPTIMIZATION, 2016, 48 (02) : 334 - 350
  • [5] A unified simulation framework for spatial stochastic models
    Mayer, J
    Schmidt, V
    Schweiggert, F
    [J]. SIMULATION MODELLING PRACTICE AND THEORY, 2004, 12 (05) : 307 - 326
  • [6] Unified framework for quasispecies evolution and stochastic quantization
    Bianconi, Ginestra
    Rahmede, Christoph
    [J]. PHYSICAL REVIEW E, 2011, 83 (05):
  • [7] A unified stochastic approximation framework for learning in games
    Panayotis Mertikopoulos
    Ya-Ping Hsieh
    Volkan Cevher
    [J]. Mathematical Programming, 2024, 203 : 559 - 609
  • [8] A unified stochastic approximation framework for learning in games
    Mertikopoulos, Panayotis
    Hsieh, Ya-Ping
    Cevher, Volkan
    [J]. MATHEMATICAL PROGRAMMING, 2024, 203 (1-2) : 559 - 609
  • [9] A unified framework for schedule and storage optimization
    Thies, W
    Vivien, F
    Sheldon, J
    Amarasinghe, S
    [J]. ACM SIGPLAN NOTICES, 2001, 36 (05) : 232 - 242
  • [10] A unified optimization framework for microelectronics industry
    Li, Yiming
    Chen, Cheng-Kai
    Cho, Yen-Yu
    [J]. GECCO 2006: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOL 1 AND 2, 2006, : 1875 - +