Missile defense and interceptor allocation by neuro-dynamic programming

被引:72
|
作者
Bertsekas, DP [1 ]
Homer, ML
Logan, DA
Patek, SD
Sandell, NR
机构
[1] MIT, Informat & Decis Syst Lab, Cambridge, MA 02139 USA
[2] Biogen Inc, Cambridge, MA 02141 USA
[3] Alphatech Inc, Burlington, MA 01803 USA
[4] Univ Virginia, Dept Syst Engn, Charlottesville, VA 22903 USA
关键词
dynamic programming; neuro-dynamic programming; reinforcement learning; theater missile defense;
D O I
10.1109/3468.823480
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The purpose of this paper is to propose a solution methodology for a missile defense problem involving the sequential allocation of defensive resources over a series of engagements. The problem is cast as a dynamic programming/Markovian decision problem, which is computationally intractable by exact methods because of its large number of states and its complex modeling issues. We have employed a neuro-dynamic programming (NDP) framework, whereby the cost-to-go function is approximated using neural network architectures that are trained on simulated data. We report on the performance obtained using several different training methods, and we compare this performance with the optimal.
引用
收藏
页码:42 / 51
页数:10
相关论文
共 50 条
  • [1] Neuro-dynamic programming
    Volgenant, T
    [J]. INTERFACES, 1997, 27 (06) : 143 - 143
  • [2] Approximate dynamic programming for missile defense interceptor fire control
    Davis, Michael T.
    Robbins, Matthew J.
    Lunday, Brian J.
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2017, 259 (03) : 873 - 886
  • [3] Neuro-dynamic programming for task allocation to unmanned aerial vehicles
    Kamel, A
    Anwar, MM
    Nygard, K
    [J]. INTELLIGENT AND ADAPTIVE SYSTEMS AND SOFTWARE ENGINEERING, 2004, : 121 - 127
  • [4] Neuro-dynamic programming for cooperative inventory control
    Bauso, D
    Giarré, L
    Pesenti, R
    [J]. PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2004, : 5527 - 5532
  • [5] Neuro-Dynamic Programming for adaptive fusion complexity control
    Ross, KN
    Chaney, RD
    [J]. SENSOR FUSION: ARCHITECTURES, ALGORITHMS, AND APPLICATIONS III, 1999, 3719 : 398 - 409
  • [6] Neuro-Dynamic Programming in Control of the Ball and Beam System
    Burghardt, Andrzej
    Szuster, Marcin
    [J]. MECHATRONIC SYSTEMS, MECHANICS AND MATERIALS II, 2014, 210 : 206 - 214
  • [7] A neuro-dynamic programming approach for stochastic reservoir management
    Boukhtouta, A
    Lamond, BF
    [J]. WATER RESOURCES MANAGEMENT II, 2003, 8 : 311 - 320
  • [8] Control of a logistic node via Neuro-Dynamic Programming
    Boccadoro, Mauro
    Martinelli, Francesco
    [J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 4896 - 4901
  • [9] A neuro-dynamic programming approach to retailer inventory management
    Van Roy, B
    Bertsekas, DP
    Lee, Y
    Tsitsiklis, JN
    [J]. PROCEEDINGS OF THE 36TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 1997, : 4052 - 4057
  • [10] Zero-sum differential game guidance law for missile interception engagement via neuro-dynamic programming
    Xi, A-xing
    Cai, Yuan-li
    Deng, Yi-fan
    Jiang, Hao-nan
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2023, 237 (14) : 3352 - 3366