Missile defense and interceptor allocation by neuro-dynamic programming

被引:72
|
作者
Bertsekas, DP [1 ]
Homer, ML
Logan, DA
Patek, SD
Sandell, NR
机构
[1] MIT, Informat & Decis Syst Lab, Cambridge, MA 02139 USA
[2] Biogen Inc, Cambridge, MA 02141 USA
[3] Alphatech Inc, Burlington, MA 01803 USA
[4] Univ Virginia, Dept Syst Engn, Charlottesville, VA 22903 USA
关键词
dynamic programming; neuro-dynamic programming; reinforcement learning; theater missile defense;
D O I
10.1109/3468.823480
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The purpose of this paper is to propose a solution methodology for a missile defense problem involving the sequential allocation of defensive resources over a series of engagements. The problem is cast as a dynamic programming/Markovian decision problem, which is computationally intractable by exact methods because of its large number of states and its complex modeling issues. We have employed a neuro-dynamic programming (NDP) framework, whereby the cost-to-go function is approximated using neural network architectures that are trained on simulated data. We report on the performance obtained using several different training methods, and we compare this performance with the optimal.
引用
收藏
页码:42 / 51
页数:10
相关论文
共 50 条
  • [31] Minimising total cost with regular and emergency outsourcing sources: a neuro-dynamic programming approach
    Dhawan, Aayush
    Srinivasan, Samashivan
    Rajib, Prabina
    Bidanda, Bopaya
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2009, 47 (20) : 5811 - 5827
  • [32] An Online Energy Management Control for Hybrid Electric Vehicles Based on Neuro-Dynamic Programming
    Qin, Feiyan
    Li, Weimin
    Hu, Yue
    Xu, Guoqing
    ALGORITHMS, 2018, 11 (03)
  • [33] Call admission control and routing in integrated services networks using neuro-dynamic programming
    Marbach, P
    Mihatsch, O
    Tsitsiklis, JN
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2000, 18 (02) : 197 - 208
  • [34] Joint Routing and Bitrate Adjustment for DASH Video via Neuro-Dynamic Programming in SDN
    Zhu, Kunjie
    Jiang, Junchao
    Yang, Bowen
    Cai, Weizhe
    Yang, Jian
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT I, 2016, 9947 : 413 - 420
  • [35] A neuro-dynamic programming approach to admission control in ATM networks: The single link case
    Marbach, P
    Tsitsiklis, JN
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 159 - 162
  • [36] Sensor fusion options for Ballistic Missile Defense interceptor applications
    Bjork, C
    Morris, N
    Dasarathy, BV
    Smith, B
    Allen, D
    Prestwood, WT
    SENSOR FUSION: ARCHITECTURES, ALGORITHMS, AND APPLICATIONS III, 1999, 3719 : 92 - 102
  • [37] A game theoretical interceptor guidance law for ballistic missile defense
    Shinar, J
    Shima, T
    PROCEEDINGS OF THE 35TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 1996, : 2780 - 2785
  • [38] Boundary Control of Linear One-Dimensional Parabolic PDE using Neuro-Dynamic Programming
    Talaei, B.
    Jagannathan, S.
    Singler, J.
    2015 IEEE CONFERENCE ON CONTROL AND APPLICATIONS (CCA 2015), 2015, : 577 - 582
  • [39] Scheduling of re-entrant lines with neuro-dynamic programming based on a new evaluating criterion
    Wang, Ying
    Jin, Huiyu
    Zhu, Shunzhi
    Li, Maoqing
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 3, PROCEEDINGS, 2006, 3973 : 921 - 926
  • [40] A neuro-dynamic programming-based optimal controller for tomato seedling growth in greenhouse systems
    Pucheta, J.
    Patino, H.
    Fullana, R.
    Schugurensky, C.
    Kuchen, B.
    NEURAL PROCESSING LETTERS, 2006, 24 (03) : 241 - 260