Latent-Maximum-Entropy-Based Cognitive Radar Reward Function Estimation With Nonideal Observations

Authors
Zhang, Luyao [1]
Zhu, Mengtao [2, 3]
Qin, Jiahao [2]
Li, Yunjie [4]
Affiliations
[1] Chinese Univ Hong Kong, Sch Sci & Engn, Shenzhen 518172, Peoples R China
[2] Beijing Inst Technol, School of Cyberspace Sci & Technol, Beijing 100081, Peoples R China
[3] Lab Electromagnet Space Cognit & Intelligent Contr, Beijing 100191, Peoples R China
[4] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Cognition; Entropy; Trajectory; Optimization; Interference; Cognitive radar; Stochastic processes; Cognitive radar (CR); expectation-maximization (EM); inverse cognition; inverse reinforcement learning; latent maximum entropy (LME); TRACKING; MANAGEMENT;
DOI
10.1109/TAES.2024.3406671
Chinese Library Classification number
V [Aviation, Aerospace]
Discipline classification code
08; 0825
Abstract
The concept of "inverse cognition" has recently emerged and garnered significant research attention in the radar community, spanning inverse filtering, inverse cognitive radar (I-CR), and the design of smart interference for counter-adversarial autonomous systems (i.e., cognitive radars). For instance, recent formulations of I-CR identify whether an adversary cognitive radar's actions (such as waveform selection and beam scheduling) are consistent with constrained utility maximization and, if so, estimate the utility function. In this context, we address the challenge of estimating unknown and complex utility functions from nonideal action observations, where "nonideal" means that observations may be missing or nonoptimal. In this article, we assume that the adversary cognitive radar (CR) optimizes its action policy by maximizing some form of expected utility function with unknown and complex structure over long time horizons. We then design an inverse reinforcement learning (IRL) method for nonideal observations and illustrate its applicability. The nonideal factors are treated as latent variables, and the I-CR problem is formulated as a latent information inference problem. An expectation-maximization (EM)-based algorithm is then developed to iteratively solve the resulting nonconvex, nonlinear optimization problem through a Lagrangian relaxation reformulation. The proposed method is evaluated and compared on simulated CR target tracking scenarios under Markov decision process (MDP) and partially observable MDP settings. Experimental results verify the robustness, effectiveness, and superiority of the proposed method.
Pages: 6656-6670
Page count: 15