Latent-Maximum-Entropy-Based Cognitive Radar Reward Function Estimation With Nonideal Observations

被引:0
|
作者
Zhang, Luyao [1 ]
Zhu, Mengtao [2 ,3 ]
Qin, Jiahao [2 ]
Li, Yunjie [4 ]
机构
[1] Chinese Univ Hong Kong, Sch Sci & Engn, Shenzhen 518172, Peoples R China
[2] Beijing Inst Technol, Schoolof Cyberspace Sci & Technol, Beijing 100081, Peoples R China
[3] Lab Electromagnet Space Cognit & Intelligent Contr, Beijing 100191, Peoples R China
[4] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
Cognition; Entropy; Trajectory; Optimization; Interference; Cognitive radar; Stochastic processes; Cognitive radar (CR); expectation-maximization (EM); inverse cognition; inverse reinforcement learning; latent maximum entropy (LME); TRACKING; MANAGEMENT;
D O I
10.1109/TAES.2024.3406671
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
The concept of "inverse cognition" has recently emerged and has garnered significant research attention in the radar community from aspects of inverse filtering, inverse cognitive radar (I-CR), and designing smart interference for counter-adversarial autonomous systems (i.e., the cognitive radar). For instance, identifying whether an adversary cognitive radar's actions (such as waveform selection and beam scheduling) are consistent with the constrained utility maximization and if so, estimating the utility function has led to recent formulations of I-CR. In this context of I-CR, we address the challenges of estimating unknown and complex utility functions with nonideal action observations. We mean nonideal by missing and nonoptimal action observations. In this article, we assume that the adversary CR is optimizing its action policy by maximizing some forms of the expected utility function with unknown and complex structures over long time horizons. We then designed an IRL method under nonideal observations and illustrated the applicability of the methods. The nonideal factors are treated as latent variables, and the I-CR problem is formulated as a latent information inference problem. Then, an expectation-maximization (EM)-based algorithm is developed to iteratively solve the problem with nonconvex and nonlinear optimizations through a Lagrangian relaxation reformulation. The performance of the proposed method is evaluated and compared utilizing simulated CR target tracking scenarios with Markov decision process (MDP) and partially observable MDP settings. Experimental results verified the robustness, effectiveness, and superiority of the proposed method.
引用
收藏
页码:6656 / 6670
页数:15
相关论文
共 50 条
  • [31] THE MAXIMUM ENTROPY ESTIMATION OF STRUCTURAL RELIABILITY BASED ON MONTE CARLO SIMULATION
    Huang, Wenbo
    Mao, Jiangang
    Zhang, Zhiyong
    33RD INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE AND ARCTIC ENGINEERING, 2014, VOL 4B, 2014,
  • [32] Confidence Interval Estimation for Precipitation Quantiles Based on Principle of Maximum Entropy
    Wei, Ting
    Song, Songbai
    ENTROPY, 2019, 21 (03)
  • [33] Maximum likelihood DOA estimation based on the cross-entropy method
    Chen, Yen-Chih
    Su, Yu T.
    2006 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, VOLS 1-6, PROCEEDINGS, 2006, : 851 - +
  • [34] Maximum Entropy Principle Based Estimation of Performance Distribution in Queueing Theory
    He, Dayi
    Li, Ran
    Huang, Qi
    Lei, Ping
    PLOS ONE, 2014, 9 (09):
  • [35] Maximum-Likelihood Maximum-Entropy Constrained Probability Density Function Estimation for Prediction of Rare Events
    Ahooyi, Taha Mohseni
    Soroush, Masoud
    Arbogast, Jeffrey E.
    Seider, Warren D.
    Oktem, Ulku G.
    AICHE JOURNAL, 2014, 60 (03) : 1013 - 1026
  • [36] Physics-Based Cognitive Radar Modeling and Parameter Estimation
    Sedighi, Saeid
    Shankar, Bhavani M. R.
    Mishra, Kumar Vijay
    Rangaswamy, Muralidhar
    2022 IEEE RADAR CONFERENCE (RADARCONF'22), 2022,
  • [37] Nonparametric estimation of quantile-based entropy function
    Subhash, Silpa
    Sunoj, S. M.
    Sankaran, P. G.
    Rajesh, G.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (05) : 1805 - 1821
  • [38] Maximum average entropy-based quantization of local observations for distributed detection
    Wahdan, Muath A.
    Altinkaya, Mustafa A.
    DIGITAL SIGNAL PROCESSING, 2022, 123
  • [39] Local estimation of failure probability function and its confidence interval with maximum entropy principle
    Ching, Jianye
    Hsieh, Yi-Hung
    PROBABILISTIC ENGINEERING MECHANICS, 2007, 22 (01) : 39 - 49
  • [40] Dynamic state estimation of power systems considering maximum correlation entropy and quadratic function
    Chen, Tengpeng
    Liu, Fangyan
    Sun, Lu
    Amaratunga, Gehan A. J.
    Zeng, Nianyin
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2023, 34 (08)