Latent-Maximum-Entropy-Based Cognitive Radar Reward Function Estimation With Nonideal Observations

被引:0
|
作者
Zhang, Luyao [1 ]
Zhu, Mengtao [2 ,3 ]
Qin, Jiahao [2 ]
Li, Yunjie [4 ]
机构
[1] Chinese Univ Hong Kong, Sch Sci & Engn, Shenzhen 518172, Peoples R China
[2] Beijing Inst Technol, Schoolof Cyberspace Sci & Technol, Beijing 100081, Peoples R China
[3] Lab Electromagnet Space Cognit & Intelligent Contr, Beijing 100191, Peoples R China
[4] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
Cognition; Entropy; Trajectory; Optimization; Interference; Cognitive radar; Stochastic processes; Cognitive radar (CR); expectation-maximization (EM); inverse cognition; inverse reinforcement learning; latent maximum entropy (LME); TRACKING; MANAGEMENT;
D O I
10.1109/TAES.2024.3406671
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
The concept of "inverse cognition" has recently emerged and has garnered significant research attention in the radar community from aspects of inverse filtering, inverse cognitive radar (I-CR), and designing smart interference for counter-adversarial autonomous systems (i.e., the cognitive radar). For instance, identifying whether an adversary cognitive radar's actions (such as waveform selection and beam scheduling) are consistent with the constrained utility maximization and if so, estimating the utility function has led to recent formulations of I-CR. In this context of I-CR, we address the challenges of estimating unknown and complex utility functions with nonideal action observations. We mean nonideal by missing and nonoptimal action observations. In this article, we assume that the adversary CR is optimizing its action policy by maximizing some forms of the expected utility function with unknown and complex structures over long time horizons. We then designed an IRL method under nonideal observations and illustrated the applicability of the methods. The nonideal factors are treated as latent variables, and the I-CR problem is formulated as a latent information inference problem. Then, an expectation-maximization (EM)-based algorithm is developed to iteratively solve the problem with nonconvex and nonlinear optimizations through a Lagrangian relaxation reformulation. The performance of the proposed method is evaluated and compared utilizing simulated CR target tracking scenarios with Markov decision process (MDP) and partially observable MDP settings. Experimental results verified the robustness, effectiveness, and superiority of the proposed method.
引用
收藏
页码:6656 / 6670
页数:15
相关论文
共 50 条
  • [21] Fusion of Bayesian maximum entropy spectral estimation and variational analysis methods for enhanced radar imaging
    Shkvarko, Yuriy
    Vazquez-Bautista, Rene
    Villalon-Turrubiates, Ivan
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PROCEEDINGS, 2007, 4678 : 109 - 120
  • [22] Maximum entropy estimation of income share function from generalized Gini index
    Rad, N. Nakhaei
    Borzadaran, G. R. Mohtashami
    Yari, G. H.
    JOURNAL OF APPLIED STATISTICS, 2016, 43 (16) : 2910 - 2921
  • [23] Estimation of the maximum entropy quantile function using fractional probability weighted moments
    Deng, Jian
    Pandey, M. D.
    STRUCTURAL SAFETY, 2008, 30 (04) : 307 - 319
  • [24] Automatically determine the membership function based on the maximum entropy principle
    Cheng, HD
    Chen, JR
    INFORMATION SCIENCES, 1997, 96 (3-4) : 163 - 182
  • [25] Reconstructing parton distribution function based on maximum entropy method
    Zhang, Shihan
    Wang, Xiaobin
    Lin, Tao
    Chang, Lei
    CHINESE PHYSICS C, 2024, 48 (03)
  • [26] Reconstructing parton distribution function based on maximum entropy method
    张思翰
    王晓斌
    林涛
    常雷
    Chinese Physics C, 2024, 48 (03) : 50 - 55
  • [27] Precipitation estimation over radar gap areas based on satellite and adjacent radar observations
    Lee, Y. -R.
    Shin, D. -B.
    Kim, J. -H.
    Park, H. -S.
    ATMOSPHERIC MEASUREMENT TECHNIQUES, 2015, 8 (02) : 719 - 728
  • [28] Precipitation estimation over radar gap areas based on satellite and adjacent radar observations
    Shin, Dong-Bin
    Lee, Yu-Ri
    2015 1st URSI Atlantic Radio Science Conference (URSI AT-RASC), 2015,
  • [29] Anomaly detection in hyperspectral imagery based on maximum entropy and nonparametric estimation
    He, Lin
    Pan, Quan
    Di, Wei
    Li, Yuanqing
    PATTERN RECOGNITION LETTERS, 2008, 29 (09) : 1392 - 1403
  • [30] Research on the Periodicity of Wind Power Based on the Maximum Entropy Spectrum Estimation
    Xu, Haixiang
    Wu, Linlin
    Liu, Hui
    Wang, Ruoyang
    Cui, Zhengpai
    2016 INTERNATIONAL CONFERENCE ON PROBABILISTIC METHODS APPLIED TO POWER SYSTEMS (PMAPS), 2016,