Latent-Maximum-Entropy-Based Cognitive Radar Reward Function Estimation With Nonideal Observations

被引：0

作者：

Zhang, Luyao ^{[1
]}

Zhu, Mengtao ^{[2
,3
]}

Qin, Jiahao ^{[2
]}

Li, Yunjie ^{[4
]}

机构：

[1] Chinese Univ Hong Kong, Sch Sci & Engn, Shenzhen 518172, Peoples R China

[2] Beijing Inst Technol, Schoolof Cyberspace Sci & Technol, Beijing 100081, Peoples R China

[3] Lab Electromagnet Space Cognit & Intelligent Contr, Beijing 100191, Peoples R China

[4] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China

来源：

IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS | 2024年 / 60卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Cognition; Entropy; Trajectory; Optimization; Interference; Cognitive radar; Stochastic processes; Cognitive radar (CR); expectation-maximization (EM); inverse cognition; inverse reinforcement learning; latent maximum entropy (LME); TRACKING; MANAGEMENT;

D O I：

10.1109/TAES.2024.3406671

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

The concept of "inverse cognition" has recently emerged and has garnered significant research attention in the radar community from aspects of inverse filtering, inverse cognitive radar (I-CR), and designing smart interference for counter-adversarial autonomous systems (i.e., the cognitive radar). For instance, identifying whether an adversary cognitive radar's actions (such as waveform selection and beam scheduling) are consistent with the constrained utility maximization and if so, estimating the utility function has led to recent formulations of I-CR. In this context of I-CR, we address the challenges of estimating unknown and complex utility functions with nonideal action observations. We mean nonideal by missing and nonoptimal action observations. In this article, we assume that the adversary CR is optimizing its action policy by maximizing some forms of the expected utility function with unknown and complex structures over long time horizons. We then designed an IRL method under nonideal observations and illustrated the applicability of the methods. The nonideal factors are treated as latent variables, and the I-CR problem is formulated as a latent information inference problem. Then, an expectation-maximization (EM)-based algorithm is developed to iteratively solve the problem with nonconvex and nonlinear optimizations through a Lagrangian relaxation reformulation. The performance of the proposed method is evaluated and compared utilizing simulated CR target tracking scenarios with Markov decision process (MDP) and partially observable MDP settings. Experimental results verified the robustness, effectiveness, and superiority of the proposed method.

引用

页码：6656 / 6670

页数：15

共 50 条

[31] THE MAXIMUM ENTROPY ESTIMATION OF STRUCTURAL RELIABILITY BASED ON MONTE CARLO SIMULATION
Huang, Wenbo
Mao, Jiangang
Zhang, Zhiyong
33RD INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE AND ARCTIC ENGINEERING, 2014, VOL 4B, 2014,
[32] Confidence Interval Estimation for Precipitation Quantiles Based on Principle of Maximum Entropy
Wei, Ting
Song, Songbai
ENTROPY, 2019, 21 (03)
[33] Maximum likelihood DOA estimation based on the cross-entropy method
Chen, Yen-Chih
Su, Yu T.
2006 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, VOLS 1-6, PROCEEDINGS, 2006, : 851 - +
[34] Maximum Entropy Principle Based Estimation of Performance Distribution in Queueing Theory
He, Dayi
Li, Ran
Huang, Qi
Lei, Ping
PLOS ONE, 2014, 9 (09):
[35] Maximum-Likelihood Maximum-Entropy Constrained Probability Density Function Estimation for Prediction of Rare Events
Ahooyi, Taha Mohseni
Soroush, Masoud
Arbogast, Jeffrey E.
Seider, Warren D.
Oktem, Ulku G.
AICHE JOURNAL, 2014, 60 (03) : 1013 - 1026
[36] Physics-Based Cognitive Radar Modeling and Parameter Estimation
Sedighi, Saeid
Shankar, Bhavani M. R.
Mishra, Kumar Vijay
Rangaswamy, Muralidhar
2022 IEEE RADAR CONFERENCE (RADARCONF'22), 2022,
[37] Nonparametric estimation of quantile-based entropy function
Subhash, Silpa
Sunoj, S. M.
Sankaran, P. G.
Rajesh, G.
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (05) : 1805 - 1821
[38] Maximum average entropy-based quantization of local observations for distributed detection
Wahdan, Muath A.
Altinkaya, Mustafa A.
DIGITAL SIGNAL PROCESSING, 2022, 123
[39] Local estimation of failure probability function and its confidence interval with maximum entropy principle
Ching, Jianye
Hsieh, Yi-Hung
PROBABILISTIC ENGINEERING MECHANICS, 2007, 22 (01) : 39 - 49
[40] Dynamic state estimation of power systems considering maximum correlation entropy and quadratic function
Chen, Tengpeng
Liu, Fangyan
Sun, Lu
Amaratunga, Gehan A. J.
Zeng, Nianyin
MEASUREMENT SCIENCE AND TECHNOLOGY, 2023, 34 (08)

← 1 2 3 4 5 →