Generative Adversarial Regularized Mutual Information Policy Gradient Framework for Automatic Diagnosis

被引:0
|
作者
Xia, Yuan [1 ]
Zhou, Jingbo [2 ,3 ]
Shi, Zhenhui [1 ]
Lu, Chao [1 ]
Huang, Haifeng [1 ]
机构
[1] Baidu Inc, Beijing, Peoples R China
[2] Baidu Res, Business Intelligence Lab, Beijing, Peoples R China
[3] Natl Engn Lab Deep Learning Technol & Applicat, Beijing, Peoples R China
关键词
GAME; GO;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic diagnosis systems have attracted increasing attention in recent years. The reinforcement learning (RL) is an attractive technique for building an automatic diagnosis system due to its advantages for handling sequential decision making problem. However, the RL method still cannot achieve good enough prediction accuracy. In this paper, we propose a Generative Adversarial regularized Mutual information Policy gradient framework (GAMP) for automatic diagnosis which aims to make a diagnosis rapidly and accurately. We first propose a new policy gradient framework based on the Generative Adversarial Network (GAN) to optimize the RL model for automatic diagnosis. In our framework, we take the generator of GAN as a policy network, and also use the discriminator of GAN as a part of the reward function. This generative adversarial regularized policy gradient framework can try to avoid generating randomized trials of symptom inquires deviated from the common diagnosis paradigm. In addition, we add mutual information to enhance the reward function to encourage the model to select the most discriminative symptoms to make a diagnosis. Experiment evaluations on two public datasets show that our method beats the state-of-art methods, not only can achieve higher diagnosis accuracy, but also can use a smaller number of inquires to make diagnosis decision.
引用
收藏
页码:1062 / 1069
页数:8
相关论文
共 43 条
  • [41] CE-FFGAN: A feature fusion generative adversarial network with deep embedded category information for limited sample fault diagnosis of rotating machinery under speed variation
    Yang, Chen
    Li, Hongkun
    Cao, Shunxin
    Zhang, Kongliang
    Xiang, Wei
    Liu, Xuejun
    [J]. ADVANCED ENGINEERING INFORMATICS, 2024, 62
  • [42] A new flow injection calibration system based on automatic diagnosis and correction for fault peak signal using flow injection gradient information
    Fan, SH
    Fang, ZL
    [J]. CHINESE JOURNAL OF ANALYTICAL CHEMISTRY, 2002, 30 (09) : 1038 - 1041
  • [43] Partial Differential Equation-Constrained Diffeomorphic Registration from Sum of Squared Differences to Normalized Cross-Correlation, Normalized Gradient Fields, and Mutual Information: A Unifying Framework
    Hernandez, Monica
    Ramon-Julvez, Ubaldo
    Sierra-Tome, Daniel
    [J]. SENSORS, 2022, 22 (10)