Generative Adversarial Regularized Mutual Information Policy Gradient Framework for Automatic Diagnosis

被引：0

作者：

Xia, Yuan ^{[1
]}

Zhou, Jingbo ^{[2
,3
]}

Shi, Zhenhui ^{[1
]}

Lu, Chao ^{[1
]}

Huang, Haifeng ^{[1
]}

机构：

[1] Baidu Inc, Beijing, Peoples R China

[2] Baidu Res, Business Intelligence Lab, Beijing, Peoples R China

[3] Natl Engn Lab Deep Learning Technol & Applicat, Beijing, Peoples R China

来源：

THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2020年 / 34卷

关键词：

GAME; GO;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Automatic diagnosis systems have attracted increasing attention in recent years. The reinforcement learning (RL) is an attractive technique for building an automatic diagnosis system due to its advantages for handling sequential decision making problem. However, the RL method still cannot achieve good enough prediction accuracy. In this paper, we propose a Generative Adversarial regularized Mutual information Policy gradient framework (GAMP) for automatic diagnosis which aims to make a diagnosis rapidly and accurately. We first propose a new policy gradient framework based on the Generative Adversarial Network (GAN) to optimize the RL model for automatic diagnosis. In our framework, we take the generator of GAN as a policy network, and also use the discriminator of GAN as a part of the reward function. This generative adversarial regularized policy gradient framework can try to avoid generating randomized trials of symptom inquires deviated from the common diagnosis paradigm. In addition, we add mutual information to enhance the reward function to encourage the model to select the most discriminative symptoms to make a diagnosis. Experiment evaluations on two public datasets show that our method beats the state-of-art methods, not only can achieve higher diagnosis accuracy, but also can use a smaller number of inquires to make diagnosis decision.

引用

页码：1062 / 1069

页数：8

共 43 条

[1] Generative Adversarial Network with Policy Gradient for Text Summarization
Rekabdar, Banafsheh
Mousas, Christos
Gupta, Bidyut
[J]. 2019 13TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2019, : 204 - 207
[2] SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
Yu, Lantao
Zhang, Weinan
Wang, Jun
Yu, Yong
[J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2852 - 2858
[3] A Framework of Composite Functional Gradient Methods for Generative Adversarial Models
Johnson, Rie
Zhang, Tong
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) : 17 - 32
[4] PolicyGAN: Training generative adversarial networks using policy gradient
Paria, Biswajit
Lahiri, Avisek
Biswas, Prabir Kumar
[J]. 2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2017, : 151 - 156
[5] Mutual information regularized Bayesian framework for multiple image restoration
Chen, YQ
Wang, HC
Fang, T
Tyan, J
[J]. TENTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 190 - 197
[6] ECG Generation With Sequence Generative Adversarial Nets Optimized by Policy Gradient
Ye, Fei
Zhu, Fei
Fu, Yuchen
Shen, Bairong
[J]. IEEE ACCESS, 2019, 7 : 159369 - 159378
[7] Generative Adversarial Inverse Reinforcement Learning With Deep Deterministic Policy Gradient
Zhan, Ming
Fan, Jingjing
Guo, Jianying
[J]. IEEE ACCESS, 2023, 11 : 87732 - 87746
[8] Beyond Mutual Information: Generative Adversarial Network for Domain Adaptation Using Information Bottleneck Constraint
Chen, Jiawei
Zhang, Ziqi
Xie, Xinpeng
Li, Yuexiang
Xu, Tao
Ma, Kai
Zheng, Yefeng
[J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (03) : 595 - 607
[9] Mutual-Information Regularized Multi-Agent Policy Iteration
Wang, Jiangxing
Ye, Deheng
Lu, Zongqing
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[10] Geological model automatic reconstruction based on conditioning Wasserstein generative adversarial network with gradient penalty
Fan, Wenyao
Liu, Gang
Chen, Qiyu
Cui, Zhesi
Yang, Zixiao
Huang, Qianhong
Wu, Xuechao
[J]. EARTH SCIENCE INFORMATICS, 2023, 16 (3) : 2825 - 2843

← 1 2 3 4 5 →