Generative models improve fairness of medical classifiers under distribution shifts

被引：18

作者：

Ktena, Ira ^{[1
]}

Wiles, Olivia ^{[1
]}

Albuquerque, Isabela ^{[1
]}

Rebuffi, Sylvestre-Alvise ^{[1
]}

Tanno, Ryutaro ^{[1
]}

Roy, Abhijit Guha ^{[2
]}

Azizi, Shekoofeh ^{[1
]}

Belgrave, Danielle ^{[3
]}

Kohli, Pushmeet ^{[1
]}

Cemgil, Taylan ^{[1
]}

Karthikesalingam, Alan ^{[2
]}

Gowal, Sven ^{[1
]}

机构：

[1] Google DeepMind, London, England

[2] Google Res, London, England

[3] GSKai, London, England

来源：

NATURE MEDICINE | 2024年 / 30卷 / 04期

关键词：

PERFORMANCE; IMAGES;

D O I：

10.1038/s41591-024-02838-6

中图分类号：

Q5 [生物化学]; Q7 [分子生物学];

学科分类号：

071010 ; 081704 ;

摘要：

Domain generalization is a ubiquitous challenge for machine learning in healthcare. Model performance in real-world conditions might be lower than expected because of discrepancies between the data encountered during deployment and development. Underrepresentation of some groups or conditions during model development is a common cause of this phenomenon. This challenge is often not readily addressed by targeted data acquisition and 'labeling' by expert clinicians, which can be prohibitively expensive or practically impossible because of the rarity of conditions or the available clinical expertise. We hypothesize that advances in generative artificial intelligence can help mitigate this unmet need in a steerable fashion, enriching our training dataset with synthetic examples that address shortfalls of underrepresented conditions or subgroups. We show that diffusion models can automatically learn realistic augmentations from data in a label-efficient manner. We demonstrate that learned augmentations make models more robust and statistically fair in-distribution and out of distribution. To evaluate the generality of our approach, we studied three distinct medical imaging contexts of varying difficulty: (1) histopathology, (2) chest X-ray and (3) dermatology images. Complementing real samples with synthetic ones improved the robustness of models in all three medical tasks and increased fairness by improving the accuracy of clinical diagnosis within underrepresented groups, especially out of distribution. By generating synthetic image samples specific to underrepresented groups, diffusion models help medical image classifiers to achieve greater fairness metrics across a variety of medical disciplines and demographic attributes.

引用

页码：1166 / 1173

页数：8

共 50 条

[1] On Measuring Fairness in Generative Models
Teo, Christopher T. H.
Abdollahzadeh, Milad
Cheung, Ngai-Man
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[2] Exploiting generative models in discriminative classifiers
Jaakkola, TS
Haussler, D
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 487 - 493
[3] Transferring Fairness under Distribution Shifts via Fair Consistency Regularization
An, Bang
Che, Zora
Ding, Mucong
Huang, Furong
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[4] Characterizing Bias in Classifiers using Generative Models
McDuff, Daniel
Song, Yale
Kapoor, Ashish
Ma, Shuang
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[5] Supervised Algorithmic Fairness in Distribution Shifts: A Survey
Shao, Minglai
Li, Dong
Zhao, Chen
Wu, Xintao
Lin, Yujie
Tian, Qin
PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 8225 - 8233
[6] Generalized "avatar" niche shifts improve distribution models for invasive species
Larson, Eric R.
Gallagher, Rachael V.
Beaumont, Linda J.
Olden, Julian D.
DIVERSITY AND DISTRIBUTIONS, 2014, 20 (11) : 1296 - 1306
[7] Ensuring Fairness under Prior Probability Shifts
Biswas, Arpita
Mukherjee, Suvam
AIES '21: PROCEEDINGS OF THE 2021 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, 2021, : 414 - 424
[8] Uncertainty-Aware Deep Classifiers Using Generative Models
Sensoy, Murat
Kaplan, Lance
Cerutti, Federico
Saleki, Maryam
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 5620 - 5627
[9] A new class of generative classifiers based on staged tree models
Carli, Federico
Leonelli, Manuele
Varando, Gherardo
KNOWLEDGE-BASED SYSTEMS, 2023, 268
[10] medXGAN: Visual Explanations for Medical Classifiers through a Generative Latent Space
Dravid, Amil
Schiffers, Florian
Gong, Boqing
Katsaggelos, Aggelos K.
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2935 - 2944

← 1 2 3 4 5 →