Generative models improve fairness of medical classifiers under distribution shifts

Cited by: 18
Authors
Ktena, Ira [1]
Wiles, Olivia [1]
Albuquerque, Isabela [1]
Rebuffi, Sylvestre-Alvise [1]
Tanno, Ryutaro [1]
Roy, Abhijit Guha [2]
Azizi, Shekoofeh [1]
Belgrave, Danielle [3]
Kohli, Pushmeet [1]
Cemgil, Taylan [1]
Karthikesalingam, Alan [2]
Gowal, Sven [1]
Affiliations
[1] Google DeepMind, London, England
[2] Google Research, London, England
[3] GSKai, London, England
Keywords
PERFORMANCE; IMAGES
DOI
10.1038/s41591-024-02838-6
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Domain generalization is a ubiquitous challenge for machine learning in healthcare. Model performance in real-world conditions might be lower than expected because of discrepancies between the data encountered during deployment and development. Underrepresentation of some groups or conditions during model development is a common cause of this phenomenon. This challenge is often not readily addressed by targeted data acquisition and 'labeling' by expert clinicians, which can be prohibitively expensive or practically impossible because of the rarity of conditions or the available clinical expertise. We hypothesize that advances in generative artificial intelligence can help mitigate this unmet need in a steerable fashion, enriching our training dataset with synthetic examples that address shortfalls of underrepresented conditions or subgroups. We show that diffusion models can automatically learn realistic augmentations from data in a label-efficient manner. We demonstrate that learned augmentations make models more robust and statistically fair in-distribution and out of distribution. To evaluate the generality of our approach, we studied three distinct medical imaging contexts of varying difficulty: (1) histopathology, (2) chest X-ray and (3) dermatology images. Complementing real samples with synthetic ones improved the robustness of models in all three medical tasks and increased fairness by improving the accuracy of clinical diagnosis within underrepresented groups, especially out of distribution. By generating synthetic image samples specific to underrepresented groups, diffusion models help medical image classifiers to achieve greater fairness metrics across a variety of medical disciplines and demographic attributes.
Pages: 1166-1173
Number of pages: 8