Robust and data-efficient generalization of self-supervised machine learning for diagnostic imaging

被引:39
|
作者
Azizi, Shekoofeh [1 ]
Culp, Laura [1 ]
Freyberg, Jan [1 ]
Mustafa, Basil [1 ]
Baur, Sebastien [1 ]
Kornblith, Simon [1 ]
Chen, Ting [1 ]
Tomasev, Nenad [2 ]
Mitrovic, Jovana [2 ]
Strachan, Patricia [1 ]
Mahdavi, S. Sara [1 ]
Wulczyn, Ellery [1 ]
Babenko, Boris [1 ]
Walker, Megan [1 ]
Loh, Aaron [1 ]
Chen, Po-Hsuan Cameron [1 ]
Liu, Yuan [1 ]
Bavishi, Pinal [1 ]
McKinney, Scott Mayer [1 ]
Winkens, Jim [1 ]
Roy, Abhijit Guha [1 ]
Beaver, Zach [1 ]
Ryan, Fiona [3 ]
Krogue, Justin [1 ]
Etemadi, Mozziyar [4 ]
Telang, Umesh [1 ]
Liu, Yun [1 ]
Peng, Lily [1 ]
Corrado, Greg S. [1 ]
Webster, Dale R. [1 ]
Fleet, David [1 ]
Hinton, Geoffrey [1 ]
Houlsby, Neil [1 ]
Karthikesalingam, Alan [1 ]
Norouzi, Mohammad [1 ]
Natarajan, Vivek [1 ]
机构
[1] Google Res, Mountain View, CA USA
[2] DeepMind, London, England
[3] Georgia Inst Technol, Comp Sci, Atlanta, GA USA
[4] Northwestern Univ, Sch Med, Sch Engn, Chicago, IL USA
关键词
DIABETIC-RETINOPATHY; NEURAL-NETWORK; DEEP; CANCER; CLASSIFICATION; EDEMA;
D O I
10.1038/s41551-023-01049-7
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
A representation-learning strategy for machine-learning models applied to medical-imaging tasks improves model robustness and training efficiency and mitigates suboptimal out-of-distribution performance. Machine-learning models for medical tasks can match or surpass the performance of clinical experts. However, in settings differing from those of the training dataset, the performance of a model can deteriorate substantially. Here we report a representation-learning strategy for machine-learning models applied to medical-imaging tasks that mitigates such 'out of distribution' performance problem and that improves model robustness and training efficiency. The strategy, which we named REMEDIS (for 'Robust and Efficient Medical Imaging with Self-supervision'), combines large-scale supervised transfer learning on natural images and intermediate contrastive self-supervised learning on medical images and requires minimal task-specific customization. We show the utility of REMEDIS in a range of diagnostic-imaging tasks covering six imaging domains and 15 test datasets, and by simulating three realistic out-of-distribution scenarios. REMEDIS improved in-distribution diagnostic accuracies up to 11.5% with respect to strong supervised baseline models, and in out-of-distribution settings required only 1-33% of the data for retraining to match the performance of supervised models retrained using all available data. REMEDIS may accelerate the development lifecycle of machine-learning models for medical imaging.
引用
收藏
页码:756 / +
页数:30
相关论文
共 50 条
  • [1] Robust and data-efficient generalization of self-supervised machine learning for diagnostic imaging
    Shekoofeh Azizi
    Laura Culp
    Jan Freyberg
    Basil Mustafa
    Sebastien Baur
    Simon Kornblith
    Ting Chen
    Nenad Tomasev
    Jovana Mitrović
    Patricia Strachan
    S. Sara Mahdavi
    Ellery Wulczyn
    Boris Babenko
    Megan Walker
    Aaron Loh
    Po-Hsuan Cameron Chen
    Yuan Liu
    Pinal Bavishi
    Scott Mayer McKinney
    Jim Winkens
    Abhijit Guha Roy
    Zach Beaver
    Fiona Ryan
    Justin Krogue
    Mozziyar Etemadi
    Umesh Telang
    Yun Liu
    Lily Peng
    Greg S. Corrado
    Dale R. Webster
    David Fleet
    Geoffrey Hinton
    Neil Houlsby
    Alan Karthikesalingam
    Mohammad Norouzi
    Vivek Natarajan
    [J]. Nature Biomedical Engineering, 2023, 7 (6) : 756 - 779
  • [2] A self-supervised deep learning method for data-efficient training in genomics
    Hüseyin Anil Gündüz
    Martin Binder
    Xiao-Yin To
    René Mreches
    Bernd Bischl
    Alice C. McHardy
    Philipp C. Münch
    Mina Rezaei
    [J]. Communications Biology, 6
  • [3] A self-supervised deep learning method for data-efficient training in genomics
    Guenduez, Hueseyin Anil
    Binder, Martin
    To, Xiao-Yin
    Mreches, Rene
    Bischl, Bernd
    McHardy, Alice C.
    Muench, Philipp C.
    Rezaei, Mina
    [J]. COMMUNICATIONS BIOLOGY, 2023, 6 (01)
  • [4] Self-Supervised Learning With Data-Efficient Supervised Fine-Tuning for Crowd Counting
    Wang, Rui
    Hao, Yixue
    Hu, Long
    Chen, Jincai
    Chen, Min
    Wu, Di
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1538 - 1546
  • [5] A data-efficient self-supervised deep learning model for design and characterization of nanophotonic structures
    Ma, Wei
    Liu, Yongmin
    [J]. SCIENCE CHINA-PHYSICS MECHANICS & ASTRONOMY, 2020, 63 (08)
  • [6] A data-efficient self-supervised deep learning model for design and characterization of nanophotonic structures
    Wei Ma
    Yongmin Liu
    [J]. Science China Physics, Mechanics & Astronomy, 2020, 63
  • [7] A data-efficient self-supervised deep learning model for design and characterization of nanophotonic structures
    Wei Ma
    Yongmin Liu
    [J]. Science China(Physics,Mechanics & Astronomy), 2020, (08) : 27 - 34
  • [8] Data-Efficient Masked Video Modeling for Self-supervised Action Recognition
    Li, Qiankun
    Huang, Xiaolong
    Wan, Zhifan
    Hu, Lanqing
    Wu, Shuzhe
    Zhang, Jie
    Shan, Shiguang
    Wang, Zengfu
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2723 - 2733
  • [9] Fair, Robust, and Data-Efficient Machine Learning in Healthcare
    Singh, Harvineet
    [J]. PROCEEDINGS OF THE 2022 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2022, 2022, : 914 - 914
  • [10] Primitive-contrastive network: data-efficient self-supervised learning from robot demonstration videos
    Pengfei Sun
    Zhile Yang
    Tianren Zhang
    Shangqi Guo
    Feng Chen
    [J]. Applied Intelligence, 2022, 52 : 4258 - 4273