Robust and data-efficient generalization of self-supervised machine learning for diagnostic imaging

被引：39

作者：

Azizi, Shekoofeh ^{[1
]}

Culp, Laura ^{[1
]}

Freyberg, Jan ^{[1
]}

Mustafa, Basil ^{[1
]}

Baur, Sebastien ^{[1
]}

Kornblith, Simon ^{[1
]}

Chen, Ting ^{[1
]}

Tomasev, Nenad ^{[2
]}

Mitrovic, Jovana ^{[2
]}

Strachan, Patricia ^{[1
]}

Mahdavi, S. Sara ^{[1
]}

Wulczyn, Ellery ^{[1
]}

Babenko, Boris ^{[1
]}

Walker, Megan ^{[1
]}

Loh, Aaron ^{[1
]}

Chen, Po-Hsuan Cameron ^{[1
]}

Liu, Yuan ^{[1
]}

Bavishi, Pinal ^{[1
]}

McKinney, Scott Mayer ^{[1
]}

Winkens, Jim ^{[1
]}

Roy, Abhijit Guha ^{[1
]}

Beaver, Zach ^{[1
]}

Ryan, Fiona ^{[3
]}

Krogue, Justin ^{[1
]}

Etemadi, Mozziyar ^{[4
]}

Telang, Umesh ^{[1
]}

Liu, Yun ^{[1
]}

Peng, Lily ^{[1
]}

Corrado, Greg S. ^{[1
]}

Webster, Dale R. ^{[1
]}

Fleet, David ^{[1
]}

Hinton, Geoffrey ^{[1
]}

Houlsby, Neil ^{[1
]}

Karthikesalingam, Alan ^{[1
]}

Norouzi, Mohammad ^{[1
]}

Natarajan, Vivek ^{[1
]}

机构：

[1] Google Res, Mountain View, CA USA

[2] DeepMind, London, England

[3] Georgia Inst Technol, Comp Sci, Atlanta, GA USA

[4] Northwestern Univ, Sch Med, Sch Engn, Chicago, IL USA

来源：

NATURE BIOMEDICAL ENGINEERING | 2023年 / 7卷 / 06期

关键词：

DIABETIC-RETINOPATHY; NEURAL-NETWORK; DEEP; CANCER; CLASSIFICATION; EDEMA;

D O I：

10.1038/s41551-023-01049-7

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

A representation-learning strategy for machine-learning models applied to medical-imaging tasks improves model robustness and training efficiency and mitigates suboptimal out-of-distribution performance. Machine-learning models for medical tasks can match or surpass the performance of clinical experts. However, in settings differing from those of the training dataset, the performance of a model can deteriorate substantially. Here we report a representation-learning strategy for machine-learning models applied to medical-imaging tasks that mitigates such 'out of distribution' performance problem and that improves model robustness and training efficiency. The strategy, which we named REMEDIS (for 'Robust and Efficient Medical Imaging with Self-supervision'), combines large-scale supervised transfer learning on natural images and intermediate contrastive self-supervised learning on medical images and requires minimal task-specific customization. We show the utility of REMEDIS in a range of diagnostic-imaging tasks covering six imaging domains and 15 test datasets, and by simulating three realistic out-of-distribution scenarios. REMEDIS improved in-distribution diagnostic accuracies up to 11.5% with respect to strong supervised baseline models, and in out-of-distribution settings required only 1-33% of the data for retraining to match the performance of supervised models retrained using all available data. REMEDIS may accelerate the development lifecycle of machine-learning models for medical imaging.

引用

页码：756 / +

页数：30

共 50 条

[1] Robust and data-efficient generalization of self-supervised machine learning for diagnostic imaging
Shekoofeh Azizi
Laura Culp
Jan Freyberg
Basil Mustafa
Sebastien Baur
Simon Kornblith
Ting Chen
Nenad Tomasev
Jovana Mitrović
Patricia Strachan
S. Sara Mahdavi
Ellery Wulczyn
Boris Babenko
Megan Walker
Aaron Loh
Po-Hsuan Cameron Chen
Yuan Liu
Pinal Bavishi
Scott Mayer McKinney
Jim Winkens
Abhijit Guha Roy
Zach Beaver
Fiona Ryan
Justin Krogue
Mozziyar Etemadi
Umesh Telang
Yun Liu
Lily Peng
Greg S. Corrado
Dale R. Webster
David Fleet
Geoffrey Hinton
Neil Houlsby
Alan Karthikesalingam
Mohammad Norouzi
Vivek Natarajan
[J]. Nature Biomedical Engineering, 2023, 7 (6) : 756 - 779
[2] A self-supervised deep learning method for data-efficient training in genomics
Hüseyin Anil Gündüz
Martin Binder
Xiao-Yin To
René Mreches
Bernd Bischl
Alice C. McHardy
Philipp C. Münch
Mina Rezaei
[J]. Communications Biology, 6
[3] A self-supervised deep learning method for data-efficient training in genomics
Guenduez, Hueseyin Anil
Binder, Martin
To, Xiao-Yin
Mreches, Rene
Bischl, Bernd
McHardy, Alice C.
Muench, Philipp C.
Rezaei, Mina
[J]. COMMUNICATIONS BIOLOGY, 2023, 6 (01)
[4] Self-Supervised Learning With Data-Efficient Supervised Fine-Tuning for Crowd Counting
Wang, Rui
Hao, Yixue
Hu, Long
Chen, Jincai
Chen, Min
Wu, Di
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1538 - 1546
[5] A data-efficient self-supervised deep learning model for design and characterization of nanophotonic structures
Ma, Wei
Liu, Yongmin
[J]. SCIENCE CHINA-PHYSICS MECHANICS & ASTRONOMY, 2020, 63 (08)
[6] A data-efficient self-supervised deep learning model for design and characterization of nanophotonic structures
Wei Ma
Yongmin Liu
[J]. Science China Physics, Mechanics & Astronomy, 2020, 63
[7] A data-efficient self-supervised deep learning model for design and characterization of nanophotonic structures
Wei Ma
Yongmin Liu
[J]. Science China(Physics,Mechanics & Astronomy), 2020, (08) : 27 - 34
[8] Data-Efficient Masked Video Modeling for Self-supervised Action Recognition
Li, Qiankun
Huang, Xiaolong
Wan, Zhifan
Hu, Lanqing
Wu, Shuzhe
Zhang, Jie
Shan, Shiguang
Wang, Zengfu
[J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2723 - 2733
[9] Fair, Robust, and Data-Efficient Machine Learning in Healthcare
Singh, Harvineet
[J]. PROCEEDINGS OF THE 2022 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2022, 2022, : 914 - 914
[10] Primitive-contrastive network: data-efficient self-supervised learning from robot demonstration videos
Pengfei Sun
Zhile Yang
Tianren Zhang
Shangqi Guo
Feng Chen
[J]. Applied Intelligence, 2022, 52 : 4258 - 4273

← 1 2 3 4 5 →