Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers

被引:5
|
作者
Coppock, Harry [1 ,2 ]
Nicholson, George [1 ,3 ,4 ]
Kiskin, Ivan [1 ,3 ,5 ]
Koutra, Vasiliki [1 ,6 ]
Baker, Kieran [1 ,6 ]
Budd, Jobie [7 ,8 ]
Payne, Richard [9 ]
Karoune, Emma [1 ]
Hurley, David [9 ]
Titcomb, Alexander [9 ]
Egglestone, Sabrina [9 ]
Tendero Canadas, Ana [9 ,10 ]
Butler, Lorraine [9 ]
Jersakova, Radka [1 ]
Mellor, Jonathon [9 ]
Patel, Selina [9 ,11 ]
Thornley, Tracey [12 ]
Diggle, Peter [13 ]
Richardson, Sylvia [1 ]
Packham, Josef [9 ]
Schuller, Bjoern W. [1 ,2 ,14 ]
Pigoli, Davide [1 ,6 ]
Gilmour, Steven [1 ,6 ]
Roberts, Stephen [1 ,3 ]
Holmes, Chris [1 ,3 ]
机构
[1] Alan Turing Inst, London, England
[2] Imperial Coll London, London, England
[3] Univ Oxford, Oxford, England
[4] NIHR Oxford Biomed Res Ctr, Oxford, England
[5] Univ Surrey, Surrey Inst People Centred AI, Guildford, England
[6] Kings Coll London, London, England
[7] UCL, Div Med, London, England
[8] UCL, London Ctr Nanotechnol, London, England
[9] UK Hlth Secur Agcy, London, England
[10] Univ Brighton, Sch Appl Sci, Brighton, England
[11] UCL, Inst Hlth Informat, London, England
[12] Univ Nottingham, Nottingham, England
[13] Univ Lancaster, Lancaster, England
[14] Univ Augsburg, Augsburg, Germany
基金
英国经济与社会研究理事会; 英国工程与自然科学研究理事会;
关键词
AUGMENTATION; COUGH; MODEL;
D O I
10.1038/s42256-023-00773-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent work has reported that respiratory audio-trained AI classifiers can accurately predict SARS-CoV-2 infection status. However, it has not yet been determined whether such model performance is driven by latent audio biomarkers with true causal links to SARS-CoV-2 infection or by confounding effects, such as recruitment bias, present in observational studies. Here we undertake a large-scale study of audio-based AI classifiers as part of the UK government's pandemic response. We collect a dataset of audio recordings from 67,842 individuals, with linked metadata, of whom 23,514 had positive polymerase chain reaction tests for SARS-CoV-2. In an unadjusted analysis, similar to that in previous works, AI classifiers predict SARS-CoV-2 infection status with high accuracy (ROC-AUC = 0.846 [0.838-0.854]). However, after matching on measured confounders, such as self-reported symptoms, performance is much weaker (ROC-AUC = 0.619 [0.594-0.644]). Upon quantifying the utility of audio-based classifiers in practical settings, we find them to be outperformed by predictions on the basis of user-reported symptoms. We make best-practice recommendations for handling recruitment bias, and for assessing audio-based classifiers by their utility in relevant practical settings. Our work provides insights into the value of AI audio analysis and the importance of study design and treatment of confounders in AI-enabled diagnostics. AI-enabled diagnostic applications in healthcare can be powerful, but study design is very important to avoid subtle issues of bias in the dataset and evaluation. Coppock et al. demonstrate how an AI-based classifier for diagnosing SARS-Cov-2 infection from audio recordings can seem to make predictions with high accuracy but shows much lower performance after taking into account confounders, providing insights in study design and replicability in AI-based audio analysis.
引用
收藏
页码:229 / 242
页数:25
相关论文
共 38 条
  • [31] Depression and anxiety symptoms in young adults before and during the COVID-19 pandemic: evidence from a Canadian population-based cohort
    Kia Watkins-Martin
    Massimiliano Orri
    Marie-Hélène Pennestri
    Natalie Castellanos-Ryan
    Simon Larose
    Jean-Philippe Gouin
    Isabelle Ouellet-Morin
    Nicholas Chadi
    Frederick Philippe
    Michel Boivin
    Richard E. Tremblay
    Sylvana Côté
    Marie-Claude Geoffroy
    Annals of General Psychiatry, 20
  • [32] Reduction in healthcare services during the COVID-19 pandemic: Patient screening based on symptoms is an effective strategy for avoiding delayed laryngeal cancer diagnosis
    Murri, Domenico
    Botti, Cecilia
    Bassano, Edoardo
    Fornaciari, Martina
    Crocetta, Francesco Maria
    Ghidini, Angelo
    AMERICAN JOURNAL OF OTOLARYNGOLOGY, 2021, 42 (06)
  • [33] Lung cancer screening use before & during the COVID-19 pandemic in the US: Population-based evidence from 2019-2021
    Poghosyan, Hermine
    Sarkar, Sayantani
    Richman, Ilana
    CANCER RESEARCH, 2023, 83 (07)
  • [34] Effectiveness of an online short-term audio-based mindfulness program on negative emotions during the COVID-19 pandemic: Latent growth curve analyses of anxiety and moderated mediation effects of anxiety between mindfulness and negative affect
    Man Ying Kang
    Joshua K. M. Nan
    Yue Yuan
    Current Psychology, 2023, 42 : 30049 - 30061
  • [35] Effectiveness of an online short-term audio-based mindfulness program on negative emotions during the COVID-19 pandemic: Latent growth curve analyses of anxiety and moderated mediation effects of anxiety between mindfulness and negative affect
    Kang, Man Ying
    Nan, Joshua K. M.
    Yuan, Yue
    CURRENT PSYCHOLOGY, 2023, 42 (34) : 30049 - 30061
  • [36] The economic cost of implementing antigen-based rapid diagnostic tests for COVID-19 screening in high-risk transmission settings: evidence from Germany
    Alfonso Valenzuela Hurtado
    Hoa Thi Nguyen
    Viktoria Schenkel
    Jonas Wachinger
    Joachim Seybold
    Claudia M. Denkinger
    Manuela De Allegri
    Health Economics Review, 12
  • [37] The economic cost of implementing antigen-based rapid diagnostic tests for COVID-19 screening in high-risk transmission settings: evidence from Germany
    Hurtado, Alfonso Valenzuela
    Nguyen, Hoa Thi
    Schenkel, Viktoria
    Wachinger, Jonas
    Seybold, Joachim
    Denkinger, Claudia M.
    De Allegri, Manuela
    HEALTH ECONOMICS REVIEW, 2022, 12 (01)
  • [38] Research on multi-modal autonomous diagnosis algorithm of COVID-19 based on whale optimized support vector machine and improved D-S evidence fusion
    Wang, Guowei
    Guo, Shuli
    Han, Lina
    Song, Xiaowei
    Zhao, Yuanyuan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 150