Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers

被引:5
|
作者
Coppock, Harry [1 ,2 ]
Nicholson, George [1 ,3 ,4 ]
Kiskin, Ivan [1 ,3 ,5 ]
Koutra, Vasiliki [1 ,6 ]
Baker, Kieran [1 ,6 ]
Budd, Jobie [7 ,8 ]
Payne, Richard [9 ]
Karoune, Emma [1 ]
Hurley, David [9 ]
Titcomb, Alexander [9 ]
Egglestone, Sabrina [9 ]
Tendero Canadas, Ana [9 ,10 ]
Butler, Lorraine [9 ]
Jersakova, Radka [1 ]
Mellor, Jonathon [9 ]
Patel, Selina [9 ,11 ]
Thornley, Tracey [12 ]
Diggle, Peter [13 ]
Richardson, Sylvia [1 ]
Packham, Josef [9 ]
Schuller, Bjoern W. [1 ,2 ,14 ]
Pigoli, Davide [1 ,6 ]
Gilmour, Steven [1 ,6 ]
Roberts, Stephen [1 ,3 ]
Holmes, Chris [1 ,3 ]
机构
[1] Alan Turing Inst, London, England
[2] Imperial Coll London, London, England
[3] Univ Oxford, Oxford, England
[4] NIHR Oxford Biomed Res Ctr, Oxford, England
[5] Univ Surrey, Surrey Inst People Centred AI, Guildford, England
[6] Kings Coll London, London, England
[7] UCL, Div Med, London, England
[8] UCL, London Ctr Nanotechnol, London, England
[9] UK Hlth Secur Agcy, London, England
[10] Univ Brighton, Sch Appl Sci, Brighton, England
[11] UCL, Inst Hlth Informat, London, England
[12] Univ Nottingham, Nottingham, England
[13] Univ Lancaster, Lancaster, England
[14] Univ Augsburg, Augsburg, Germany
基金
英国经济与社会研究理事会; 英国工程与自然科学研究理事会;
关键词
AUGMENTATION; COUGH; MODEL;
D O I
10.1038/s42256-023-00773-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent work has reported that respiratory audio-trained AI classifiers can accurately predict SARS-CoV-2 infection status. However, it has not yet been determined whether such model performance is driven by latent audio biomarkers with true causal links to SARS-CoV-2 infection or by confounding effects, such as recruitment bias, present in observational studies. Here we undertake a large-scale study of audio-based AI classifiers as part of the UK government's pandemic response. We collect a dataset of audio recordings from 67,842 individuals, with linked metadata, of whom 23,514 had positive polymerase chain reaction tests for SARS-CoV-2. In an unadjusted analysis, similar to that in previous works, AI classifiers predict SARS-CoV-2 infection status with high accuracy (ROC-AUC = 0.846 [0.838-0.854]). However, after matching on measured confounders, such as self-reported symptoms, performance is much weaker (ROC-AUC = 0.619 [0.594-0.644]). Upon quantifying the utility of audio-based classifiers in practical settings, we find them to be outperformed by predictions on the basis of user-reported symptoms. We make best-practice recommendations for handling recruitment bias, and for assessing audio-based classifiers by their utility in relevant practical settings. Our work provides insights into the value of AI audio analysis and the importance of study design and treatment of confounders in AI-enabled diagnostics. AI-enabled diagnostic applications in healthcare can be powerful, but study design is very important to avoid subtle issues of bias in the dataset and evaluation. Coppock et al. demonstrate how an AI-based classifier for diagnosing SARS-Cov-2 infection from audio recordings can seem to make predictions with high accuracy but shows much lower performance after taking into account confounders, providing insights in study design and replicability in AI-based audio analysis.
引用
收藏
页码:229 / 242
页数:25
相关论文
共 38 条
  • [21] The Show Must Go On: Evidence-Based Neuroanaesthesia Practices in a Tertiary Care Hospital in India During the COVID-19 Pandemic
    Bajaj, Jhanvi
    TURKISH JOURNAL OF ANAESTHESIOLOGY AND REANIMATION, 2022, 50 : S76 - S78
  • [22] Rapid COVID-19 Screening Based on Self-Reported Symptoms: Psychometric Assessment and Validation of the EPICOVID19 Short Diagnostic Scale
    Bastiani, Luca
    Fortunato, Loredana
    Pieroni, Stefania
    Bianchi, Fabrizio
    Adorni, Fulvio
    Prinelli, Federica
    Giacomelli, Andrea
    Pagani, Gabriele
    Maggi, Stefania
    Trevisan, Caterina
    Noale, Marianna
    Jesuthasan, Nithiya
    Sojic, Aleksandra
    Pettenati, Carla
    Andreoni, Massimo
    Incalzi, Raffaele Antonelli
    Galli, Massimo
    Molinaro, Sabrina
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (01)
  • [23] Evidence-based traditional Siddha formulations for prophylaxis and management of respiratory symptoms in COVID-19 pandemic-a review
    Prakash, P.
    Meena, R.
    Abraham, L. Stanley
    Sunkar, Swetha
    Govindaraju, K.
    Pully, Durgasruthi
    Samrot, Antony, V
    BIOCATALYSIS AND AGRICULTURAL BIOTECHNOLOGY, 2021, 35
  • [24] Population-Based COVID-19 Screening in Mexico: Assessment of Symptoms and Their Weighting in Predicting SARS-CoV-2 Infection
    Martinez-Fierro, Margarita L.
    Diaz-Lozano, Martha
    Alvarez-Zuniga, Claudia
    Ramirez-Hernandez, Leticia A.
    Araujo-Espino, Roxana
    Trejo-Ortiz, Perla M.
    Mollinedo-Montano, Fabiana E.
    Ortiz-Castro, Yolanda
    Vazquez-Reyes, Sodel
    Velasco-Elizondo, Perla
    Garcia-Esquivel, Lidia
    Araujo-Conejo, Arturo
    Garza-Veloz, Idalia
    MEDICINA-LITHUANIA, 2021, 57 (04):
  • [25] How has the COVID-19 pandemic improved evidence-based-medicine awareness among undergraduate medical students?
    Fourtassi, Maryam
    Hjiej, Ghita
    Touissi, Youness
    Hajjioui, Abderrazak
    Abda, Naima
    MEDICAL EDUCATION ONLINE, 2020, 25 (01):
  • [26] Improved prediction of new COVID-19 cases using a simple vector autoregressive model: evidence from seven New York state counties
    Kitaoka, Takayoshi
    Takahashi, Harutaka
    BIOLOGY METHODS & PROTOCOLS, 2023, 8 (01):
  • [27] AI Promoted Virtual Screening, Structure-Based Hit Optimization, and Synthesis of Novel COVID-19 S-RBD Domain Inhibitors
    Gkekas, Ioannis
    Katsamakas, Sotirios
    Mylonas, Stelios
    Fotopoulou, Theano
    Magoulas, George E.
    Tenchiu, Alia Cristina
    Dimitriou, Marios
    Axenopoulos, Apostolos
    Rossopoulou, Nafsika
    Kostova, Simona
    Wanker, Erich E.
    Katsila, Theodora
    Papahatjis, Demetris
    Gorgoulis, Vassilis G.
    Koufaki, Maria
    Karakasiliotis, Ioannis
    Calogeropoulou, Theodora
    Daras, Petros
    Petrakis, Spyros
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2024, 64 (22) : 8562 - 8585
  • [28] Impact of covid-19 pandemic over depressive symptoms among mothers from a population-based birth cohort in southern brazil
    Santos, Ina S.
    Tovo-Rodrigues, Luciana
    Maruyama, Jessica M.
    Barros, Aluisio J. D.
    Bierhals, Isabel
    Del Ponte, Bianca
    Matijasevich, Alicia
    ARCHIVES OF WOMENS MENTAL HEALTH, 2023, 26 (04) : 513 - 521
  • [29] Impact of covid-19 pandemic over depressive symptoms among mothers from a population-based birth cohort in southern brazil
    Iná S. Santos
    Luciana Tovo-Rodrigues
    Jessica M. Maruyama
    Aluísio J. D. Barros
    Isabel Bierhals
    Bianca Del Ponte
    Alicia Matijasevich
    Archives of Women's Mental Health, 2023, 26 : 513 - 521
  • [30] Depression and anxiety symptoms in young adults before and during the COVID-19 pandemic: evidence from a Canadian population-based cohort
    Watkins-Martin, Kia
    Orri, Massimiliano
    Pennestri, Marie-Helene
    Castellanos-Ryan, Natalie
    Larose, Simon
    Gouin, Jean-Philippe
    Ouellet-Morin, Isabelle
    Chadi, Nicholas
    Philippe, Frederick
    Boivin, Michel
    Tremblay, Richard E.
    Cote, Sylvana
    Geoffroy, Marie-Claude
    ANNALS OF GENERAL PSYCHIATRY, 2021, 20 (01)