Scoring sleep with artificial intelligence enables quantification of sleep stage ambiguity: hypnodensity based on multiple expert scorers and auto-scoring

被引:25
|
作者
Bakker, Jessie P. [1 ]
Ross, Marco [2 ]
Cerny, Andreas [2 ]
Vasko, Ray [1 ]
Shaw, Edmund [1 ]
Kuna, Samuel [3 ,4 ]
Magalang, Ulysses J. [5 ]
Punjabi, Naresh M. [6 ]
Anderer, Peter [2 ]
机构
[1] Philips Sleep & Resp Care, Pittsburgh, PA USA
[2] Philips Sleep & Resp Care, Vienna, Austria
[3] Univ Penn, Perelman Sch Med, Philadelphia, PA 19104 USA
[4] Corporal Michael J Crescenz Vet Affairs Med Ctr, Philadelphia, PA USA
[5] Ohio State Univ, Wexner Med Ctr, Div Pulm Crit Care & Sleep Med, Columbus, OH 43210 USA
[6] Univ Miami, Div Pulm Crit Care & Sleep Med, Miami, FL USA
关键词
sleep stages; polysomnography; artificial intelligence; machine learning; validation; hypnodensity; AMERICAN ACADEMY; AUTOMATIC-ANALYSIS; EEG SIGNALS; RELIABILITY; CLASSIFICATION; POLYSOMNOGRAMS; VALIDATION; AGREEMENT; RECHTSCHAFFEN; POPULATION;
D O I
10.1093/sleep/zsac154
中图分类号
R74 [神经病学与精神病学];
学科分类号
摘要
Study Objectives To quantify the amount of sleep stage ambiguity across expert scorers and to validate a new auto-scoring platform against sleep staging performed by multiple scorers. Methods We applied a new auto-scoring system to three datasets containing 95 PSGs scored by 6-12 scorers, to compare sleep stage probabilities (hypnodensity; i.e. the probability of each sleep stage being assigned to a given epoch) as the primary output, as well as a single sleep stage per epoch assigned by hierarchical majority rule. Results The percentage of epochs with 100% agreement across scorers was 46 +/- 9%, 38 +/- 10% and 32 +/- 9% for the datasets with 6, 9, and 12 scorers, respectively. The mean intra-class correlation coefficient between sleep stage probabilities from auto- and manual-scoring was 0.91, representing excellent reliability. Within each dataset, agreement between auto-scoring and consensus manual-scoring was significantly higher than agreement between manual-scoring and consensus manual-scoring (0.78 vs. 0.69; 0.74 vs. 0.67; and 0.75 vs. 0.67; all p < 0.01). Conclusions Analysis of scoring performed by multiple scorers reveals that sleep stage ambiguity is the rule rather than the exception. Probabilities of the sleep stages determined by artificial intelligence auto-scoring provide an excellent estimate of this ambiguity. Compared to consensus manual-scoring, sleep staging derived from auto-scoring is for each individual PSG noninferior to manual-scoring meaning that auto-scoring output is ready for interpretation without the need for manual adjustment.
引用
收藏
页数:12
相关论文
共 44 条
  • [1] Scoring sleep with artificial intelligence enables quantification of sleep stage ambiguity
    Anderer, P.
    Ross, M.
    Cerny, A.
    Moreau, A.
    [J]. JOURNAL OF SLEEP RESEARCH, 2018, 27
  • [2] THE HYPNODENSITY GRAPH: A NEW REPRESENTATION OF SLEEP SCORING BASED ON MULTIPLE MANUAL EXPERT SCORINGS AND ESTIMATED BY ARTIFICIAL INTELLIGENCE
    Anderer, P.
    Ross, M.
    Cerny, A.
    Moreau, A.
    [J]. SLEEP MEDICINE, 2019, 64 : S15 - S16
  • [3] ASSESSING COMPETENCY OF Z3SCORE AUTOMATED SLEEP STAGE SCORING SYSTEM WITH MANUAL SLEEP STAGE SCORING BY MULTIPLE SCORERS
    Tay, J. Y.
    Toh, S. T.
    Leow, L. C.
    Senin, S. R.
    [J]. SLEEP MEDICINE, 2017, 40 : E326 - E326
  • [4] SCORING OF SLEEP ACCORDING TO AASM: ARTIFICIAL INTELLIGENCE VERSUS MULTIPLE HUMAN EXPERT SCORINGS
    Anderer, P.
    Moreau, A.
    Ross, M.
    Cerny, A.
    Thusoo, S.
    [J]. SLEEP, 2018, 41 : A118 - A118
  • [5] VALIDATION STUDIES FOR SCORING POLYSOMNOGRAMS AND HOME SLEEP APNEA TESTS WITH ARTIFICIAL INTELLIGENCE: SLEEP STAGE PROBABILITIES (HYPNODENSITY) DERIVED FROM NEUROLOGICAL OR CARDIORESPIRATORY SIGNALS
    Anderer, Peter
    Ross, Marco
    Cerny, Andreas
    Fonseca, Pedro
    Shaw, Edmund
    Bakker, Jessie
    [J]. SLEEP, 2022, 45 : A319 - A319
  • [6] COMPUTER SLEEP STAGE SCORING - AN EXPERT SYSTEM APPROACH
    RAY, SR
    LEE, WD
    MORGAN, CD
    AIRTHKINDREE, W
    [J]. INTERNATIONAL JOURNAL OF BIO-MEDICAL COMPUTING, 1986, 19 (01): : 43 - 61
  • [7] Validation of a New Auto-Scoring Algorithm to Estimate Sleep Staging Using Cardiorespiratory Signals
    Bakker, J. P.
    Ross, M.
    Vasko, R.
    Cerny, A.
    Fonseca, P.
    Jasko, J.
    Shaw, E.
    White, D. P.
    Anderer, P.
    [J]. AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2021, 203 (09)
  • [8] Interrater sleep stage scoring reliability between manual scoring from two European sleep centers and automatic scoring performed by the artificial intelligence-based Stanford-STAGES algorithm
    Cesari, Matteo
    Stefani, Ambra
    Penzel, Thomas
    Ibrahim, Abubaker
    Hackner, Heinz
    Heidbreder, Anna
    Szentkiralyi, Andras
    Stubbe, Beate
    Voelzke, Henry
    Berger, Klaus
    Hoegl, Birgit
    [J]. JOURNAL OF CLINICAL SLEEP MEDICINE, 2021, 17 (06): : 1237 - 1247
  • [9] Validation of a New Auto-Scoring Algorithm Against Human Scoring of Respiratory Events, Arousals, Periodic Limb Movements, and Sleep Staging
    Bakker, J. P.
    Ross, M.
    Vasko, R.
    Cerny, A.
    Jasko, J.
    Shaw, E.
    White, D. P.
    Anderer, P.
    [J]. AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2021, 203 (09)
  • [10] Enhanced sleep staging with artificial intelligence: a validation study of new software for sleep scoring
    Grassi, Massimiliano
    Dacco, Silvia
    Caldirola, Daniela
    Perna, Giampaolo
    Schruers, Koen
    Defillo, Archie
    [J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2023, 6