Understanding Medical Conversations: Rich Transcription, Confidence Scores & Information Extraction

被引:0
|
作者
Soltau, Hagen [1 ]
Wang, Mingqiu [1 ]
Shafran, Izhak [1 ]
El Shafey, Laurent [1 ]
机构
[1] Google, Mountain View, CA 94043 USA
来源
INTERSPEECH 2021 | 2021年
关键词
SPEECH RECOGNITION;
D O I
暂无
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
In this paper, we describe novel components for extracting clinically relevant information from medical conversations which will be available as Google APIs. We describe a transformerbased Recurrent Neural Network Transducer (RNN-T) model tailored for long-form audio, which can produce rich transcriptions including speaker segmentation, speaker role labeling, punctuation and capitalization. On a representative test set, we compare performance of RNN-T models with different encoders, units and streaming constraints. Our transformer-based streaming model performs at about 20% WER on the ASR task, 6% WDER on the diarization task, 43% SER on periods, 52% SER on commas, 43% SER on question marks and 30% SER on capitalization. Our recognizer is paired with a confidence model that utilizes both acoustic and lexical features from the recognizer. The model performs at about 0.37 NCE. Finally, we describe a RNN-T based tagging model. The performance of the model depends on the ontologies, with F-scores of 0.90 for medications, 0.76 for symptoms, 0.75 for conditions, 0.76 for diagnosis, and 0.61 for treatments. While there is still room for improvement, our results suggest that these models are sufficiently accurate for practical applications.
引用
收藏
页码:4418 / 4422
页数:5
相关论文
共 50 条
  • [21] Chemical documents: machine understanding and automated information extraction
    Townsend, JA
    Adams, SE
    Waudby, CA
    de Souza, VK
    Goodman, JM
    Murray-Rust, P
    ORGANIC & BIOMOLECULAR CHEMISTRY, 2004, 2 (22) : 3294 - 3300
  • [22] Information Extraction Tools and Methods for Understanding Dialogue in a Companion
    Catizone, R.
    Dingli, A.
    Pinto, H.
    Wilks, Y.
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 3250 - 3254
  • [23] Query Expansion Using Medical Information Extraction for Improving Information Retrieval in French Medical Domain
    Ghoulam, Aicha
    Barigou, Fatiha
    Belalem, Ghalem
    Meziane, Farid
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2018, 14 (03) : 1 - 17
  • [24] Identifying relevant information in medical conversations to summarize a clinician-patient encounter
    Quiroz, Juan C.
    Laranjo, Liliana
    Kocaballi, Ahmet Baki
    Briatore, Agustina
    Berkovsky, Shlomo
    Rezazadegan, Dana
    Coiera, Enrico
    HEALTH INFORMATICS JOURNAL, 2020, 26 (04) : 2906 - 2914
  • [25] Differences in Information Seeking and Confidence Among Medical Students in Diagnostic Decisions
    Aiyer, Sriraj
    MEDICAL DECISION MAKING, 2024, 44 (02) : NP72 - NP73
  • [26] Impact of performance and information feedback on medical interns' confidence–accuracy calibration
    J. Staal
    K. Katarya
    M. Speelman
    R. Brand
    J. Alsma
    J. Sloane
    W. W. Van den Broek
    L. Zwaan
    Advances in Health Sciences Education, 2024, 29 : 129 - 145
  • [27] Information Extraction of Medical Materials: An Overview of the Track of Medical Materials MedOCR
    Liu, Lifeng
    Chang, Dejie
    Zhao, Xiaolong
    Guo, Longjie
    Chen, Mosha
    Tang, Buzhou
    HEALTH INFORMATION PROCESSING. EVALUATION TRACK PAPERS, 2023, 1773 : 137 - 142
  • [28] Automatic Key Information Extraction from Visually Rich Documents
    De Trogoff, Charles
    Hantach, Rim
    Lechuga, Gisela
    Calvez, Philippe
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 89 - 96
  • [29] Key Information Extraction and Recognition from Rich Text Images
    Do, Tien
    Doan, Thuyen Tran
    Le, Khiem
    Nguyen, Thua
    Le, Duy-Dinh
    Ngo, Thanh Duc
    VIETNAM JOURNAL OF COMPUTER SCIENCE, 2024, 11 (04) : 569 - 594
  • [30] An Advantage Actor-Critic Algorithm with Confidence Exploration for Open Information Extraction
    Liu, Guiliang
    Li, Xu
    Sun, Miningming
    Li, Ping
    PROCEEDINGS OF THE 2020 SIAM INTERNATIONAL CONFERENCE ON DATA MINING (SDM), 2020, : 217 - 225