Speech recognition for medical conversations

被引:0
|
作者
Chiu, Chung-Cheng [1 ]
Tripathi, Anshuman
Chou, Katherine
Co, Chris
Jaitly, Navdeep
Jaunzeikare, Diana
Kannan, Anjuli
Nguyen, Patrick
Sak, Hasim
Sankar, Ananth [1 ,2 ]
Tansuwan, Justin
Wan, Nathan [1 ]
Wu, Yonghui
Zhang, Xuedong [1 ]
机构
[1] Google, Mountain View, CA USA
[2] LinkedIn, Mountain View, CA USA
来源
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES | 2018年
关键词
medical transcription; conversational transcription; end-to-end attention models; CTC;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we document our experiences with developing speech recognition for medical transcription a system that automatically transcribes doctor-patient conversations. Towards this goal, we built a system along two different methodological lines a Connectionist Temporal Classification (CTC) phoneme based model and a Listen Attend and Spell (LAS) grapheme based model. To train these models we used a corpus of anonymized conversations representing approximately 14,000 hours of speech. Because of noisy transcripts and alignments in the corpus, a significant amount of effort was invested in data cleaning issues. We describe a two-stage strategy we followed for segmenting the data. The data cleanup and development of a matched language model was essential to the success of the CTC based models. The LAS based models, however were found to be resilient to alignment and transcript noise and did not require the use of language models. CTC models were able to achieve a word error rate of 20.1%, and the LAS models were able to achieve 18.3%. Our analysis shows that both models perform well on important medical utterances and therefore can be practical for transcribing medical conversations.
引用
收藏
页码:2972 / 2976
页数:5
相关论文
共 50 条
  • [31] Robust Speech Recognition in the presence of noise using medical data
    Athanaselis, Theologos
    Bakamidis, Stelios
    Giannopoulos, George
    Dologlou, Ioannis
    Fotinea, Evita
    2008 IEEE INTERNATIONAL WORKSHOP ON IMAGING SYSTEMS AND TECHNIQUES, 2008, : 347 - 350
  • [32] Speech recognition shortens the recording time of prehospital medical documentation
    Shimazui, Takashi
    Nakada, Taka-aki
    Kuroiwa, Shingo
    Toyama, Yuki
    Oda, Shigeto
    AMERICAN JOURNAL OF EMERGENCY MEDICINE, 2021, 49 : 414 - 416
  • [33] Computer-based speech recognition as a replacement for medical transcription
    Rosenthal, DI
    Chew, FS
    Dupuy, DE
    Kattapuram, SV
    Palmer, WE
    Yap, RM
    Levine, LA
    AMERICAN JOURNAL OF ROENTGENOLOGY, 1998, 170 (01) : 23 - 25
  • [34] The long-term adoption of speech recognition in medical applications
    Grasso, MA
    CBMS 2003: 16TH IEEE SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, PROCEEDINGS, 2003, : 257 - 262
  • [35] Computer-based speech recognition as an alternative to medical transcription
    Borowitz, SM
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2001, 8 (01) : 101 - 102
  • [36] Computer-Based Speech Recognition as a Replacement for Medical Transcription
    Stephen M Borowitz
    Pediatric Research, 1999, 45 : 120 - 120
  • [37] Conversations with Medical Education
    Jacobs, Joshua
    MEDICAL EDUCATION, 2012, 46 (04) : 342 - 342
  • [38] Unnatural conversations in unnatural conversations: speech reporting in the discourse of spiritual mediumship
    Wales, Katie
    LANGUAGE AND LITERATURE, 2009, 18 (04) : 347 - 356
  • [39] The maintenance of clear speech in naturalistic conversations
    Lee, Dae-Yong
    Baese-Berk, Melissa M.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2020, 147 (05): : 3702 - 3711
  • [40] Emotional Climate Recognition in Conversations using Peers' Speech-based Bispectral Features and Affect Dynamics
    Alhussein, Ghada
    Alkhodari, Mohanad
    Lamprou, Charalampos
    Ziogas, Ioannis
    Ganiti-Roumeliotou, Efstratia
    Khandoker, Ahsan
    Hadjileontiadis, Leontios J.
    2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,