Development of an ASR System for Medical Conversations

被引:0
|
作者
Renato, Alejandro [1 ]
Luna, Daniel [1 ]
Benitez, Sonia [1 ]
机构
[1] Hosp Italiano Buenos Aires, Dept Helth Informat, Buenos Aires, DF, Argentina
来源
关键词
Automatic speech recognition; artificial intelligence; human-computer interaction; natural language processing; telemedicine; SPEECH;
D O I
10.3233/SHTI231048
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work we document the development of an ASR system for the transcription of conversations between patient and doctor and we will point out the critical aspects of the domain. The system was trained with an acoustic base of spontaneous speech that has a domain language model and a supervised phonetic dictionary. Its performance was compared with two systems: a) NeMo End-to-End Conformers in Spanish and b) Google API ASR(2) Cloud. The evaluation was carried out on a set of 208 teleconsultations recorded during the year 2020. The WER (Word Error Rate) was evaluated in ASR, and Recall and F1 for recognized medical entities. In conclusion, the developed system performed better, reaching 72.5% accuracy in the domain of teleconsultations and an F1 for entity recognition of 0.80.
引用
收藏
页码:664 / 668
页数:5
相关论文
共 50 条
  • [1] Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System
    Sakti, Sakriani
    Kubo, Keigo
    Matsumiya, Sho
    Neubig, Graham
    Toda, Tomoki
    Nakamura, Satoshi
    Adachi, Fumihiro
    Isotani, Ryosuke
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 2639 - 2643
  • [2] Towards Understanding ASR Error Correction for Medical Conversations
    Mani, Anirudh
    Palaskar, Shruti
    Konam, Sandeep
    NATURAL LANGUAGE PROCESSING FOR MEDICAL CONVERSATIONS, 2020, : 7 - 11
  • [3] Development of the SRI/Nightingale Arabic ASR system
    Vergyri, D.
    Mandal, A.
    Wang, W.
    Stolcke, A.
    Zheng, J.
    Graciarena, M.
    Rybach, D.
    Gollan, C.
    Schlueter, R.
    Kirchhoff, K.
    Faria, A.
    Morgan, N.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1437 - 1440
  • [4] Development of Bilingual ASR System for MediaParl Corpus
    Motlicek, Petr
    Imseng, David
    Cernak, Milos
    Kim, Namhoon
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1391 - 1394
  • [5] Automatic Development of ASR System for an Under-Resourced Language
    Safarik, Radek
    Mateju, Lukas
    2018 41ST INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2018, : 100 - 103
  • [6] Conversations with Medical Education
    Jacobs, Joshua
    MEDICAL EDUCATION, 2012, 46 (04) : 342 - 342
  • [7] Tough Conversations: Development of a Curriculum for Medical Students to Lead Family Meetings
    Hagiwara, Yuya
    Ross, Jeanette
    Lee, Shuko
    Sanchez-Reilly, Sandra
    AMERICAN JOURNAL OF HOSPICE & PALLIATIVE MEDICINE, 2017, 34 (10): : 907 - 911
  • [8] The Althingi ASR System
    Helgadottir, Inga R.
    Nikulasdottir, Anna B.
    Borsky, Michal
    Fong, Judy Y.
    Kjaran, Robert
    Gudnason, Jon
    INTERSPEECH 2019, 2019, : 3013 - 3017
  • [9] An Enhanced Model for ASR in the Medical Field
    Hsu Wei-Chen
    Lin Pei-Xu
    Li Chi-Jou
    Tien Hao-Yu
    Kang Yi-Huang
    Lee Pei-Ju
    2024 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE, IRI 2024, 2024, : 43 - 48
  • [10] A French Medical Conversations Corpus Annotated for a Virtual Patient Dialogue System
    Laleye, Frejus A. A.
    de Chalendar, Gael
    Blanie, Antonia
    Brouquet, Antoine
    Behnamou, Dan
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 574 - 580