Towards Understanding ASR Error Correction for Medical Conversations

Citations: 0
Authors
Mani, Anirudh [1]
Palaskar, Shruti [2]
Konam, Sandeep [1]
Affiliations
[1] Abridge AI Inc, Pittsburgh, PA 15232 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
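Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract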
Domain adaptation for Automatic Speech Recognition (ASR) error correction via machine translation is a useful technique for improving the out-of-domain outputs of pre-trained ASR systems to obtain optimal results for specific in-domain tasks. We apply this technique to our dataset of doctor-patient conversations using two off-the-shelf ASR systems: Google ASR (commercial) and the ASPIRE model (open-source). We train a sequence-to-sequence machine translation model and evaluate it on seven specific UMLS Semantic Types, including Pharmacological Substance, Sign or Symptom, and Diagnostic Procedure. Lastly, we break down, analyze, and discuss the 7% overall improvement in word error rate with respect to each Semantic Type.
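The abstract reports its result as a change in word error rate (WER). As a minimal sketch of how that metric is commonly computed (word-level edit distance divided by the number of reference words), the following may help. The function name `word_error_rate`, the whitespace tokenization, and the example sentences are illustrative assumptions and are not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: tokens are obtained by whitespace splitting; a real
    evaluation would typically normalize case and punctuation first.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# Made-up example: an ASR output that splits a drug name, and a
# hypothetical corrected version of the same utterance.
reference = "the patient takes metformin daily"
asr_output = "the patient takes met forming daily"
corrected = "the patient takes metformin daily"

wer_before = word_error_rate(reference, asr_output)
wer_after = word_error_rate(reference, corrected)
```

Comparing `wer_before` and `wer_after` over a test set, optionally restricted to utterances containing terms of one semantic type, is one way such a per-type breakdown could be approximated.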
Pages: 7-11
Page count: 5