Improving Automatic Recognition of Aphasic Speech with AphasiaBank

被引:24
|
作者
Le, Duc [1 ]
Provost, Emily Mower [1 ]
机构
[1] Univ Michigan, Comp Sci & Engn, Ann Arbor, MI 48109 USA
关键词
speech recognition; acoustic modeling; aphasia; AphasiaBank; out-of-domain adaptation;
D O I
10.21437/Interspeech.2016-213
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Automatic recognition of aphasic speech is challenging due to various speech-language impairments associated with aphasia as well as a scarcity of training data appropriate for this speaker population. AphasiaBank, a shared database of multimedia interactions primarily used by clinicians to study aphasia, offers a promising source of data for Deep Neural Network acoustic modeling. In this paper, we establish the first large-vocabulary continuous speech recognition baseline on AphasiaBank and study recognition accuracy as a function of diagnoses. We investigate several out-of-domain adaptation methods and show that AphasiaBank data can be leveraged to significantly improve the recognition rate on a smaller aphasic speech corpus. This work helps broaden the understanding of aphasic speech recognition, demonstrates the potential of AphasiaBank, and guides researchers who wish to use this database for their own work.
引用
收藏
页码:2681 / 2685
页数:5
相关论文
共 50 条
  • [1] Improving Aphasic Speech Recognition by Using Novel Semi-Supervised Learning Methods on AphasiaBank for English and Spanish
    Torre, Ivan G.
    Romero, Monica
    Alvarez, Aitor
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (19):
  • [2] Automatic Speech Recognition (ASR) based Approach for Speech Therapy of Aphasic Patients: A Review
    Jamal, Norezmi
    Shanta, Shahnoor
    Mahmud, Farhanahani
    Sha'abani, M. N. A. H.
    [J]. ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING: FROM THEORY TO APPLICATIONS, 2017, 1883
  • [3] Improving Speech Synthesis by Automatic Speech Recognition and Speech Discriminator
    Huang, Li-Yu
    Chen, Chia-Ping
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2024, 40 (01) : 189 - 200
  • [4] Improving Readability for Automatic Speech Recognition Transcription
    Liao, Junwei
    Eskimez, Sefik
    Lu, Liyang
    Shi, Yu
    Gong, Ming
    Shou, Linjun
    Qu, Hong
    Zeng, Michael
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (05)
  • [5] Improving the Quality of Automatic Speech Recognition in Trucks
    Korenevsky, Maxim
    Medennikov, Ivan
    Shchemelinin, Vadim
    [J]. Speech and Computer, 2016, 9811 : 362 - 369
  • [6] Improving analysis techniques for automatic speech recognition
    O'Shaughnessy, D
    [J]. 2002 45TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL III, CONFERENCE PROCEEDINGS, 2002, : 65 - 68
  • [7] THE DEREVERBERATION OF SPEECH WITH A VIEW TO IMPROVING THE AUTOMATIC RECOGNITION OF SPEECH IN ROOMS
    HIRSCH, HG
    [J]. ACUSTICA, 1989, 67 (03): : 216 - 221
  • [8] Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech
    Song, Kaitao
    Wan, Teng
    Wang, Bixia
    Jiang, Huiqiang
    Qiu, Luna
    Xu, Jiahang
    Jiang, Liping
    Lou, Qun
    Yang, Yuqing
    Li, Dongsheng
    Wang, Xudong
    Qiu, Lili
    [J]. INTERSPEECH 2022, 2022, : 4820 - 4824
  • [9] IMPROVING AUTOMATIC SPEECH RECOGNITION ROBUSTNESS FOR THE ROMANIAN LANGUAGE
    Buzo, Andi
    Cucu, Horia
    Burileanu, Corneliu
    [J]. 19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 2119 - 2122
  • [10] Improving Automatic Emotion Recognition from Speech Signals
    Bozkurt, Elif
    Erzin, Engin
    Erdem, Cigdem Eroglu
    Erdem, A. Tanju
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 312 - +