Using speech rhythm knowledge to improve dysarthric speech recognition

被引:0
|
作者
S.-A. Selouani
H. Dahmani
R. Amami
H. Hamam
机构
[1] Université de Moncton,INRS
[2] Université du Québec,EMT
[3] École ESPRIT,undefined
[4] Université de Moncton,undefined
关键词
Dysarthria; Speech recognition; Severity level assessment; Neural networks; Hybrid systems; Rhythm metrics; Posterior distributions; Nemours database;
D O I
10.1007/s10772-011-9104-6
中图分类号
学科分类号
摘要
We introduce a new framework to improve the dysarthric speech recognition by using the rhythm knowledge. This approach builds speaker-dependent (SD) recognizers with respect to the dysarthria severity level of each speaker. This severity level is determined by a hybrid classifier combining class posterior distributions and a hierarchical structure of multilayer perceptrons. To perform this classification, rhythm-based features are used as input parameters since the preliminary evidence from perceptual experiments shows that rhythm troubles may be the common characteristic of various types of dysarthria. Then, a speaker-dependent dysarthric speech recognition is performed by using Hidden Markov Models (HMMs). The Nemours database of American dysarthric speakers is used throughout experiments. Results show the relevance of rhythm metrics and the effectiveness of the proposed framework to improve the performance of dysarthric speech recognition.
引用
收藏
页码:57 / 64
页数:7
相关论文
共 50 条
  • [1] Using speech rhythm knowledge to improve dysarthric speech recognition
    Selouani, S. -A.
    Dahmani, H.
    Amami, R.
    Hamam, H.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (01) : 57 - 64
  • [2] Articulatory Knowledge in the Recognition of Dysarthric Speech
    Rudzicz, Frank
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 947 - 960
  • [3] SYNTHESIZING DYSARTHRIC SPEECH USING MULTI-SPEAKER TTS FOR DYSARTHRIC SPEECH RECOGNITION
    Soleymanpour, Mohammad
    Johnson, Michael T.
    Soleymanpour, Rahim
    Berry, Jeffrey
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7382 - 7386
  • [4] Data Augmentation using Healthy Speech for Dysarthric Speech Recognition
    Vachhani, Bhavik
    Bhat, Chitralekha
    Kopparapu, Sunil Kumar
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 471 - 475
  • [5] Modelling Confusion Matrices to Improve Speech Recognition Accuracy, with an Application to Dysarthric Speech
    Morales, Omar Caballero
    Cox, Stephen
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 277 - 280
  • [6] A Survey of Automatic Speech Recognition for Dysarthric Speech
    Qian, Zhaopeng
    Xiao, Kejing
    [J]. ELECTRONICS, 2023, 12 (20)
  • [7] Using articulatory likelihoods in the recognition of dysarthric speech
    Rudzicz, Frank
    [J]. SPEECH COMMUNICATION, 2012, 54 (03) : 430 - 444
  • [8] Optimization of dysarthric speech recognition
    Chen, FX
    Kostov, A
    [J]. PROCEEDINGS OF THE 19TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOL 19, PTS 1-6: MAGNIFICENT MILESTONES AND EMERGING OPPORTUNITIES IN MEDICAL ENGINEERING, 1997, 19 : 1436 - 1439
  • [9] Dysarthric Speech Transformer: A Sequence-to-Sequence Dysarthric Speech Recognition System
    Shahamiri, Seyed Reza
    Lal, Vanshika
    Shah, Dhvani
    [J]. IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 : 3407 - 3416
  • [10] The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition
    Prananta, Luke
    Halpern, Bence Mark
    Feng, Siyuan
    Scharenborg, Odette
    [J]. INTERSPEECH 2022, 2022, : 36 - 40