Multiexpert automatic speech recognition using acoustic and myoelectric signals

被引:22
|
作者
Chan, ADC [1 ]
Englehart, KB
Hudgins, B
Lovely, DF
机构
[1] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON K1S 5B6, Canada
[2] Univ New Brunswick, Dept Elect & Comp Engn, Fredericton, NB E3B 5A3, Canada
[3] Univ New Brunswick, Inst Biomed Engn, Fredericton, NB E3B 5A3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
classification; electromyography; multiexpert system; myoelectric signals; speech recognition;
D O I
10.1109/TBME.2006.870224
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Classification accuracy of conventional automatic speech recognition (ASR) systems can decrease dramatically under acoustically noisy conditions. To improve classification accuracy and increase system robustness a multiexpert ASR system is implemented. In this system, acoustic speech information is supplemented with information from facial myoelectric signals (MES). A new method of combining experts, known as the plausibility method, is employed to combine an acoustic ASR expert and a MES ASR expert. The plausibility method of combining multiple experts, which is based on the mathematical framework of evidence theory, is compared to the Borda count and score-based methods of combination. Acoustic and facial MES data were collected from 5 subjects, using a 10-word vocabulary across an 18-dB range of acoustic noise. As expected the performance of an acoustic expert decreases with increasing acoustic noise; classification accuracies of the acoustic ASR expert are as low as 11.5%. The effect of noise is significantly reduced with the addition of the MES ASR expert. Classification accuracies remain above 78.8% across the 18-dB range of acoustic noise, when the plausibility method is used to combine the opinions of multiple experts. In addition, the plausibility method produced classification accuracies higher than any individual expert at all noise levels, as well as the highest classification accuracies, except at the 9-dB noise level. Using the Borda count and score-based multiexpert systems, classification accuracies are improved relative to the acoustic ASR expert but are as low as 51.5% and 59.5%, respectively.
引用
收藏
页码:676 / 685
页数:10
相关论文
共 50 条
  • [1] A multi-expert speech recognition system using acoustic and myoelectric signals
    Chan, ADC
    Englehart, K
    Hudgins, B
    Lovely, DF
    [J]. SECOND JOINT EMBS-BMES CONFERENCE 2002, VOLS 1-3, CONFERENCE PROCEEDINGS: BIOENGINEERING - INTEGRATIVE METHODOLOGIES, NEW TECHNOLOGIES, 2002, : 72 - 73
  • [2] ON THE PATH TO THE AUTOMATIC RECOGNITION OF ACOUSTIC SPEECH SIGNALS
    UNTERBERGER
    [J]. ANGEWANDTE INFORMATIK, 1982, (09): : 445 - 450
  • [3] Approximated mutual information training for speech recognition using myoelectric signals
    Guo, Hua J.
    Chan, A. D. C.
    [J]. 2006 28TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-15, 2006, : 96 - 99
  • [4] Disordered Speech Recognition Using Acoustic and sEMG Signals
    Deng, Yunbin
    Patel, Rupal
    Heaton, James T.
    Colby, Glen
    Gilmore, L. Donald
    Cabrera, Joao
    Roy, Serge H.
    De Luca, Carlo J.
    Meltzner, Geoffrey S.
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 632 - +
  • [5] Automatic speech recognition using acoustic doppler signal
    Lee, Ki-Seung
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2016, 35 (01): : 74 - 82
  • [6] Acoustic Analysis for Automatic Speech Recognition
    O'Shaughnessy, Douglas
    [J]. PROCEEDINGS OF THE IEEE, 2013, 101 (05) : 1038 - 1053
  • [7] Using Automatic Speech Recognition to Measure the Intelligibility of Speech Synthesized from Brain Signals
    Varshney, Suvi
    Farias, Dana
    Brandman, David M.
    Stavisky, Sergey D.
    Miller, Lee M.
    [J]. 2023 11TH INTERNATIONAL IEEE/EMBS CONFERENCE ON NEURAL ENGINEERING, NER, 2023,
  • [8] Using Privacy-Transformed Speech in the Automatic Speech Recognition Acoustic Model Training
    Salimbajevs, Askars
    [J]. HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE (HLT 2020), 2020, 328 : 47 - 54
  • [9] Prediction of Acoustic Feature Parameters Using Myoelectric Signals
    Lee, Ki-Seung
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2010, 57 (07) : 1587 - 1595
  • [10] Automatic Recognition of Spontaneous Emotions in Speech Using Acoustic and Lexical Features
    Truong, Khict P.
    Raaijmakers, Stephan
    [J]. MACHINE LEARNING FOR MULTIMODAL INTERACTION, PROCEEDINGS, 2008, 5237 : 161 - +