EMG-based speech recognition using dimensionality reduction methods

被引:10
|
作者
Ratnovsky, Anat [1 ]
Malayev, Sarit [1 ,2 ]
Ratnovsky, Shahar [3 ,4 ]
Naftali, Sara [1 ]
Rabin, Neta [5 ]
机构
[1] Afeka Tel Aviv Acad Coll Engn, Sch Med Engn, Tel Aviv, Israel
[2] Tel Aviv Univ, Sch Neurosci, Tel Aviv, Israel
[3] Tel Aviv Univ, Sch Elect Engn, Tel Aviv, Israel
[4] Tel Aviv Univ, Sch Comp Sci, Tel Aviv, Israel
[5] Tel Aviv Univ, Dept Ind Engn, Tel Aviv, Israel
关键词
Electromyography; Speech recognition; Automatic speech recognition; Machine learning algorithms; Feature extraction; Principal component analysis; PERFORMANCE; SIGNALS;
D O I
10.1007/s12652-021-03315-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic speech recognition is the main form of man-machine communication. Recently, several studies have shown the ability to automatically recognize speech based on electromyography (EMG) signals of the facial muscles using machine learning methods. The objective of this study was to utilize machine learning methods for automatic identification of speech based on EMG signals. EMG signals from three facial muscles were measured from four healthy female subjects while pronouncing seven different words 50 times. Short time Fourier transform features were extracted from the EMG data. Principle component analysis (PCA) and locally linear embedding (LLE) methods were applied and compared for reducing the dimensions of the EMG data. K-nearest-neighbors was used to examine the ability to identify different word sets of a subject based on his own dataset, and to identify words of one subject based on another subject's dataset, utilizing an affine transformation for aligning between the reduced feature spaces of two subjects. The PCA and LLE achieved average recognizing rate of 81% for five words-sets in the single-subject approach. The best average recognition success rates for three and five words-sets were 88.8% and 74.6%, respectively, for the multi-subject classification approach. Both the PCA and LLE achieved satisfactory classification rates for both the single-subject and multi-subject approaches. The multi-subject classification approach enables robust classification of words recorded from a new subject based on another subject's dataset and thus can be applicable for people who have lost their ability to speak.
引用
收藏
页码:597 / 607
页数:11
相关论文
共 50 条
  • [1] EMG-based speech recognition using dimensionality reduction methods
    Anat Ratnovsky
    Sarit Malayev
    Shahar Ratnovsky
    Sara Naftali
    Neta Rabin
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 597 - 607
  • [2] SESSION-INDEPENDENT EMG-BASED SPEECH RECOGNITION
    Wand, Michael
    Schultz, Tanja
    [J]. BIOSIGNALS 2011, 2011, : 295 - 300
  • [3] An Investigation of Dimensionality Reduction Techniques for EMG-based Force Estimation
    Hajian, Gelareh
    Etemad, Ali
    Morin, Evelyn
    [J]. 2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 698 - 701
  • [4] ANALYSIS OF PHONE CONFUSION IN EMG-BASED SPEECH RECOGNITION
    Wand, Michael
    Schultz, Tanja
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 757 - 760
  • [5] Modeling coarticulation in EMG-based continuous speech recognition
    Schultz, Tanja
    Wand, Michael
    [J]. SPEECH COMMUNICATION, 2010, 52 (04) : 341 - 353
  • [6] A Spectral Mapping Method for EMG-based Recognition of Silent Speech
    Janke, Matthias
    Wand, Michael
    Schultz, Tanja
    [J]. BIO-INSPIRED HUMAN- MACHINE INTERFACES AND HEALTHCARE APPLICATIONS, 2010, : 22 - 31
  • [7] Impact of Different Feedback Mechanisms in EMG-based Speech Recognition
    Herff, Christian
    Janke, Matthias
    Wand, Michael
    Schultz, Tanja
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2224 - 2227
  • [8] Tackling Speaking Mode Varieties in EMG-Based Speech Recognition
    Wand, Michael
    Janke, Matthias
    Schultz, Tanja
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2014, 61 (10) : 2515 - 2526
  • [9] Multi-stream HMM for EMG-based speech recognition
    Manabe, H
    Zhang, Z
    [J]. PROCEEDINGS OF THE 26TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2004, 26 : 4389 - 4392
  • [10] Impact of Different Speaking Modes on EMG-based Speech Recognition
    Wand, Michael
    Jou, Szu-Chen Stan
    Toth, Arthur R.
    Schultz, Tanja
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 640 - +