Raspberry Pi-based robust speech command recognition for normal and hearing-impaired (HI)

被引:0
|
作者
A. Revathi
N. Sasikaladevi
D. Arunprasanth
N. Raju
机构
[1] SASTRA Deemed University,School of Electrical & Electronics Engineering
[2] SASTRA Deemed University,School of Computing
[3] Thanjavur Medical College and Hospital,undefined
来源
关键词
Speech command recognition; Hearing-impaired; Spectrogram; CNN; Machine learning techniques; Raspberry Pi hardware;
D O I
暂无
中图分类号
学科分类号
摘要
The speech command identification system has become a necessary tool to transcribe speech into text, for performing hands-free control of devices and hazardous processes, etc. It also finds applications in searching the contents online over voice and speech-to-text conversion for differently-abled persons. This work includes the extraction of the spectrogram from speech signals, applying 80% of the features to the 2D convolutional neural network (CNN) layered architecture, and creating CNN group models.CNN models are used to test features to recognize the words uttered by normal and Hearing-impaired (HI). The system's performance is assessed based on the recognition rate for spectrogram, Melspectrogram and Gammatonegram features and CNN. In addition, the speech intelligibility of HI speeches is enhanced using the phase spectrum compensation (PSC) technique. Decision-level fusion of spectrogram features for regular speech recognition, HI speech recognition without PSC and HI speech recognition with PSC have provided an accuracy of 95%, 98% and 99%, respectively. Twenty isolated words are considered for regular speech command recognition, and ten isolated digits are regarded for a HI speech recognition system. This automated speech command recognition is implemented in real-time using Raspberry Pi hardware, and the validation error for the test data is 0.57692%.
引用
收藏
页码:51589 / 51613
页数:24
相关论文
共 50 条
  • [1] Raspberry Pi-based robust speech command recognition for normal and hearing-impaired (HI)
    Revathi, A.
    Sasikaladevi, N.
    Arunprasanth, D.
    Raju, N.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (17) : 51589 - 51613
  • [2] SPEECH RECOGNITION AND THE ARTICULATION INDEX FOR NORMAL AND HEARING-IMPAIRED LISTENERS
    KAMM, CA
    DIRKS, DD
    BELL, TS
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1985, 77 (01): : 281 - 288
  • [3] Formant transition duration and speech recognition in normal and hearing-impaired listeners
    Turner, CW
    Smith, SJ
    Aldridge, PL
    Stewart, SL
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 101 (05): : 2822 - 2825
  • [4] Cued Speech Recognition for Augmentative Communication in Normal-hearing and Hearing-impaired Subjects
    Heracleous, Panikos
    Beautemps, Denis
    Abboutabit, Noureddine
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1419 - 1422
  • [5] Use of temporal envelope cues in speech recognition by normal and hearing-impaired listeners
    1600, American Inst of Physics, Woodbury, NY, USA (97):
  • [6] USE OF TEMPORAL ENVELOPE CUES IN SPEECH RECOGNITION BY NORMAL AND HEARING-IMPAIRED LISTENERS
    TURNER, CW
    SOUZA, PE
    FORGET, LN
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1995, 97 (04): : 2568 - 2576
  • [7] APPLICATION OF THE ARTICULATION INDEX AND THE SPEECH TRANSMISSION INDEX TO THE RECOGNITION OF SPEECH BY NORMAL-HEARING AND HEARING-IMPAIRED LISTENERS
    HUMES, LE
    DIRKS, DD
    BELL, TS
    AHLSTROM, C
    KINCAID, GE
    JOURNAL OF SPEECH AND HEARING RESEARCH, 1986, 29 (04): : 447 - 462
  • [8] A model of speech recognition for hearing-impaired listeners based on deep learning
    Rossbach, Jana
    Kollmeier, Birger
    Meyer, Bernd T.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 151 (03): : 1417 - 1427
  • [9] RECOGNITION OF SYNTHETIC SPEECH BY HEARING-IMPAIRED ELDERLY LISTENERS
    HUMES, LE
    NELSON, KJ
    PISONI, DB
    JOURNAL OF SPEECH AND HEARING RESEARCH, 1991, 34 (05): : 1180 - 1184
  • [10] INTELLIGIBILITY OF SYNTHETIC SPEECH FOR NORMAL-HEARING AND HEARING-IMPAIRED LISTENERS
    KANGAS, KA
    ALLEN, GD
    JOURNAL OF SPEECH AND HEARING DISORDERS, 1990, 55 (04): : 751 - 755