Non-Speech Sounds Classification for People with Hearing Disabilities

被引:0
|
作者
Lozano, H. [1 ]
Hernaez, I.
Navas, E.
Gonzalez, F. J. [1 ]
Idigoras, I. [1 ]
机构
[1] Robotiker Tecnalia, E-48170 Zamudio, Bizkaia, Spain
来源
关键词
Deaf; Assistive products; sound recognition; GMM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
People with hearing disabilities experience the problems that stem from not being able to detect or identify sounds on a daily basis. Studying the techniques and algorithms which enable this task to be performed automatically may lead to significant technological progress which will offer huge benefits to deaf people. With the objective of developing an application which is capable of detecting and classifying the different sounds that may emerge in the home, a study is being carried out which shows the most important parameters for processing impulsive sounds such as door bells, alarm clocks, a baby crying... which obtain high accuracy ratios and give the classifier high reliability. To date, an initial prototype has been developed which implements a GMM (Gaussian Mixture Model) classifier which is based on the Gaussian probability distribution for sound event prediction. In order to check the classifier's accuracy, typical speech recognition parameters have been used, such as MFCC (Mel frequency cepstral coefficient), as well as parameters used to recognise musical instruments and background sounds: Spectral Centroid, Roll-Off Point and ZCR (16 parameters in total). By varying a series of factors (number of parameters, the sounds used to train the classifier...) the GMM's behaviour has been analysed obtaining results with over 90% accuracy in frames and up to 100% accuracy using the sound average, identifying doors, telephones and alarm clocks.
引用
收藏
页码:276 / 280
页数:5
相关论文
共 50 条
  • [21] Auditory hallucinations and the mismatch negativity: Processing speech and non-speech sounds in schizophrenia
    Fisher, Derek J.
    Labelle, Alain
    Knott, Verner J.
    INTERNATIONAL JOURNAL OF PSYCHOPHYSIOLOGY, 2008, 70 (01) : 3 - 15
  • [22] Sensitivity of the human auditory cortex to acoustic degradation of speech and non-speech sounds
    Miettinen, Ismo
    Tiitinen, Hannu
    Alku, Paavo
    May, Patrick J. C.
    BMC NEUROSCIENCE, 2010, 11
  • [23] Listening to speech and non-speech sounds activates phonological and semantic knowledge differently
    Bartolotti, James
    Schroeder, Scott R.
    Hayakawa, Sayuri
    Rochanavibhata, Sirada
    Chen, Peiyao
    Marian, Viorica
    QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2020, 73 (08): : 1135 - 1149
  • [24] Call Analysis with Classification Using Speech and Non-Speech Features
    Ju, Yun-Cheng
    Wang, Ye-Yi
    Acero, Alex
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1902 - 1905
  • [25] Robust speech/non-speech classification in heterogeneous multimedia content
    Huijbiegts, Marijn
    de Jong, Fianciska
    SPEECH COMMUNICATION, 2011, 53 (02) : 143 - 153
  • [26] Speech and Non-speech identification and classification using KNN algorithm
    Priya, T. Lakshmi
    Raajan, N. R.
    Raju, N.
    Preethi, P.
    Mathini, S.
    INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 : 952 - 958
  • [27] Central processing of speech sounds and non-speech sounds with similar spectral distribution: An auditory evoked potential study
    Kaneshiro, Shinsuke
    Hiraumi, Harukazu
    Sato, Hiroaki
    AURIS NASUS LARYNX, 2020, 47 (05) : 727 - 733
  • [28] The processing of speech and non-speech sounds in aphasic patients as reflected by the mismatch negativity (MMN)
    Ilvonen, T
    Kujala, T
    Kozou, H
    Kiesiläinen, A
    Salonen, O
    Alku, P
    Näätänen, R
    NEUROSCIENCE LETTERS, 2004, 366 (03) : 235 - 240
  • [29] Plastic cortical changes induced by learning to communicate with non-speech sounds
    Kujala, A
    Huotilainen, M
    Uther, M
    Shtyrov, Y
    Monto, S
    Ilmoniemi, RJ
    Näätänen, R
    NEUROREPORT, 2003, 14 (13) : 1683 - 1687
  • [30] Environmental noise reduction based on speech/non-speech identification for hearing aids
    Itoh, K
    Mizushima, M
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS, 1997, : 419 - 422