Non-Speech Sounds Classification for People with Hearing Disabilities

被引:0
|
作者
Lozano, H. [1 ]
Hernaez, I.
Navas, E.
Gonzalez, F. J. [1 ]
Idigoras, I. [1 ]
机构
[1] Robotiker Tecnalia, E-48170 Zamudio, Bizkaia, Spain
来源
关键词
Deaf; Assistive products; sound recognition; GMM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
People with hearing disabilities experience the problems that stem from not being able to detect or identify sounds on a daily basis. Studying the techniques and algorithms which enable this task to be performed automatically may lead to significant technological progress which will offer huge benefits to deaf people. With the objective of developing an application which is capable of detecting and classifying the different sounds that may emerge in the home, a study is being carried out which shows the most important parameters for processing impulsive sounds such as door bells, alarm clocks, a baby crying... which obtain high accuracy ratios and give the classifier high reliability. To date, an initial prototype has been developed which implements a GMM (Gaussian Mixture Model) classifier which is based on the Gaussian probability distribution for sound event prediction. In order to check the classifier's accuracy, typical speech recognition parameters have been used, such as MFCC (Mel frequency cepstral coefficient), as well as parameters used to recognise musical instruments and background sounds: Spectral Centroid, Roll-Off Point and ZCR (16 parameters in total). By varying a series of factors (number of parameters, the sounds used to train the classifier...) the GMM's behaviour has been analysed obtaining results with over 90% accuracy in frames and up to 100% accuracy using the sound average, identifying doors, telephones and alarm clocks.
引用
收藏
页码:276 / 280
页数:5
相关论文
共 50 条
  • [1] LOCALIZATION OF SPEECH AND NON-SPEECH SOUNDS
    SHIGENO, S
    OYAMA, T
    JAPANESE PSYCHOLOGICAL RESEARCH, 1983, 25 (02) : 112 - 117
  • [2] Perception of speech and non-speech sounds by listeners with real and simulated sensorineural hearing loss
    Lum, DS
    Braida, LD
    JOURNAL OF PHONETICS, 2000, 28 (03) : 343 - 366
  • [3] Classification of Non-Speech Human Sounds: Feature Selection and Snoring Sound Analysis
    Liao, Wen-Hung
    Lin, Yu-Kai
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 2695 - 2700
  • [4] A semiotic approach to the design of non-speech sounds
    Murphy, Emma
    Pirhonen, Antti
    McAllister, Graham
    Yu, Wai
    HAPTIC AND AUDIO INTERACTION DESIGN, PROCEEDINGS, 2006, 4129 : 121 - 132
  • [5] On the Impact of Non-speech Sounds on Speaker Recognition
    Janicki, Artur
    TEXT, SPEECH AND DIALOGUE, TSD 2012, 2012, 7499 : 566 - 572
  • [6] The discrimination of and orienting to speech and non-speech sounds in children with autism
    Lepistö, T
    Kujala, T
    Vanhala, R
    Alku, P
    Huotilainen, M
    Näätänen, R
    BRAIN RESEARCH, 2005, 1066 (1-2) : 147 - 157
  • [7] Distinctive magnetic activity elicited by speech and non-speech sounds
    Miyagishima, K.
    Imaizumi, S.
    Mori, K.
    Yoneda, K.
    Kiritani, S.
    Yumoto, M.
    Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 1994, 15 (03):
  • [8] Hemispheric processing of duration changes in speech and non-speech sounds
    Takegata, R
    Nakagawa, S
    Tonoike, M
    Näätänen, R
    NEUROREPORT, 2004, 15 (10) : 1683 - 1686
  • [9] Fuzzy integral based information fusion for classification of highly confusable non-speech sounds
    Temko, Andrey
    Macho, Dusan
    Nadeu, Climent
    PATTERN RECOGNITION, 2008, 41 (05) : 1814 - 1823
  • [10] A New Front-End for Classification of Non-Speech Sounds: A Study on Human Whistle
    Nandwana, Mahesh Kumar
    Boril, Hynek
    Hansen, John H. L.
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1982 - 1986