Non-Speech Sounds Classification for People with Hearing Disabilities

被引:0
|
作者
Lozano, H. [1 ]
Hernaez, I.
Navas, E.
Gonzalez, F. J. [1 ]
Idigoras, I. [1 ]
机构
[1] Robotiker Tecnalia, E-48170 Zamudio, Bizkaia, Spain
来源
关键词
Deaf; Assistive products; sound recognition; GMM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
People with hearing disabilities experience the problems that stem from not being able to detect or identify sounds on a daily basis. Studying the techniques and algorithms which enable this task to be performed automatically may lead to significant technological progress which will offer huge benefits to deaf people. With the objective of developing an application which is capable of detecting and classifying the different sounds that may emerge in the home, a study is being carried out which shows the most important parameters for processing impulsive sounds such as door bells, alarm clocks, a baby crying... which obtain high accuracy ratios and give the classifier high reliability. To date, an initial prototype has been developed which implements a GMM (Gaussian Mixture Model) classifier which is based on the Gaussian probability distribution for sound event prediction. In order to check the classifier's accuracy, typical speech recognition parameters have been used, such as MFCC (Mel frequency cepstral coefficient), as well as parameters used to recognise musical instruments and background sounds: Spectral Centroid, Roll-Off Point and ZCR (16 parameters in total). By varying a series of factors (number of parameters, the sounds used to train the classifier...) the GMM's behaviour has been analysed obtaining results with over 90% accuracy in frames and up to 100% accuracy using the sound average, identifying doors, telephones and alarm clocks.
引用
收藏
页码:276 / 280
页数:5
相关论文
共 50 条
  • [41] Speech/non-speech classification using multiple features for robust endpoint detection
    Shin, WH
    Lee, BS
    Lee, YK
    Lee, JS
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1399 - 1402
  • [42] Classification of non-speech acoustic signals using structure models
    Tschöpe, C
    Hentschel, D
    Wolff, M
    Eichner, M
    Hoffmann, R
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: DESIGN AND IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS INDUSTRY TECHNOLOGY TRACKS MACHINE LEARNING FOR SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING SIGNAL PROCESSING FOR EDUCATION, 2004, : 653 - 656
  • [43] Auditory evoked potentials using speech and non-speech sounds in a dichotic-listening paradigm.
    Hen-Tov, JK
    Cottone, JG
    Harkavy, LA
    Squires, NK
    JOURNAL OF COGNITIVE NEUROSCIENCE, 1999, : 101 - 101
  • [44] Perceiving identical sounds as speech or non-speech modulates activity in the left posterior superior temporal sulcus
    Möttönen, R
    Calvert, GA
    Jääskeläinen, IP
    Matthews, PM
    Thesen, T
    Tuomainen, J
    Sams, M
    NEUROIMAGE, 2006, 30 (02) : 563 - 569
  • [45] Hemispheric asymmetry of the auditory short-term habituation:: speech vs. non-speech sounds
    Sörös, P
    Knecht, S
    Teismann, I
    Manemann, E
    Imai, T
    Lütkenhöner, B
    Pantev, C
    NEUROIMAGE, 2001, 13 (06) : S940 - S940
  • [46] Impact of irrelevant speech and non-speech sounds on serial recall of verbal and spatial items in children and adults
    Larissa Leist
    Thomas Lachmann
    Maria Klatte
    Scientific Reports, 15 (1)
  • [47] VOWELS, CONSONANTS, SPEECH, AND NON-SPEECH
    ADES, AE
    PSYCHOLOGICAL REVIEW, 1977, 84 (06) : 524 - 530
  • [48] Robust speech and non-speech detection
    Tian, Y
    Wang, ZY
    Lu, DJ
    CHINESE JOURNAL OF ELECTRONICS, 2002, 11 (01): : 79 - 82
  • [49] Categorical Tone Identification in Speech and Non-Speech Sounds for Chinese- and English-Native Listeners
    Liu, Chang
    2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, 2014, : 454 - 458
  • [50] Speech and non-speech processing in people with specific language impairment: A behavioural and electrophysiological study
    McArthur, GM
    Bishop, DVM
    BRAIN AND LANGUAGE, 2005, 94 (03) : 260 - 273