Non-Speech Sounds Classification for People with Hearing Disabilities

被引:0
|
作者
Lozano, H. [1 ]
Hernaez, I.
Navas, E.
Gonzalez, F. J. [1 ]
Idigoras, I. [1 ]
机构
[1] Robotiker Tecnalia, E-48170 Zamudio, Bizkaia, Spain
来源
关键词
Deaf; Assistive products; sound recognition; GMM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
People with hearing disabilities experience the problems that stem from not being able to detect or identify sounds on a daily basis. Studying the techniques and algorithms which enable this task to be performed automatically may lead to significant technological progress which will offer huge benefits to deaf people. With the objective of developing an application which is capable of detecting and classifying the different sounds that may emerge in the home, a study is being carried out which shows the most important parameters for processing impulsive sounds such as door bells, alarm clocks, a baby crying... which obtain high accuracy ratios and give the classifier high reliability. To date, an initial prototype has been developed which implements a GMM (Gaussian Mixture Model) classifier which is based on the Gaussian probability distribution for sound event prediction. In order to check the classifier's accuracy, typical speech recognition parameters have been used, such as MFCC (Mel frequency cepstral coefficient), as well as parameters used to recognise musical instruments and background sounds: Spectral Centroid, Roll-Off Point and ZCR (16 parameters in total). By varying a series of factors (number of parameters, the sounds used to train the classifier...) the GMM's behaviour has been analysed obtaining results with over 90% accuracy in frames and up to 100% accuracy using the sound average, identifying doors, telephones and alarm clocks.
引用
收藏
页码:276 / 280
页数:5
相关论文
共 50 条
  • [31] Effects of audio-visual integration on the detection of masked speech and non-speech sounds
    Eramudugolla, Ranmalee
    Henderson, Rachel
    Mattingley, Jason B.
    BRAIN AND COGNITION, 2011, 75 (01) : 60 - 66
  • [32] Two-stage speech/non-speech classification of telephone signals
    Li Jian-Bin
    Yan Ji-Kun
    Zheng Hui
    Niu Zhong-Xia
    2006 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1-4: VOL 1: SIGNAL PROCESSING, 2006, : 490 - +
  • [33] DICHOTIC AND MONOTIC INTERACTIONS BETWEEN SPEECH AND NON-SPEECH SOUNDS AT DIFFERENT STIMULUS ONSET ASYNCHRONIES
    PORTER, RJ
    MIRABILE, PJ
    PERCEPTION & PSYCHOPHYSICS, 1977, 21 (05): : 408 - 412
  • [34] Of words and whistles: Statistical learning operates similarly for identical sounds perceived as speech and non-speech
    Sweet, Sierra J.
    Van Hedger, Stephen C.
    Batterink, Laura J.
    COGNITION, 2024, 242
  • [35] Discrimination of speech and non-speech sounds following theta-burst stimulation of the motor cortex
    Rogers, Jack C.
    Moettoenen, Riikka
    Boyles, Rowan
    Watkins, Kate E.
    FRONTIERS IN PSYCHOLOGY, 2014, 5
  • [36] An event-related potential (ERP) study of duration changes in speech and non-speech sounds
    Jaramillo, M
    Alku, P
    Paavilainen, P
    NEUROREPORT, 1999, 10 (16) : 3301 - 3305
  • [37] Auditory spatial attention to speech and complex non-speech sounds in children with autism spectrum disorder
    Soskey, Laura N.
    Allen, Paul D.
    Bennetto, Loisa
    AUTISM RESEARCH, 2017, 10 (08) : 1405 - 1416
  • [38] Audemes at work: Investigating features of non-speech sounds to maximize content recognition
    Ferati, Mexhid
    Pfaff, Mark S.
    Mannheimer, Steve
    Bolchini, Davide
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2012, 70 (12) : 936 - 966
  • [39] Discrimination and categorization of speech and non-speech sounds in an MEG delayed-match-to-sample study
    Luo, H
    Husain, FT
    Horwitz, B
    Poeppel, D
    NEUROIMAGE, 2005, 28 (01) : 59 - 71
  • [40] CATEGORICAL PERCEPTION OF NON-SPEECH SOUNDS BY 2-MONTH-OLD INFANTS
    JUSCZYK, PW
    ROSNER, BS
    CUTTING, JE
    FOARD, CF
    SMITH, LB
    PERCEPTION & PSYCHOPHYSICS, 1977, 21 (01): : 50 - 54