Noise-robust acoustic signature recognition using nonlinear Hebbian learning

被引:4
|
作者
Lu, Bing [1 ]
Dibazar, Alireza [1 ]
Berger, Theodore W. [1 ]
机构
[1] Univ So Calif, Dept Biomed Engn, Los Angeles, CA 90089 USA
关键词
Independent feature extraction; Higher order statistics; Noise robustness; Nonlinear Hebbian learning; Statistical optimization; INDEPENDENT COMPONENT ANALYSIS; MAXIMUM-LIKELIHOOD; SPEAKER RECOGNITION; ALGORITHM; INFOMAX;
D O I
10.1016/j.neunet.2010.07.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose using a new biologically inspired approach nonlinear Hebbian learning (NHL) to implement acoustic signal recognition in noisy environments The proposed learning processes both spectral and temporal features of input acoustic data The spectral analysis is realized by using auditory gammatone filterbanks The temporal dynamics is addressed by analyzing gammatone-filtered feature vectors over multiple temporal frames which is called a spectro-temporal representation (STR) Given STR features the exact acoustic signatures of signals of interest and the mixing property between signals of interest and noises are generally unknown The nonlinear Hebbian learning is then employed to extract representative independent features from STRs and to reduce their dimensionality The extracted independent features of signals of interest are called signatures In the meantime of learning the synaptic weight vectors between input and output neurons are adaptively updated These weight vectors project data into a feature subspace in which signals of interest are selected while noises are attenuated Compared with linear Hebbian learning (LHL) which explores the second-order moment of data the applied NHL involves the higher-order statistics of data Therefore NHL can capture representative features that are more statistically independent than LHL can Besides the nonlinear activation function of NHL can be chosen to refer to the implicit distribution of many acoustic sounds and thus making the learning optimized in an aspect of mutual information Simulation results show that the whole proposed system can more accurately recognize signals of interest than other conventional methods in severely noisy circumstances One applicable project is detecting moving vehicles Noise-contaminated vehicle sound is recognized while other non-vehicle sounds are rejected When vehicle is contaminated by human vowel bird chirp or additive white Gaussian noise (AWGN) at SNR = 0 dB the proposed system dramatically decreases the error rate over normally used acoustic feature extraction method mel-frequency cepstral computation (MFCC) by 26% 36 3% and 60 3% respectively and over LHL by 20% 2 3% and 15 3% respectively Another applicable project is vehicle type identification The proposed system achieves better performance than LHL e g 40% improvement when gasoline heavy wheeled car is contaminated by AWGN at SNR = 5 dB More importantly the proposed system is implemented in real-time field testing for months The purpose is to detect vehicle with any make or model moving on the street with speed 10-35 mph The missing rate is 1-2% when vehicle is contaminated by any surrounding noises (human conversation animal sound airplane wind etc) at SNR = 0-20 dB The false alarm rate is around 1% To summarize this study not only provides an efficient approach to extract representative independent features from high-dimensional data but also offers robustness against severe noises Published by Elsevier Ltd
引用
收藏
页码:1252 / 1263
页数:12
相关论文
共 50 条
  • [1] Software Entity Recognition with Noise-Robust Learning
    Tai Nguyen
    Di, Yifeng
    Lee, Joohan
    Chen, Muhao
    Zhang, Tianyi
    [J]. 2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 484 - 496
  • [2] Nonlinear Compensation Using the Gauss-Newton Method for Noise-Robust Speech Recognition
    Zhao, Yong
    Juang, Biing-Hwang
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (08): : 2191 - 2206
  • [3] Unsupervised modulation filter learning for noise-robust speech recognition
    Agrawal, Purvi
    Ganapathy, Sriram
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 142 (03): : 1686 - 1692
  • [4] Nonlinear Hebbian Learning for Noise-Independent Vehicle Sound Recognition
    Lu, Bing
    Dibazar, Alireza
    Berger, Theodore W.
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1336 - 1343
  • [5] Noise-robust Attention Learning for End-to-End Speech Recognition
    Higuchi, Yosuke
    Tawara, Naohiro
    Ogawa, Atsunori
    Iwata, Tomoharu
    Kobayashi, Tetsunori
    Ogawa, Tetsuji
    [J]. 28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 311 - 315
  • [6] EXTENDED VTS FOR NOISE-ROBUST SPEECH RECOGNITION
    van Dalen, R. C.
    Gales, M. J. F.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3829 - 3832
  • [7] Covariance Modelling for Noise-Robust Speech Recognition
    van Dalen, R. C.
    Gales, M. J. F.
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2000 - 2003
  • [8] An Overview of Noise-Robust Automatic Speech Recognition
    Li, Jinyu
    Deng, Li
    Gong, Yifan
    Haeb-Umbach, Reinhold
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (04) : 745 - 777
  • [9] Extended VTS for Noise-Robust Speech Recognition
    van Dalen, Rogier C.
    Gales, Mark J. F.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 733 - 743
  • [10] Frame decorrelation for noise-robust speech recognition
    Jung, HY
    Kim, DY
    Un, CK
    [J]. ELECTRONICS LETTERS, 1996, 32 (13) : 1163 - 1164