Noise-robust acoustic signature recognition using nonlinear Hebbian learning

被引：4

作者：

Lu, Bing ^{[1
]}

Dibazar, Alireza ^{[1
]}

Berger, Theodore W. ^{[1
]}

机构：

[1] Univ So Calif, Dept Biomed Engn, Los Angeles, CA 90089 USA

来源：

NEURAL NETWORKS | 2010年 / 23卷 / 10期

关键词：

Independent feature extraction; Higher order statistics; Noise robustness; Nonlinear Hebbian learning; Statistical optimization; INDEPENDENT COMPONENT ANALYSIS; MAXIMUM-LIKELIHOOD; SPEAKER RECOGNITION; ALGORITHM; INFOMAX;

D O I：

10.1016/j.neunet.2010.07.003

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose using a new biologically inspired approach nonlinear Hebbian learning (NHL) to implement acoustic signal recognition in noisy environments The proposed learning processes both spectral and temporal features of input acoustic data The spectral analysis is realized by using auditory gammatone filterbanks The temporal dynamics is addressed by analyzing gammatone-filtered feature vectors over multiple temporal frames which is called a spectro-temporal representation (STR) Given STR features the exact acoustic signatures of signals of interest and the mixing property between signals of interest and noises are generally unknown The nonlinear Hebbian learning is then employed to extract representative independent features from STRs and to reduce their dimensionality The extracted independent features of signals of interest are called signatures In the meantime of learning the synaptic weight vectors between input and output neurons are adaptively updated These weight vectors project data into a feature subspace in which signals of interest are selected while noises are attenuated Compared with linear Hebbian learning (LHL) which explores the second-order moment of data the applied NHL involves the higher-order statistics of data Therefore NHL can capture representative features that are more statistically independent than LHL can Besides the nonlinear activation function of NHL can be chosen to refer to the implicit distribution of many acoustic sounds and thus making the learning optimized in an aspect of mutual information Simulation results show that the whole proposed system can more accurately recognize signals of interest than other conventional methods in severely noisy circumstances One applicable project is detecting moving vehicles Noise-contaminated vehicle sound is recognized while other non-vehicle sounds are rejected When vehicle is contaminated by human vowel bird chirp or additive white Gaussian noise (AWGN) at SNR = 0 dB the proposed system dramatically decreases the error rate over normally used acoustic feature extraction method mel-frequency cepstral computation (MFCC) by 26% 36 3% and 60 3% respectively and over LHL by 20% 2 3% and 15 3% respectively Another applicable project is vehicle type identification The proposed system achieves better performance than LHL e g 40% improvement when gasoline heavy wheeled car is contaminated by AWGN at SNR = 5 dB More importantly the proposed system is implemented in real-time field testing for months The purpose is to detect vehicle with any make or model moving on the street with speed 10-35 mph The missing rate is 1-2% when vehicle is contaminated by any surrounding noises (human conversation animal sound airplane wind etc) at SNR = 0-20 dB The false alarm rate is around 1% To summarize this study not only provides an efficient approach to extract representative independent features from high-dimensional data but also offers robustness against severe noises Published by Elsevier Ltd

引用

页码：1252 / 1263

页数：12

共 50 条

[1] Software Entity Recognition with Noise-Robust Learning
Tai Nguyen
Di, Yifeng
Lee, Joohan
Chen, Muhao
Zhang, Tianyi
[J]. 2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 484 - 496
[2] Nonlinear Compensation Using the Gauss-Newton Method for Noise-Robust Speech Recognition
Zhao, Yong
Juang, Biing-Hwang
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (08): : 2191 - 2206
[3] Unsupervised modulation filter learning for noise-robust speech recognition
Agrawal, Purvi
Ganapathy, Sriram
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 142 (03): : 1686 - 1692
[4] Nonlinear Hebbian Learning for Noise-Independent Vehicle Sound Recognition
Lu, Bing
Dibazar, Alireza
Berger, Theodore W.
[J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1336 - 1343
[5] Noise-robust Attention Learning for End-to-End Speech Recognition
Higuchi, Yosuke
Tawara, Naohiro
Ogawa, Atsunori
Iwata, Tomoharu
Kobayashi, Tetsunori
Ogawa, Tetsuji
[J]. 28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 311 - 315
[6] EXTENDED VTS FOR NOISE-ROBUST SPEECH RECOGNITION
van Dalen, R. C.
Gales, M. J. F.
[J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3829 - 3832
[7] Covariance Modelling for Noise-Robust Speech Recognition
van Dalen, R. C.
Gales, M. J. F.
[J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2000 - 2003
[8] An Overview of Noise-Robust Automatic Speech Recognition
Li, Jinyu
Deng, Li
Gong, Yifan
Haeb-Umbach, Reinhold
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (04) : 745 - 777
[9] Extended VTS for Noise-Robust Speech Recognition
van Dalen, Rogier C.
Gales, Mark J. F.
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 733 - 743
[10] Frame decorrelation for noise-robust speech recognition
Jung, HY
Kim, DY
Un, CK
[J]. ELECTRONICS LETTERS, 1996, 32 (13) : 1163 - 1164

← 1 2 3 4 5 →