Pseudo-color cochleagram image feature and sequential feature selection for robust acoustic event recognition

Cited by: 15

Authors:

Sharan, Roneel V. [1]

Moir, Tom J. [2]

Affiliations:

[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia

[2] Auckland Univ Technol, Sch Engn, Private Bag 92006, Auckland 1142, New Zealand

Source:

APPLIED ACOUSTICS | 2018 / Vol. 140

Keywords:

Acoustic event recognition; Cochleagram; Pseudo-color; Sequential backward feature selection; Support vector machines; Time-frequency image; CLASSIFICATION; NOISE;

DOI:

10.1016/j.apacoust.2018.05.030

Chinese Library Classification number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

This work proposes using a pseudo-color cochleagram image of a sound signal for feature extraction in robust acoustic event recognition. A cochleagram is a variation of the spectrogram that uses a gammatone filter and has been shown to better reveal spectral information. We propose mapping the grayscale cochleagram image to a higher-dimensional color space for improved characterization in the presence of environmental noise. The resulting time-frequency representation is referred to as the pseudo-color cochleagram image, and the resulting feature, which captures the statistical distribution, as the pseudo-color cochleagram image feature (PC-CIF). In addition, sequential backward feature selection is applied to select the most useful feature dimensions, thereby reducing the feature dimension and improving classification performance. We evaluate the effectiveness of the proposed methods using two classifiers, k-nearest neighbor and support vector machines. Performance is evaluated on a dataset containing 50 sound classes, taken from the Real World Computing Partnership Sound Scene Database in Real Acoustical Environments, with environmental noise added at various signal-to-noise ratios. The experimental results show that the proposed techniques yield a significant improvement in classification performance over baseline methods. The largest improvements were observed at low signal-to-noise ratios.
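
The two ideas in the abstract, pseudo-color mapping of a grayscale time-frequency image and sequential backward feature selection, can be illustrated with a minimal sketch. The abstract does not state the color map, the statistics, or the selection criterion, so everything below is an assumption made for illustration: a jet-like piecewise-linear color map, block-wise mean and standard deviation per channel, and leave-one-out 1-nearest-neighbor accuracy as the selection score. The function names are hypothetical and this is not the authors' implementation.

```python
import numpy as np

def pseudo_color(gray):
    """Map a grayscale image in [0, 1] to RGB with a jet-like map (assumed, not from the paper)."""
    g = np.clip(np.asarray(gray, dtype=float), 0.0, 1.0)
    red = np.clip(1.5 - np.abs(4.0 * g - 3.0), 0.0, 1.0)
    green = np.clip(1.5 - np.abs(4.0 * g - 2.0), 0.0, 1.0)
    blue = np.clip(1.5 - np.abs(4.0 * g - 1.0), 0.0, 1.0)
    return np.stack([red, green, blue], axis=-1)

def channel_stats(rgb, blocks=3):
    """Per-channel mean and standard deviation over a blocks x blocks grid (assumed statistics)."""
    h, w, _ = rgb.shape
    feats = []
    for rows in np.array_split(np.arange(h), blocks):
        for cols in np.array_split(np.arange(w), blocks):
            patch = rgb[np.ix_(rows, cols)]          # shape: (len(rows), len(cols), 3)
            feats.extend(patch.mean(axis=(0, 1)))
            feats.extend(patch.std(axis=(0, 1)))
    return np.array(feats)

def loo_1nn_accuracy(X, y):
    """Leave-one-out accuracy of a 1-nearest-neighbor classifier (assumed selection score)."""
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    np.fill_diagonal(d, np.inf)                      # a sample may not be its own neighbor
    return float(np.mean(y[d.argmin(axis=1)] == y))

def sequential_backward_selection(X, y, score_fn, min_features=1):
    """Start from all features; repeatedly drop the one whose removal scores best.

    Stops when every possible removal would lower the score, or when
    min_features dimensions remain.
    """
    selected = list(range(X.shape[1]))
    best_score = score_fn(X[:, selected], y)
    while len(selected) > min_features:
        candidates = [
            (score_fn(X[:, [j for j in selected if j != i]], y), i)
            for i in selected
        ]
        score, drop = max(candidates)
        if score < best_score:
            break
        best_score = score
        selected.remove(drop)
    return selected, best_score

# Toy data: feature 0 separates the classes, features 1 and 2 carry no class information.
idx = np.arange(12)
y_toy = (idx >= 6).astype(int)
X_toy = np.column_stack([10.0 * y_toy + 0.1 * idx, idx % 2, idx % 3]).astype(float)

selected, score = sequential_backward_selection(X_toy, y_toy, loo_1nn_accuracy)
print(selected, score)  # [0] 1.0
```

On real data, each cochleagram image would first pass through `pseudo_color` and `channel_stats` to build one feature row per sound clip, and the selection score would normally be computed on held-out data rather than on the data used for selection.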


Pages: 198 - 204

Page count: 7

50 records in total

[1] Sound-Event Classification Using Pseudo-Color CENTRIST Feature and Classifier Selection
Ren, Jianfeng
Jiang, Xudong
Yuan, Junsong
FIRST INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2016, 0011
[2] Cochleagram Image Feature for Improved Robustness in Sound Recognition
Sharan, Roneel V.
Moir, Tom J.
2015 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2015, : 441 - 444
[3] A lightweight network based on multi-feature pseudo-color mapping for arrhythmia recognition
Ma, Yijun
Li, Junyan
Zhang, Jinbiao
Wang, Jilin
Sun, Guozhen
Zhang, Yatao
HEALTH INFORMATION SCIENCE AND SYSTEMS, 2024, 12 (01):
[4] Infrared target tracking in multiple feature pseudo-color image with kernel density estimation
Liu, Ruiming
Lu, Yanhong
INFRARED PHYSICS & TECHNOLOGY, 2012, 55 (06) : 505 - 512
[5] Pseudo-color encoding and correlation recognition of infrared image
The Second Artillery Engineering College, Xi'an 710025, China
Guangxue Jishu/Optical Technique, 2007, 33 (SUPPL.) : 224 - 226
[6] Application of Pseudo-color Image Feature-Level Fusion in Nondestructive Testing of Wire Ropes
Zhang, Juwei
Lu, Shiliang
Chen, Jinbao
JOURNAL OF FAILURE ANALYSIS AND PREVENTION, 2020, 20 (05) : 1541 - 1553
[7] Application of Pseudo-color Image Feature-Level Fusion in Nondestructive Testing of Wire Ropes
Juwei Zhang
Shiliang Lu
Jinbao Chen
Journal of Failure Analysis and Prevention, 2020, 20 : 1541 - 1553
[8] Acoustic event recognition using cochleagram image and convolutional neural networks
Sharan, Roneel V.
Moir, Tom J.
APPLIED ACOUSTICS, 2019, 148 : 62 - 66
[9] Acoustic Feature Extraction for Robust Event Recognition on Cleaning Robot Platform
Park, Sang-wook
Rho, Jin-sang
Shin, Min-kyu
Han, David K.
Ko, Hanseok
2014 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2014, : 147 - 148
[10] Feature analysis and selection for acoustic event detection
Zhuang, Xiaodan
Zhou, Xi
Huang, Thomas S.
Hasegawa-Johnson, Mark
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 17 - 20
