Pseudo-color cochleagram image feature and sequential feature selection for robust acoustic event recognition

被引:15
|
作者
Sharan, Roneel V. [1 ]
Moir, Tom J. [2 ]
机构
[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
[2] Auckland Univ Technol, Sch Engn, Private Bag 92006, Auckland 1142, New Zealand
关键词
Acoustic event recognition; Cochleagram; Pseudo-color; Sequential backward feature selection; Support vector machines; Time-frequency image; CLASSIFICATION; NOISE;
D O I
10.1016/j.apacoust.2018.05.030
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This work proposes the use of pseudo-color cochleagram image of sound signals for feature extraction for robust acoustic event recognition. A cochleagram is a variation of the spectrogram. It utilizes a gammatone filter and has been shown to better reveal spectral information. We propose mapping of the grayscale cochleagram image to higher dimensional color space for improved characterization from environmental noise. The resulting time frequency representation is referred as pseudo-color cochleagram image and the resulting feature, which captures the statistical distribution, as pseudo-color cochleagram image feature (PC-CIF). In addition, sequential backward feature selection is applied for selecting the most useful feature dimensions, thereby reducing the feature dimension and improving the classification performance. We evaluate the effectiveness of the proposed methods using two classifiers, k-nearest neighbor and support vector machines. The performance is evaluated on a dataset containing 50 sound classes, taken from the Real World Computing Partnership Sound Scene Database in Real Acoustical Environments, with the addition of environmental noise at various signal-to-noise ratios. The experimental results show that the proposed techniques give significant improvement in classification performance over baseline methods. The most improved results were observed at low signal-to-noise ratios.
引用
收藏
页码:198 / 204
页数:7
相关论文
共 50 条
  • [1] Sound-Event Classification Using Pseudo-Color CENTRIST Feature and Classifier Selection
    Ren, Jianfeng
    Jiang, Xudong
    Yuan, Junsong
    FIRST INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2016, 0011
  • [2] Cochleagram Image Feature for Improved Robustness in Sound Recognition
    Sharan, Roneel V.
    Moir, Tom J.
    2015 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2015, : 441 - 444
  • [3] A lightweight network based on multi-feature pseudo-color mapping for arrhythmia recognition
    Ma, Yijun
    Li, Junyan
    Zhang, Jinbiao
    Wang, Jilin
    Sun, Guozhen
    Zhang, Yatao
    HEALTH INFORMATION SCIENCE AND SYSTEMS, 2024, 12 (01):
  • [4] Infrared target tracking in multiple feature pseudo-color image with kernel density estimation
    Liu, Ruiming
    Lu, Yanhong
    INFRARED PHYSICS & TECHNOLOGY, 2012, 55 (06) : 505 - 512
  • [5] Pseudo-color encoding and correlation recognition of infrared image
    The Second Artillery Engineering College, Xi'an 710025, China
    Guangxue Jishu/Optical Technique, 2007, 33 (SUPPL.): : 224 - 226
  • [6] Application of Pseudo-color Image Feature-Level Fusion in Nondestructive Testing of Wire Ropes
    Zhang, Juwei
    Lu, Shiliang
    Chen, Jinbao
    JOURNAL OF FAILURE ANALYSIS AND PREVENTION, 2020, 20 (05) : 1541 - 1553
  • [7] Application of Pseudo-color Image Feature-Level Fusion in Nondestructive Testing of Wire Ropes
    Juwei Zhang
    Shiliang Lu
    Jinbao Chen
    Journal of Failure Analysis and Prevention, 2020, 20 : 1541 - 1553
  • [8] Acoustic event recognition using cochleagram image and convolutional neural networks
    Sharan, Roneel V.
    Moir, Tom J.
    APPLIED ACOUSTICS, 2019, 148 : 62 - 66
  • [9] Acoustic Feature Extraction for Robust Event Recognition on Cleaning Robot Platform
    Park, Sang-wook
    Rho, Jin-sang
    Shin, Min-kyu
    Han, David K.
    Ko, Hanseok
    2014 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2014, : 147 - 148
  • [10] Feature analysis and selection for acoustic event detection
    Zhuang, Xiaodan
    Zhou, Xi
    Huang, Thomas S.
    Hasegawa-Johnson, Mark
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 17 - 20