Sparse Representation with Temporal Max-Smoothing for Acoustic Event Detection

被引:0
|
作者
Lu, Xugang [1 ]
Shen, Peng [1 ]
Tsao, Yu [2 ]
Hori, Chiori [1 ]
Kawai, Hisashi [1 ]
机构
[1] Natl Inst Informat & Commun Technol, Koganei, Tokyo, Japan
[2] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei, Taiwan
关键词
Feature learning; matching pursuit; temporal max-smoothing; acoustic event detection;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In order to incorporate long temporal-frequency structure for acoustic event detection, we have proposed a spectral patch based learning and representation method. The learned spectral patches were regarded as acoustic words which were further used in sparse encoding for acoustic feature representation and modeling. In our previous study, during feature encoding stage, each spectral patch was encoded independently. Considering that spectral patches taken from a time sequence should keep similar representations for neighboring patches after encoding, in this study, we propose to enhance the temporal correlation of feature representation using a temporal max-smoothing algorithm. The max-smoothing tries to pick up the maximum response in a local time window as the representative feature for detection task. We tested the new feature for automatic detection of acoustic events which were selected from lecture audio data. Experimental results showed that the temporal max-smoothing significantly improved the performance.
引用
收藏
页码:1176 / 1180
页数:5
相关论文
共 50 条
  • [31] Learning a sparse representation for object detection
    Agarwal, S
    Roth, D
    [J]. COMPUTER VISION - ECCV 2002, PT IV, 2002, 2353 : 113 - 127
  • [32] An image sparse representation for saliency detection
    [J]. Yang, J. (yangjun9118@gmail.com), 1600, Universitas Ahmad Dahlan, Jalan Kapas 9, Semaki, Umbul Harjo,, Yogiakarta, 55165, Indonesia (11):
  • [33] HUMAN DETECTION USING SPARSE REPRESENTATION
    Vinay, G. Krishna
    Haque, S. M.
    Babu, R. Venkatesh
    Ramakrishnan, K. R.
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 1513 - 1516
  • [34] Research on Sparse Representation Method of Acoustic Microimaging Signals
    Wang, Kun
    Leng, Tao
    Mao, Jie
    Lian, Guoxuan
    Zhou, Changzhi
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (02):
  • [35] COMPLEX NMF: A NEW SPARSE REPRESENTATION FOR ACOUSTIC SIGNALS
    Kameoka, Hirokazu
    Ono, Nobutaka
    Kashino, Kunio
    Sagayama, Shigeki
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3437 - +
  • [36] Sparse Representation to Localize Objects in Underwater Acoustic Images
    Akshaya, B.
    Narmadha, V
    Sharmila, Sree T.
    Rajendran, V
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES, 2015,
  • [37] Unusual Event Detection using Sparse Spatio-Temporal Features and Bag of Words Model
    Mandadi, Balakrishna
    Sethi, Amit
    [J]. 2013 IEEE SECOND INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP), 2013, : 629 - 634
  • [38] Abnormal Event Detection Method in Surveillance Video Based on Temporal CNN and Sparse Optical Flow
    Xia, Hongxia
    Li, Ting
    Liu, Wenxuan
    Zhong, Xian
    Yuan, JingLing
    [J]. ICCDE 2019: PROCEEDINGS OF THE 2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING AND DATA ENGINEERING, 2019, : 90 - 94
  • [39] Robust Infrared Small Target Detection Via Temporal Low-rank and Sparse Representation
    Wei, Haoyang
    Tan, Yihua
    Lin, Jin
    [J]. 2016 3RD INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2016, : 583 - 587
  • [40] Dim moving target detection algorithm based on spatio-temporal classification sparse representation
    Li, Zhengzhou
    Dai, Zhen
    Fu, Hongxia
    Hou, Qian
    Wang, Zhen
    Yang, Lijiao
    Jin, Gang
    Liu, Changju
    Li, Ruzhang
    [J]. Infrared Physics and Technology, 2014, 67 : 273 - 282