Spectrotemporal Analysis Using Local Binary Pattern Variants for Acoustic Scene Classification

Cited by: 27
Authors
Abidin, Shamsiah [1]
Togneri, Roberto [1]
Sohel, Ferdous [2]
Affiliations
[1] Univ Western Australia, Sch Elect Elect & Comp Engn, Perth, WA 6009, Australia
[2] Murdoch Univ, Sch Engn & Informat Technol, Murdoch, WA 6150, Australia
Keywords
Acoustic scene; feature extraction; fusion; local binary patterns; time-frequency analysis
DOI
10.1109/TASLP.2018.2854861
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
In this paper, we present an approach to acoustic scene classification that aggregates spectral and temporal features. We propose the first use of the variable-Q transform (VQT) to generate the time-frequency representation for acoustic scene classification. The VQT provides finer control over resolution than the constant-Q transform (CQT) or the short-time Fourier transform and can be tuned to better capture acoustic scene information. We then adopt a variant of the local binary pattern (LBP), the adjacent evaluation completed LBP (AECLBP), which is better suited to extracting features from acoustic time-frequency images. Our results yield a 5.2% improvement on the DCASE 2016 dataset compared to the application of standard CQT with LBP. Fusing our proposed AECLBP with HOG features, we achieve a classification accuracy of 85.5%, which outperforms one of the top-performing systems.
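As a rough illustration of the texture-descriptor idea the abstract builds on, the sketch below computes the basic 3x3 local binary pattern and its histogram over a small matrix standing in for a time-frequency image. It is not the AECLBP variant and does not compute a VQT; the function names `lbp_codes` and `lbp_histogram` and the toy input are hypothetical, chosen only for this example.

```python
def lbp_codes(image):
    """Basic 3x3 LBP code for each interior pixel of a 2D list of numbers."""
    # Neighbour offsets (row, col), clockwise from the top-left neighbour.
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    rows, cols = len(image), len(image[0])
    codes = []
    for r in range(1, rows - 1):
        row_codes = []
        for c in range(1, cols - 1):
            centre = image[r][c]
            code = 0
            for bit, (dr, dc) in enumerate(offsets):
                # Set the bit when the neighbour is at least as large as the centre.
                if image[r + dr][c + dc] >= centre:
                    code |= 1 << bit
            row_codes.append(code)
        codes.append(row_codes)
    return codes


def lbp_histogram(image):
    """Normalised 256-bin histogram of LBP codes, usable as a feature vector."""
    hist = [0] * 256
    count = 0
    for row in lbp_codes(image):
        for code in row:
            hist[code] += 1
            count += 1
    return [h / count for h in hist] if count else hist


# Hypothetical toy "time-frequency image" (rows: frequency bins, columns: frames).
toy_image = [
    [1, 2, 3, 4],
    [5, 6, 7, 8],
    [9, 10, 11, 12],
]
features = lbp_histogram(toy_image)
```

In a real pipeline the input would be a time-frequency magnitude image and the histogram would be passed to a classifier; those steps are outside the scope of this sketch.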
Pages: 2112 - 2121
Number of pages: 10