Data-Dependent Feature Extraction Method Based on Non-Negative Matrix Factorization for Weakly Supervised Domestic Sound Event Detection

被引:3
|
作者
Lee, Seokjin [1 ,2 ]
Kim, Minhan [1 ]
Shin, Seunghyeon [1 ]
Park, Sooyoung [3 ]
Jeong, Youngho [3 ]
机构
[1] Kyungpook Natl Univ, Sch Elect & Elect Engn, Daegu 41566, South Korea
[2] Kyungpook Natl Univ, Sch Elect Engn, Daegu 41566, South Korea
[3] Elect & Telecommun Res Inst, Media Res Div, Daejeon 34129, South Korea
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 03期
关键词
feature extraction; sound event detection; non-negative matrix factorization;
D O I
10.3390/app11031040
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In this paper, feature extraction methods are developed based on the non-negative matrix factorization (NMF) algorithm to be applied in weakly supervised sound event detection. Recently, the development of various features and systems have been attempted to tackle the problems of acoustic scene classification and sound event detection. However, most of these systems use data-independent spectral features, e.g., Mel-spectrogram, log-Mel-spectrum, and gammatone filterbank. Some data-dependent feature extraction methods, including the NMF-based methods, recently demonstrated the potential to tackle the problems mentioned above for long-term acoustic signals. In this paper, we further develop the recently proposed NMF-based feature extraction method to enable its application in weakly supervised sound event detection. To achieve this goal, we develop a strategy for training the frequency basis matrix using a heterogeneous database consisting of strongly- and weakly-labeled data. Moreover, we develop a non-iterative version of the NMF-based feature extraction method so that the proposed feature extraction method can be applied as a part of the model structure similar to the modern "on-the-fly" transform method for the Mel-spectrogram. To detect the sound events, the temporal basis is calculated using the NMF method and then used as a feature for the mean-teacher-model-based classifier. The results are improved for the event-wise post-processing method. To evaluate the proposed system, simulations of the weakly supervised sound event detection were conducted using the Detection and Classification of Acoustic Scenes and Events 2020 Task 4 database. The results reveal that the proposed system has F1-score performance comparable with the Mel-spectrogram and gammatonegram and exhibits 3-5% better performance than the log-Mel-spectrum and constant-Q transform.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [1] FULLY SUPERVISED NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE EXTRACTION
    Austin, Woody
    Anderson, Dylan
    Ghosh, Joydeep
    [J]. IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 5772 - 5775
  • [2] A Survey of Polyphonic Sound Event Detection Based on Non-negative Matrix Factorization
    Manh-Quan Bui
    Viet-Hang Duong
    Mathulaprangsan, Seksan
    Bach-Tung Pham
    Lee, Wei-Jing
    Wang, Jia-Ching
    [J]. 2016 INTERNATIONAL COMPUTER SYMPOSIUM (ICS), 2016, : 351 - 354
  • [3] Label propagation based semi-supervised non-negative matrix factorization for feature extraction
    Yi, Yugen
    Shi, Yanjiao
    Zhang, Huijie
    Wang, Jianzhong
    Kong, Jun
    [J]. NEUROCOMPUTING, 2015, 149 : 1021 - 1037
  • [4] A New Feature Extraction and Recognition Method for Microexpression Based on Local Non-negative Matrix Factorization
    Gao, Junli
    Chen, Huajun
    Zhang, Xiaohua
    Guo, Jing
    Liang, Wenyu
    [J]. FRONTIERS IN NEUROROBOTICS, 2020, 14
  • [5] Non-negative matrix factorization based text mining: Feature extraction and classification
    Barman, P. C.
    Iqbal, Nadeem
    Lee, Soo-Young
    [J]. NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 703 - 712
  • [6] Image Fusion Based on Non-negative Matrix Factorization and Infrared Feature Extraction
    Mou, Jiao
    Gao, Wei
    Song, Zongxi
    [J]. 2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), VOLS 1-3, 2013, : 1046 - 1050
  • [7] Masked Non-negative Matrix Factorization for Bird Detection Using Weakly Labeled Data
    Sobieraj, Iwona
    Kong, Qiuqiang
    Plumbley, Mark D.
    [J]. 2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 1769 - 1773
  • [8] Structure Constrained Discriminative Non-negative Matrix Factorization for Feature Extraction
    Jin, Yan
    Wei, Lisi
    Yi, Yugen
    Wang, Jianzhong
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, 2014, 8589 : 645 - 657
  • [9] Non-negative matrix factorization for semi-supervised data clustering
    Chen, Yanhua
    Rege, Manjeet
    Dong, Ming
    Hua, Jing
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 17 (03) : 355 - 379
  • [10] Non-negative matrix factorization for semi-supervised data clustering
    Yanhua Chen
    Manjeet Rege
    Ming Dong
    Jing Hua
    [J]. Knowledge and Information Systems, 2008, 17 : 355 - 379