Class-wise Centroid Distance Metric Learning for Acoustic Event Detection

被引:4
|
作者
Lu, Xugang [1 ]
Shen, Peng [1 ]
Li, Sheng [1 ]
Tsao, Yu [2 ]
Kawai, Hisashi [1 ]
机构
[1] Natl Inst Informat & Commun Technol, Tokyo, Japan
[2] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei, Taiwan
来源
关键词
acoustic event detection; distance metric learning; class centroids; convolutional neural network; NEURAL-NETWORK MODEL;
D O I
10.21437/Interspeech.2019-2271
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Designing good feature extraction and classifier models is essential for obtaining high performances of acoustic event detection (AED) systems. Current state-of-the-art algorithms are based on deep neural network models that jointly learn the feature representation and classifier models. As a typical pipeline in these algorithms, several network layers with nonlinear transforms are stacked for feature extraction, and a classifier layer with a softmax transform is applied on top of these extracted features to obtain normalized probability outputs. This pipeline is directly connected to a final goal for class discrimination without explicitly considering how the features should be distributed for inter-class and intra-class samples. In this paper, we explicitly add a distance metric constraint on feature extraction process with a goal to reduce intra-class sample distances and increase inter-class sample distances. Rather than estimating the pair-wise distances of samples, the distances are efficiently calculated between samples and class cluster centroids. With this constraint, the learned features have a good property for improving the generalization of the classification models. AED experiments on an urban sound classification task were carried out to test the algorithm. Results showed that the proposed algorithm efficiently improved the performance on the current state-of-the-art deep learning algorithms.
引用
收藏
页码:3614 / 3618
页数:5
相关论文
共 50 条
  • [1] Class-wise Deep Dictionary Learning
    Singhal, Vanika
    Khurana, Prerna
    Majumdar, Angshul
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1125 - 1132
  • [2] Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection
    Liang, Yunhao
    Long, Yanhua
    Li, Yijie
    Liang, Jiaen
    [J]. INTERSPEECH 2022, 2022, : 1496 - 1500
  • [3] Class-wise dictionary learning for hyperspectral image classification
    Hao, Siyuan
    Wang, Wei
    Yan, Yan
    Bruzzone, Lorenzo
    [J]. NEUROCOMPUTING, 2017, 220 : 121 - 129
  • [4] A Novel Class-wise Forgetting Detector in Continual Learning
    Pham, Xuan Cuong
    Liew, Alan Wee-chung
    Wang, Can
    [J]. 2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 518 - 525
  • [5] Class-wise Metric Scaling for Improved Few-Shot Classification
    Liu, Ge
    Zhao, Linglan
    Li, Wei
    Guo, Dashan
    Fang, Xiangzhong
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 586 - 595
  • [6] Improved Open World Object Detection Using Class-Wise Feature Space Learning
    Iqbal, Muhammad Ali
    Yoon, Yeo Chan
    Khan, Muhammad U. S.
    Kim, Soo Kyun
    [J]. IEEE ACCESS, 2023, 11 : 131221 - 131236
  • [7] CCL: CLASS-WISE CURRICULUM LEARNING FOR CLASS IMBALANCE PROBLEMS.
    Escudero-Vinolo, Marcos
    Lopez-Cifuentes, Alejandro
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1476 - 1480
  • [8] Class-Wise Denoising for Robust Learning Under Label Noise
    Gong, Chen
    Ding, Yongliang
    Han, Bo
    Niu, Gang
    Yang, Jian
    You, Jane
    Tao, Dacheng
    Sugiyama, Masashi
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 2835 - 2848
  • [9] Class-wise Thresholding for Robust Out-of-Distribution Detection
    Guarrera, Matteo
    Jin, Baihong
    Lin, Tung-Wei
    Zuluaga, Maria A.
    Chen, Yuxin
    Sangiovanni-Vincentelli, Alberto
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2836 - 2845
  • [10] Class-wise boundary regression by uncertainty in temporal action detection
    Chen, Yunze
    Chen, Mengjuan
    Gu, Qingyi
    [J]. IET IMAGE PROCESSING, 2022, 16 (14) : 3854 - 3862