A FRAME LOSS OF MULTIPLE INSTANCE LEARNING FOR WEAKLY SUPERVISED SOUND EVENT DETECTION

被引:2
|
作者
Wang, Xu [1 ]
Zhang, Xiangjinzi [1 ]
Zi, Yunfei [1 ]
Xiong, Shengwu [1 ]
机构
[1] Wuhan Univ Technol, Sch Comp & Artificial Intelligence, Wuhan, Peoples R China
关键词
Sound event detection (SED); weak labeling; multiple instance learning (MIL); loss function;
D O I
10.1109/ICASSP43922.2022.9746435
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Sound event detection(SED) consists of two subtasks: predicting the classes of sound events within an audio clip (audio tagging) and indicating the onset and offset times for each event (localization). One of the common approaches for SED with weak label is multiple instance learning (MIL) method. However, the general MIL method only optimizes the global loss calculated from the aggregated clip-wise predictions and weak clip labels, lacking a direct constraint on the frame-wise predictions, which leads to a large number of unreasonable prediction values. To address this issue, we explore the deterministic information that can be used to constrain the framewise predictions and based on which we design a frame loss with two terms. Experimental results on the DCASE2017 Task4 dataset demonstrate that the proposed loss can improve the performance of general MIL method. While this article focuses on SED applications, the proposed methods could be applied widely to MIL problems. Code will be available at WSSED.
引用
收藏
页码:331 / 335
页数:5
相关论文
共 50 条
  • [1] Contrastive Transformer-Based Multiple Instance Learning for Weakly Supervised Polyp Frame Detection
    Tian, Yu
    Pang, Guansong
    Liu, Fengbei
    Liu, Yuyuan
    Wang, Chong
    Chen, Yuanhong
    Verjans, Johan
    Carneiro, Gustavo
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT III, 2022, 13433 : 88 - 98
  • [2] Multiple Instance Deep Learning for Weakly Supervised Small-Footprint Audio Event Detection
    Tseng, Shao-Yen
    Li, Juncheng
    Wang, Yun
    Metze, Florian
    Szurley, Joseph
    Das, Samarjit
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3279 - 3283
  • [3] Discrepant multiple instance learning for weakly supervised object detection
    Gao, Wei
    Wan, Fang
    Yue, Jun
    Xu, Songcen
    Ye, Qixiang
    [J]. PATTERN RECOGNITION, 2022, 122
  • [4] Continuation Multiple Instance Learning for Weakly and Fully Supervised Object Detection
    Ye, Qixiang
    Wan, Fang
    Liu, Chang
    Huang, Qingming
    Ji, Xiangyang
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5452 - 5466
  • [5] Unbiased Multiple Instance Learning for Weakly Supervised Video Anomaly Detection
    Lv, Hui
    Yue, Zhongqi
    Sun, Qianru
    Luo, Bin
    Cui, Zhen
    Zhang, Hanwang
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8022 - 8031
  • [6] Multiple Instance Graph Learning for Weakly Supervised Remote Sensing Object Detection
    Wang, Binglu
    Zhao, Yongqiang
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [7] Multiple Instance Graph Learning for Weakly Supervised Remote Sensing Object Detection
    Wang, Binglu
    Zhao, Yongqiang
    Li, Xuelong
    [J]. IEEE Transactions on Geoscience and Remote Sensing, 2022, 60
  • [8] Normality Guided Multiple Instance Learning for Weakly Supervised Video Anomaly Detection
    Park, Seongheon
    Kim, Hanjae
    Kim, Minsu
    Kim, Dahye
    Sohn, Kwanghoon
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2664 - 2673
  • [9] A MULTI-TASK LEARNING METHOD FOR WEAKLY SUPERVISED SOUND EVENT DETECTION
    Liu, Sichen
    Yang, Feiran
    Kang, Fang
    Yang, Jun
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8802 - 8806
  • [10] AFFINITY MIXUP FOR WEAKLY SUPERVISED SOUND EVENT DETECTION
    Izadi, Mohammad Rasool
    Stevenson, Robert
    Kloepper, Laura
    [J]. 2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,