Frame-Level Embedding Learning for Few-shot Bioacoustic Event Detection

Cited by: 0
Authors
Zhang, Xueyang [1 ]
Wang, Shuxian [2 ]
Du, Jun [2 ]
Yan, Genwei [3 ]
Tang, Jigang [1 ]
Gao, Tian [1 ]
Fang, Xin [1 ]
Pan, Jia [1 ]
Gao, Jianqing [1 ]
Affiliations
[1] iFlytek Res, Hefei, Peoples R China
[2] Univ Sci & Technol China, Hefei, Peoples R China
[3] China Univ Min & Technol, Xuzhou, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
DCASE; few-shot bioacoustic event detection; frame-level embedding learning; transfer learning;
DOI
10.1109/ICME55011.2023.00134
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We propose an effective frame-level embedding learning framework for few-shot bioacoustic event detection (FSBED). First, because the durations of different animal calls vary greatly, we propose a frame-level embedding learning scheme that obtains adaptive event receptive fields from more accurate frame-level units. Next, we develop a transfer-learning-based approach to handle the mismatch between training and test data. Finally, we draw on semi-supervised learning to address the scarcity of labeled data in few-shot learning. By combining these techniques, our overall system ranked first in the FSBED task of the Detection and Classification of Acoustic Scenes and Events (DCASE) Challenge 2022.
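The abstract does not describe how frame-level outputs become detected events, so the following is only a minimal sketch of one common way to do it, not the authors' method. It assumes a model has already produced one probability per frame, and it merges consecutive above-threshold frames into events. This illustrates why frame-level units allow event lengths to adapt to the call instead of being fixed by a segment size. The function name `frames_to_events` and the parameters `hop_seconds` and `threshold` are hypothetical.

```python
from typing import List, Tuple


def frames_to_events(
    frame_probs: List[float],
    hop_seconds: float,
    threshold: float = 0.5,
) -> List[Tuple[float, float]]:
    """Merge runs of above-threshold frames into (onset, offset) pairs in seconds.

    Assumption: frame i covers the interval [i * hop_seconds, (i + 1) * hop_seconds).
    """
    events: List[Tuple[float, float]] = []
    start = None  # index of the first frame in the current active run, if any
    for i, prob in enumerate(frame_probs):
        active = prob >= threshold
        if active and start is None:
            start = i  # a new event begins at this frame
        elif not active and start is not None:
            # the run ended at the previous frame; close the event
            events.append((start * hop_seconds, i * hop_seconds))
            start = None
    if start is not None:
        # the last run reaches the end of the recording
        events.append((start * hop_seconds, len(frame_probs) * hop_seconds))
    return events


# Hypothetical per-frame probabilities with a 0.5 s hop between frames.
example_events = frames_to_events([0.1, 0.9, 0.8, 0.2, 0.7], hop_seconds=0.5)
```

Here a two-frame call and a one-frame call are both recovered with their own durations; a real system would typically add smoothing and minimum-duration filtering, which are omitted in this sketch.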
Pages: 750-755
Page count: 6
Related papers
50 in total
  • [1] Transductive Feature Space Regularization for Few-shot Bioacoustic Event Detection
    Tan, Yizhou
    Ai, Haojun
    Li, Shengchen
    Zhang, Feng
    [J]. INTERSPEECH 2023, 2023, : 571 - 575
  • [2] Instance-Level Embedding Adaptation for Few-Shot Learning
    Hao, Fusheng
    Cheng, Jun
    Wang, Lei
    Cao, Jianzhong
    [J]. IEEE ACCESS, 2019, 7 : 100501 - 100511
  • [3] Active Few-Shot Learning for Sound Event Detection
    Wang, Yu
    Cartwright, Mark
    Bello, Juan Pablo
    [J]. INTERSPEECH 2022, 2022, : 1551 - 1555
  • [4] Extensively Matching for Few-shot Learning Event Detection
    Viet Dac Lai
    Dernoncourt, Franck
    Thien Huu Nguyen
    [J]. NARRATIVE UNDERSTANDING, STORYLINES, AND EVENTS, 2020, : 38 - 45
  • [5] Active few-shot learning for rare bioacoustic feature annotation
    McEwen, Ben
    Soltero, Kaspar
    Gutschmidt, Stefanie
    Bainbridge-Smith, Andrew
    Atlas, James
    Green, Richard
    [J]. ECOLOGICAL INFORMATICS, 2024, 82
  • [6] FEW-SHOT SOUND EVENT DETECTION
    Wang, Yu
    Salamon, Justin
    Bryan, Nicholas J.
    Bello, Juan Pablo
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 81 - 85
  • [7] Few-shot Incremental Event Detection
    Wang, Hao
    Shi, Hanwen
    Duan, Jianyong
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (02)
  • [8] Graph Learning Regularization and Transfer Learning for Few-Shot Event Detection
    Viet Dac Lai
    Minh Van Nguyen
    Thien Huu Nguyen
    Dernoncourt, Franck
    [J]. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 2172 - 2176
  • [9] A MUTUAL LEARNING FRAMEWORK FOR FEW-SHOT SOUND EVENT DETECTION
    Yang, Dongchao
    Wang, Helin
    Zou, Yuexian
    Ye, Zhongjie
    Wang, Wenwu
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 811 - 815
  • [10] FEW-SHOT ACOUSTIC EVENT DETECTION VIA META LEARNING
    Shi, Bowen
    Sun, Ming
    Puvvada, Krishna C.
    Kao, Chieh-Chi
    Matsoukas, Spyros
    Wang, Chao
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 76 - 80