CDPNet: conformer-based dual path joint modeling network for bird sound recognition

被引:0
|
作者
Huimin Guo
Haifang Jian
Yiyu Wang
Hongchang Wang
Shuaikang Zheng
Qinghua Cheng
Yuehao Li
机构
[1] Institute of Semiconductors,Laboratory of Solid State Optoelectronics Information Technology
[2] Chinese Academy of Sciences,undefined
[3] University of Chinese Academy of Sciences,undefined
[4] Shandong Normal University,undefined
来源
Applied Intelligence | 2024年 / 54卷
关键词
Bird sound recognition; Long-term time dependence; Long-term frequency dependence; Multi-scale feature fusion; Monitoring system;
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
页码:3152 / 3168
页数:16
相关论文
共 50 条
  • [1] CDPNet: conformer-based dual path joint modeling network for bird sound recognition
    Guo, Huimin
    Jian, Haifang
    Wang, Yiyu
    Wang, Hongchang
    Cheng, Qinghua
    Zheng, Shuaikang
    Li, Yuehao
    APPLIED INTELLIGENCE, 2024, 54 (04) : 3152 - 3168
  • [2] Sampleformer: An efficient conformer-based Neural Network for Automatic Speech Recognition
    Fan, Zeping
    Zhang, Xuejun
    Huang, Min
    Bu, Zhaohui
    INTELLIGENT DATA ANALYSIS, 2024, 28 (06) : 1647 - 1659
  • [3] Efficient conformer-based speech recognition with linear attention
    Li, Shengqiang
    Xu, Menglong
    Zhang, Xiao-Lei
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 448 - 453
  • [4] Conformer-Based Human Activity Recognition Using Inertial Measurement Units
    Seenath, Sowmiya
    Dharmaraj, Menaka
    SENSORS, 2023, 23 (17)
  • [5] Efficient Conformer-Based CTC Model for Intelligent Cockpit Speech Recognition
    Guo, Hanzhi
    Chen, Yunshu
    Xie, Xukang
    Xu, Gaopeng
    Guo, Wei
    2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 522 - 526
  • [6] A Robust Conformer-Based Speech Recognition Model for Mandarin Air Traffic Control
    Jiang, Peiyuan
    Pan, Weijun
    Zhang, Jian
    Wang, Teng
    Huang, Junxiang
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01): : 911 - 940
  • [7] CONFORMER-BASED SPEECH RECOGNITION WITH LINEAR NYSTROM ATTENTION AND ROTARY POSITION EMBEDDING
    Samarakoon, Lahiru
    Leung, Tsun-Yat
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8012 - 8016
  • [8] Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
    Li, Shengqiang
    Xu, Menglong
    Zhang, Xiao-Lei
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 443 - 447
  • [9] Conformer-Based Speaker Recognition Model for Real-Time Multi-Scenarios
    Xuan, Xi
    Han, Runping
    Gao, Jingxin
    Computer Engineering and Applications, 2024, 60 (07) : 147 - 156
  • [10] Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition
    Audhkhasi, Kartik
    Huang, Yinghui
    Ramabhadran, Bhuvana
    Moreno, Pedro J.
    INTERSPEECH 2022, 2022, : 1026 - 1030