Handcrafted features and late fusion with deep learning for bird sound classification

被引:63
|
作者
Xie, Jie [1 ,3 ]
Zhu, Mingying [2 ]
机构
[1] Jiangnan Univ, Sch Internet Things Engn, Minist Educ, Key Lab Adv Proc Control Light Ind, Wuxi 214122, Jiangsu, Peoples R China
[2] Univ Ottawa, Dept Econ, Ottawa, ON K1N 6N5, Canada
[3] Jiangnan Univ, Jiangsu Key Lab Adv Food Mfg Equipment & Technol, Wuxi, Jiangsu, Peoples R China
关键词
Bird sound classification; Convolutional neural networks; Acoustic feature; Visual feature; ACOUSTIC CLASSIFICATION;
D O I
10.1016/j.ecoinf.2019.05.007
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Automated classification of calling bird species is useful for large-scale temporal and spatial environmental monitoring. In this paper, we investigate acoustic features, visual features, and deep learning for bird sound classification. For the deep learning approach, the Convolutional Neural Network layers are used for learning generalized features and dimension reduction, while a conventional fully connected layer is used for classification. Then, an unified end-to-end model is built by combing those three layers for classifying calling bird species. For visual and acoustic features, two traditional classifiers are compared to classify the bird sounds. Experimental results on 14 bird species indicate that our proposed deep learning method can achieve the best F1-score 94.36%, which is higher than using the acoustic features approach (88.97%) and using the visual features approach (88.87%). To further improve the classification performance, a class-based late fusion method is explored. Our final best classification F1-score is 95.95%, which is obtained by the late fusion of the acoustic features approach, the visual features approach, and deep learning.
引用
收藏
页码:74 / 81
页数:8
相关论文
共 50 条
  • [31] Breast cancer classification using deep learned features boosted with handcrafted features
    Sajid, Unaiza
    Khan, Rizwan Ahmed
    Shah, Shahid Munir
    Arif, Sheeraz
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
  • [32] Interpreting Deep Learning Features for Myoelectric Control: A Comparison With Handcrafted Features
    Cote-Allard, Ulysse
    Campbell, Evan
    Phinyomark, Angkoon
    Laviolette, Francois
    Gosselin, Benoit
    Scheme, Erik
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2020, 8
  • [33] Fusion of handcrafted edge and residual learning features for image colorization
    Deshpande, Shabdali C.
    Pawer, Meenakshi M.
    Atkale, Dipali V.
    Yadav, Dhanashree M.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (02) : 291 - 299
  • [34] Fusion of handcrafted edge and residual learning features for image colorization
    Shabdali C. Deshpande
    Meenakshi M. Pawer
    Dipali V. Atkale
    Dhanashree M. Yadav
    Signal, Image and Video Processing, 2022, 16 : 291 - 299
  • [35] Fusion of Handcrafted and Deep-Learning Features for Brain Tumor Detection and Classification Using T1-Weighted Magnetic Resonance Images
    S. Hanumanthappa
    C. D. Guruprakash
    SN Computer Science, 5 (8)
  • [36] Fusion of Handcrafted and Deep Learning Features for Large-scale Multiple Iris Presentation Attack Detection
    Yadav, Daksha
    Kohli, Naman
    Agarwal, Akshay
    Vatsa, Mayank
    Singh, Richa
    Noore, Afzel
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 685 - 692
  • [37] Gated fusion of handcrafted and deep features for robust automatic pronunciation assessment
    Lin, Binghuai
    Wang, Liyuan
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1399 - 1404
  • [38] Combining Deep Learning with Handcrafted Features for Cell Nuclei Segmentation
    Narotamo, Hemaxi
    Sanches, J. Miguel
    Silveira, Margarida
    42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 1428 - 1431
  • [39] Feature Fusion of Deep Spatial Features and Handcrafted Spatiotemporal Features for Human Action Recognition
    Uddin, Md Azher
    Lee, Young-Koo
    SENSORS, 2019, 19 (07)
  • [40] Overview of handcrafted features and deep learning models for leaf recognition
    Isik, Sahin
    Ozkan, Kemal
    JOURNAL OF ENGINEERING RESEARCH, 2021, 9 (01):