Handcrafted features and late fusion with deep learning for bird sound classification

被引:63
|
作者
Xie, Jie [1 ,3 ]
Zhu, Mingying [2 ]
机构
[1] Jiangnan Univ, Sch Internet Things Engn, Minist Educ, Key Lab Adv Proc Control Light Ind, Wuxi 214122, Jiangsu, Peoples R China
[2] Univ Ottawa, Dept Econ, Ottawa, ON K1N 6N5, Canada
[3] Jiangnan Univ, Jiangsu Key Lab Adv Food Mfg Equipment & Technol, Wuxi, Jiangsu, Peoples R China
关键词
Bird sound classification; Convolutional neural networks; Acoustic feature; Visual feature; ACOUSTIC CLASSIFICATION;
D O I
10.1016/j.ecoinf.2019.05.007
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Automated classification of calling bird species is useful for large-scale temporal and spatial environmental monitoring. In this paper, we investigate acoustic features, visual features, and deep learning for bird sound classification. For the deep learning approach, the Convolutional Neural Network layers are used for learning generalized features and dimension reduction, while a conventional fully connected layer is used for classification. Then, an unified end-to-end model is built by combing those three layers for classifying calling bird species. For visual and acoustic features, two traditional classifiers are compared to classify the bird sounds. Experimental results on 14 bird species indicate that our proposed deep learning method can achieve the best F1-score 94.36%, which is higher than using the acoustic features approach (88.97%) and using the visual features approach (88.87%). To further improve the classification performance, a class-based late fusion method is explored. Our final best classification F1-score is 95.95%, which is obtained by the late fusion of the acoustic features approach, the visual features approach, and deep learning.
引用
收藏
页码:74 / 81
页数:8
相关论文
共 50 条
  • [1] Late fusion of deep learning and handcrafted visual features for biomedical image modality classification
    Lee, Sheng Long
    Zare, Mohammad Reza
    Muller, Henning
    IET IMAGE PROCESSING, 2019, 13 (02) : 382 - 391
  • [2] Ensemble of handcrafted and deep features for urban sound classification
    Luz, Jederson S.
    Oliveira, Myllena C.
    Araujo, Flavio H. D.
    Magalhaes, Deborah M., V
    APPLIED ACOUSTICS, 2021, 175 (175)
  • [3] Improving mammography lesion classification by optimal fusion of handcrafted and deep transfer learning features
    Jones, Meredith A.
    Faiz, Rowzat
    Qiu, Yuchen
    Zheng, Bin
    PHYSICS IN MEDICINE AND BIOLOGY, 2022, 67 (05):
  • [4] Fusion of Handcrafted and Deep Transfer Learning Features to Improve Performance of Breast Lesion Classification
    Jones, Meredith A.
    Pham, Huong
    Gai, Tiancheng
    Zheng, Bin
    MEDICAL IMAGING 2022: COMPUTER-AIDED DIAGNOSIS, 2022, 12033
  • [5] Deep Learning and Handcrafted Features for Virus Image Classification
    Nanni, Loris
    De Luca, Eugenio
    Facin, Marco Ludovico
    Maguolo, Gianluca
    JOURNAL OF IMAGING, 2020, 6 (12)
  • [6] Deep Learning and Handcrafted Features for Thyroid Nodule Classification
    Maarouf, Ayoub Abderrazak
    Meriem, Hacini
    Hachouf, Fella
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (06)
  • [7] Incorporating Handcrafted Features into Deep Learning for Point Cloud Classification
    Hsu, Pai-Hui
    Zhuang, Zong-Yi
    REMOTE SENSING, 2020, 12 (22) : 1 - 28
  • [8] Optimal Fusion-Based Handcrafted with Deep Features for Brain Cancer Classification
    Ragab, Mahmoud
    Alshammari, Sultanah M.
    Asseri, Amer H.
    Almutiry, Waleed K.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (01): : 801 - 815
  • [9] Pedestrian's Intention Recognition, Fusion of Handcrafted Features in a Deep Learning Approach
    Hamed, Omar
    Steinhauer, H. Joe
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15795 - 15796
  • [10] AudioProtoPNet: An interpretable deep learning model for bird sound classification
    Heinrich, Rene
    Rauch, Lukas
    Sick, Bernhard
    Scholz, Christoph
    ECOLOGICAL INFORMATICS, 2025, 87