Handcrafted features and late fusion with deep learning for bird sound classification

Cited by: 63
Authors
Xie, Jie [1, 3]
Zhu, Mingying [2]
Affiliations
[1] Jiangnan Univ, Sch Internet Things Engn, Minist Educ, Key Lab Adv Proc Control Light Ind, Wuxi 214122, Jiangsu, Peoples R China
[2] Univ Ottawa, Dept Econ, Ottawa, ON K1N 6N5, Canada
[3] Jiangnan Univ, Jiangsu Key Lab Adv Food Mfg Equipment & Technol, Wuxi, Jiangsu, Peoples R China
Keywords
Bird sound classification; Convolutional neural networks; Acoustic feature; Visual feature; Acoustic classification
DOI
10.1016/j.ecoinf.2019.05.007
Chinese Library Classification (CLC) number
Q14 [Ecology (bioecology)]
Discipline classification code
071012; 0713
Abstract
Automated classification of calling bird species is useful for large-scale temporal and spatial environmental monitoring. In this paper, we investigate acoustic features, visual features, and deep learning for bird sound classification. For the deep learning approach, Convolutional Neural Network layers are used for learning generalized features and for dimension reduction, while a conventional fully connected layer is used for classification. A unified end-to-end model is then built by combining those three layers to classify calling bird species. For the visual and acoustic features, two traditional classifiers are compared for classifying the bird sounds. Experimental results on 14 bird species indicate that the proposed deep learning method achieves the best F1-score of 94.36%, which is higher than that of the acoustic features approach (88.97%) and the visual features approach (88.87%). To further improve classification performance, a class-based late fusion method is explored. The final best classification F1-score is 95.95%, which is obtained by the late fusion of the acoustic features approach, the visual features approach, and deep learning.
Pages: 74-81
Number of pages: 8
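
The abstract mentions a class-based late fusion of three classifiers but does not give the fusion rule. The following is a minimal sketch of one plausible reading, not the authors' method: each model's per-class probability is weighted by a per-class reliability score (for example, that model's validation F1-score on that class), and the weighted scores are averaged. The function names, the weighting rule, and all numbers are hypothetical and chosen only for illustration.

```python
from typing import Dict, List


def class_based_late_fusion(
    probs: Dict[str, List[float]],
    class_weights: Dict[str, List[float]],
) -> List[float]:
    """Fuse per-class scores from several models for one audio clip.

    probs:         model name -> class probabilities for the clip
    class_weights: model name -> per-class weight (assumed here to be a
                   per-class validation F1-score; this is an assumption,
                   not a rule taken from the paper)
    """
    n_classes = len(next(iter(probs.values())))
    fused = []
    for c in range(n_classes):
        total_w = sum(class_weights[m][c] for m in probs)
        if total_w == 0:
            # No model is trusted for this class: fall back to a plain mean.
            fused.append(sum(probs[m][c] for m in probs) / len(probs))
        else:
            weighted = sum(class_weights[m][c] * probs[m][c] for m in probs)
            fused.append(weighted / total_w)
    return fused


def predict(fused: List[float]) -> int:
    """Return the index of the class with the highest fused score."""
    return max(range(len(fused)), key=fused.__getitem__)


# Hypothetical three-class example with made-up numbers.
probs = {
    "acoustic": [0.6, 0.3, 0.1],
    "visual": [0.2, 0.5, 0.3],
    "cnn": [0.3, 0.4, 0.3],
}
class_weights = {
    "acoustic": [0.90, 0.50, 0.80],
    "visual": [0.70, 0.90, 0.80],
    "cnn": [0.95, 0.90, 0.95],
}
fused = class_based_late_fusion(probs, class_weights)
label = predict(fused)
```

In this toy example the acoustic model alone would pick class 0, but its low weight on class 1 is outvoted by the other two models, so the fused prediction is class 1. Weighting per class rather than per model lets a classifier contribute strongly only on the species it separates well.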