Deep transfer learning-based bird species classification using mel spectrogram images

被引:0
|
作者
Baowaly, Mrinal Kanti [1 ]
Sarkar, Bisnu Chandra [1 ]
Walid, Md. Abul Ala [2 ,3 ]
Ahamad, Md. Martuza [1 ]
Singh, Bikash Chandra [4 ]
Alvarado, Eduardo Silva [5 ,6 ,7 ]
Ashraf, Imran [8 ]
Samad, Md. Abdus [8 ]
机构
[1] Bangabandhu Sheikh Mujibur Rahman Sci & Technol Un, Dept Comp Sci & Engn, Gopalganj, Bangladesh
[2] Khulna Univ Engn & Technol, Dept Comp Sci & Engn, Khulna, Bangladesh
[3] Bangabandhu Sheikh Mujibur Rahman Digital Univ, Dept Data Sci, Kaliakair, Bangladesh
[4] Old Dominion Univ, Sch Cybersecur, Norfolk, VA USA
[5] Univ Europea Atlantico, Santander, Spain
[6] Univ Int Iberoamericana, Campeche, Mexico
[7] Univ La Romana, La Romana, Dominican Rep
[8] Yeungnam Univ, Dept Informat & Commun Engn, Gyongsan, Gyeongbuk Do, South Korea
来源
PLOS ONE | 2024年 / 19卷 / 08期
关键词
D O I
10.1371/journal.pone.0305708
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The classification of bird species is of significant importance in the field of ornithology, as it plays an important role in assessing and monitoring environmental dynamics, including habitat modifications, migratory behaviors, levels of pollution, and disease occurrences. Traditional methods of bird classification, such as visual identification, were time-intensive and required a high level of expertise. However, audio-based bird species classification is a promising approach that can be used to automate bird species identification. This study aims to establish an audio-based bird species classification system for 264 Eastern African bird species employing modified deep transfer learning. In particular, the pre-trained EfficientNet technique was utilized for the investigation. The study adapts the fine-tune model to learn the pertinent patterns from mel spectrogram images specific to this bird species classification task. The fine-tuned EfficientNet model combined with a type of Recurrent Neural Networks (RNNs) namely Gated Recurrent Unit (GRU) and Long short-term memory (LSTM). RNNs are employed to capture the temporal dependencies in audio signals, thereby enhancing bird species classification accuracy. The dataset utilized in this work contains nearly 17,000 bird sound recordings across a diverse range of species. The experiment was conducted with several combinations of EfficientNet and RNNs, and EfficientNet-B7 with GRU surpasses other experimental models with an accuracy of 84.03% and a macro-average precision score of 0.8342.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Automatic Classification of Bird Sounds: Using MFCC and Mel Spectrogram Features with Deep Learning
    Carvalho, Silvestre
    Gomes, Elsa Ferreira
    VIETNAM JOURNAL OF COMPUTER SCIENCE, 2023, 10 (01) : 39 - 54
  • [2] Machine Learning-Based Classification of Mosquito Wing Beats Using Mel Spectrogram Images and Ensemble Modeling
    Vamsi, Bandi
    Al Bataineh, Ali
    Doppala, Bhanu Prakash
    TRAITEMENT DU SIGNAL, 2024, 41 (04) : 2093 - 2101
  • [3] Comparison of the Effects of Mel Coefficients and Spectrogram Images via Deep Learning in Emotion Classification
    Demircan, Semiye
    Ornek, Humar Kahramanli
    TRAITEMENT DU SIGNAL, 2020, 37 (01) : 51 - 57
  • [4] Heart Sound Classification Using Deep Learning Techniques Based on Log-mel Spectrogram
    Minh Tuan Nguyen
    Wei Wen Lin
    Jin H. Huang
    Circuits, Systems, and Signal Processing, 2023, 42 : 344 - 360
  • [5] Heart Sound Classification Using Deep Learning Techniques Based on Log-mel Spectrogram
    Minh Tuan Nguyen
    Lin, Wei Wen
    Huang, Jin H.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2023, 42 (01) : 344 - 360
  • [6] Transfer Learning-Based Tongue Gestures Classification Using Ultrasound Images
    Xu, Kele
    Feng, Dawei
    Zou, Shun
    MEDICAL PHYSICS, 2018, 45 (06) : E234 - E234
  • [7] On the Effect of Log-Mel Spectrogram Parameter Tuning for Deep Learning-Based Speech Emotion Recognition
    Mukhamediya, Azamat
    Fazli, Siamac
    Zollanvari, Amin
    IEEE ACCESS, 2023, 11 : 61950 - 61957
  • [8] Deep Learning-Based Object Classification for Spectral Images
    Jacome, Roman
    Lopez, Carlos
    Garcia, Hans
    Arguello, Henry
    APPLICATIONS OF COMPUTATIONAL INTELLIGENCE, COLCACI 2020, 2021, 1346 : 147 - 159
  • [9] Deep Transfer Learning-Based Automated Identification of Bird Song
    Das, Nabanita
    Padhy, Neelamadhab
    Dey, Nilanjan
    Bhattacharya, Sudipta
    Tavares, Joao Manuel R. S.
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2023, 8 (04): : 33 - 45
  • [10] Spectrogram Segmentation for Bird Species Classification based on Temporal Continuity
    Towhid, Md. Shamim
    Rahman, Md. Mijanur
    2017 20TH INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2017,