Classification of animal sounds in a hyperdiverse rainforest using convolutional neural networks with data augmentation

被引:0
|
作者
Sun, Yuren [1 ]
Maeda, Tatiana Midori [2 ,3 ]
Solis-Lemus, Claudia [4 ,5 ]
Pimentel-Alarcon, Daniel [4 ,6 ]
Burivalova, Zuzana [2 ,3 ]
机构
[1] Univ Wisconsin, Dept Comp Sci, Madison, WI USA
[2] Univ Wisconsin, Nelson Inst Environm Studies, Madison, WI 53706 USA
[3] Univ Wisconsin, Dept Forest & Wildlife Ecol, Madison, WI 53706 USA
[4] Univ Wisconsin, Wisconsin Inst Discovery, Madison, WI USA
[5] Univ Wisconsin, Dept Plant Pathol, Madison, WI USA
[6] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI USA
关键词
Bioacoustics; Convolutional neural network; Conservation; Data augmentation; Passive 30 acoustic monitoring; Sound classification; Tropical forest; Transfer learning; BIODIVERSITY; CONSERVATION;
D O I
10.1016/j.ecolind.2022.109621
中图分类号
X176 [生物多样性保护];
学科分类号
090705 ;
摘要
To protect tropical forest biodiversity, we need to be able to detect it reliably, cheaply, and at scale. Automated detection of sound producing animals from passively recorded soundscapes via machine-learning approaches is a promising technique towards this goal, but it is constrained by the necessity of large training data sets. Using soundscapes from a tropical forest in Borneo and a Convolutional Neural Network model (CNN), we investigate i) the minimum viable training data set size for accurate prediction of call types ('sonotypes'), and ii) the extent to which data augmentation and transfer learning can overcome the issue of small and imbalanced training data sets. We found that even relatively high sample sizes (>80 per sonotype) lead to mediocre accuracy, which however improved significantly with data augmentation and transfer learning, including at extremely small sample sizes (3 per sonotype), regardless of taxonomic group or call characteristics. Neither transfer learning nor data augmentation alone achieved high accuracy. Our results suggest that transfer learning and data augmen-tation could make the use of CNNs to classify species' vocalizations feasible even for small soundscape-based projects with many rare species. Retraining our open-source model requires only basic programming skills which makes it possible for individual conservation initiatives to match their local context, in order to enable more evidence-informed management of biodiversity.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] A Data Augmentation Methodology to Improve Age Estimation using Convolutional Neural Networks
    Oliveira, Italo de Pontes
    Peixoto Medeiros, Joao Lucas
    de Sousa, Vinicius Fernandes
    Teixeira Junior, Adalberto Gomes
    Pereira, Eanes Torres
    Gomes, Herman Martins
    [J]. 2016 29TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI), 2016, : 88 - 95
  • [32] Flipping Data Augmentation of Convolutional Neural Networks Using Discrete Cosine Transforms
    Ito, Izumi
    [J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1501 - 1505
  • [33] Pre-trained Convolutional Neural Networks for the Lung Sounds Classification
    Vaityshyn, Valentyn
    Porieva, Hanna
    Makarenkova, Anastasiia
    [J]. 2019 IEEE 39TH INTERNATIONAL CONFERENCE ON ELECTRONICS AND NANOTECHNOLOGY (ELNANO), 2019, : 522 - 525
  • [34] Classification of Elephant Sounds Using Parallel Convolutional Neural Network
    Leonid, T. Thomas
    Jayaparvathy, R.
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 32 (03): : 1415 - 1426
  • [35] Seabed Classification Using a Convolutional Neural Network on Explosive Sounds
    Howarth, Kira
    Neilsen, Tracianne B.
    Van Komen, David F.
    Knobles, David Paul
    [J]. IEEE JOURNAL OF OCEANIC ENGINEERING, 2022, 47 (03) : 670 - 679
  • [36] Data augmentation based morphological classification of galaxies using deep convolutional neural network
    Ansh Mittal
    Anu Soorya
    Preeti Nagrath
    D. Jude Hemanth
    [J]. Earth Science Informatics, 2020, 13 : 601 - 617
  • [37] Object classification on raw radar data using convolutional neural networks
    Han, Heejae
    Kim, Jeonghwan
    Park, Junyoung
    Lee, Yujin
    Jo, Hyunwoo
    Park, Yonghyeon
    Matson, Eric T.
    Park, Seongha
    [J]. 2019 IEEE SENSORS APPLICATIONS SYMPOSIUM (SAS), 2019,
  • [38] Environmental sound classification using a regularized deep convolutional neural network with data augmentation
    Mushtaq, Zohaib
    Su, Shun-Feng
    [J]. APPLIED ACOUSTICS, 2020, 167
  • [39] Data augmentation based morphological classification of galaxies using deep convolutional neural network
    Mittal, Ansh
    Soorya, Anu
    Nagrath, Preeti
    Hemanth, D. Jude
    [J]. EARTH SCIENCE INFORMATICS, 2020, 13 (03) : 601 - 617
  • [40] LiDAR Data Classification Using Morphological Profiles and Convolutional Neural Networks
    Wang, Aili
    He, Xin
    Ghamisi, Pedram
    Chen, Yushi
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (05) : 774 - 778