Automatic Segmentation of Audio Signals for Bird Species Identification

被引:9
|
作者
Evangelista, Thiago L. F. [1 ]
Priolli, Thales M. [1 ]
Silla, Carlos N., Jr. [1 ]
Angelico, Bruno A. [1 ]
Kaestner, Celso A. A. [2 ]
机构
[1] Univ Tecnol Fed Parana, Ave Alberto Carazzai 1640, BR-86300000 Cornelio Procopio, Parana, Brazil
[2] Univ Tecnol Fed Parana, BR-80230901 Curitiba, Parana, Brazil
关键词
Processing; Pattern Recognition; Machine Learning; Bird Species Identification; SOUNDS; CLASSIFICATION;
D O I
10.1109/ISM.2014.46
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The identification of bird species from their audio recorded songs are nowadays used in several important applications, such as to monitor the quality of the environment and to prevent bird-plane collisions near airports. The complete identification cycle involves the use of: (a) recording devices to acquire the songs, (b) audio processing techniques to remove the noise and to select the most representative elements of the signal, (c) feature extraction procedures to obtain relevant characteristics, and (d) decision procedures to make the identification. The decision procedures can be obtained by Machine Learning (ML) algorithms, considering the problem in a standard classification scenario. One key element is this cycle is the selection of the most relevant segments of the audio for identification purposes. In this paper we show that the use of short audio segments with high amplitude - called pulses in our work - outperforms the use of the complete audio records in the species identification task. We also show how these pulses can be automatically obtained, based on measurements performed directly on the audio signal. The employed classifiers are trained using a previously labeled database of bird songs. We use a database that contains bird song recordings from 75 species which appear in the Southern Atlantic Coast of South America. Obtained results show that the use of automatically obtained pulses and a SVM classifier produce the best results; all the necessary procedures can be installed in a dedicated hardware, allowing the construction of a specific bird identification device.
引用
收藏
页码:223 / 228
页数:6
相关论文
共 50 条
  • [21] Automatic segmentation and annotation of audio archive documents
    Bohac, Marek
    Blavka, Karel
    2011 10TH INTERNATIONAL WORKSHOP ON ELECTRONICS, CONTROL, MEASUREMENT AND SIGNALS (ECMS), 2011, : 61 - 66
  • [22] An automatic approach towards audio segmentation and classification
    Pan, Wenjuan
    Wang, Zongwu
    Liu, Zhijing
    PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, PROCEEDINGS, 2007, : 405 - 408
  • [23] Automatic segmentation and indexing in a database of bird images
    Das, M
    Manmatha, R
    EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL II, PROCEEDINGS, 2001, : 351 - 358
  • [24] Automatic Segmentation and Deep Learning of Bird Sounds
    Koops, Hendrik Vincent
    van Balen, Jan
    Wiering, Frans
    EXPERIMENTAL IR MEETS MULTILINGUALITY, MULTIMODALITY, AND INTERACTION, 2015, 9283 : 261 - 267
  • [25] Two Convolutional Neural Networks for Bird Detection in Audio Signals
    Grill, Thomas
    Schlueter, Jan
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 1764 - 1768
  • [26] Fuzzy precision and recall measures for audio signals segmentation
    Ziolko, Bartosz
    FUZZY SETS AND SYSTEMS, 2015, 279 : 101 - 111
  • [27] On the segmentation of narrowly-spaced noisy audio signals
    Sattar, F
    Pwint, M
    Doraiswami, R
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : A281 - A284
  • [28] Cardiac Murmur Effects on Automatic Segmentation of ECG Signals for Biometric Identification: Preliminary Study
    Duque-Mejia, C.
    Becerra, M. A.
    Zapata-Hernandez, C.
    Mejia-Arboleda, C.
    Castro-Ospina, A. E.
    Delgado-Trejos, E.
    Peluffo-Ordonez, Diego H.
    Rosero-Montalvo, P.
    Revelo-Fuelagan, Javier
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2019, PT I, 2019, 11431 : 269 - 279
  • [29] BILINGUAL AUDIO-SUBTITLE EXTRACTION USING AUTOMATIC SEGMENTATION OF MOVIE AUDIO
    Tsiartas, Andreas
    Ghosh, Prasanta
    Georgiou, Panayiotis G.
    Narayanan, Shrikanth
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5624 - 5627
  • [30] Audio Classification of Bird Species: a Statistical Manifold Approach
    Briggs, Forrest
    Raich, Raviv
    Fern, Xiaoli Z.
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 51 - 60