Parametric representations of bird sounds for automatic species recognition

被引:142
|
作者
Somervuo, Panu
Harma, Aki
Fagerlund, Seppo
机构
[1] Aalto Univ, Neural Networks Res Ctr, FIN-02150 Espoo, Finland
[2] Philips Res, NL-5656 AA Eindhoven, Netherlands
[3] Aalto Univ, Lab Acoust & Audio Signal Proc, FIN-02150 Espoo, Finland
基金
芬兰科学院;
关键词
bird song; dynamic time warping (DTW); feature extraction; Gaussian mixture model (GMM); hidden Markov model (HMM); sinusoidal modeling;
D O I
10.1109/TASL.2006.872624
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper is related to the development of signal processing techniques for automatic recognition of bird species. Three different parametric representations are compared. The first representation is based on sinusoidal modeling which has been earlier found useful for highly tonal bird sounds. Mel-cepstrum parameters are used since they have been found very useful in the parallel problem of speech recognition. Finally, a vector of various descriptive features is tested because such models are popular in audio classification applications, and bird song is almost like music. We briefly introduce the methods and evaluate their performance in the classification and recognition of both individual syllables and song fragments of 14 common North-European Passerine bird species.
引用
收藏
页码:2252 / 2263
页数:12
相关论文
共 50 条
  • [41] Automatic Bird Species Filtering Using a Multimodel Approach
    Li, Wen
    Song, Dezhen
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2015, 12 (02) : 553 - 564
  • [42] Automatic segmentation of audio signal in bird species identification
    Zottesso, Rafael H. D.
    Matsushita, Gustavo H. G.
    Lucio, Diego Rafael
    Costa, Yandre M. G.
    PROCEEDINGS OF THE 2016 35TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2016,
  • [43] Automatic Segmentation of Audio Signals for Bird Species Identification
    Evangelista, Thiago L. F.
    Priolli, Thales M.
    Silla, Carlos N., Jr.
    Angelico, Bruno A.
    Kaestner, Celso A. A.
    2014 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2014, : 223 - 228
  • [44] Feature Set Comparison for Automatic Bird Species Identification
    Lopes, Marcelo Teider
    Koerich, Alessandro Lameiras
    Silla Junior, Carlos Nascimento
    Alves Kaestner, Celso Antonio
    2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 965 - 970
  • [45] Automatic Frequency Feature Extraction for Bird Species Delimitation
    O'Reilly, Colm
    Kokuer, Munevver
    Jancovic, Peter
    Drennan, Regan
    Harte, Naomi
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 1759 - 1763
  • [46] Automatic recognition of biological shapes with and without representations of shape
    Sánchez-Marín, FJ
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2000, 18 (02) : 173 - 186
  • [47] The incidence of bird sounds, and other categories of non-focal sounds, confound the relationships between acoustic indices and bird species richness in southern China
    Galappaththi, Supun
    Goodale, Eben
    Sun, Jiajie
    Jiang, Aiwu
    Mammides, Christos
    GLOBAL ECOLOGY AND CONSERVATION, 2024, 51
  • [48] Automatic Classification of Bird Sounds: Using MFCC and Mel Spectrogram Features with Deep Learning
    Carvalho, Silvestre
    Gomes, Elsa Ferreira
    VIETNAM JOURNAL OF COMPUTER SCIENCE, 2023, 10 (01) : 39 - 54
  • [49] EVALUATION OF VARIOUS LINEAR PREDICTION PARAMETRIC REPRESENTATIONS IN VOWEL RECOGNITION
    PALIWAL, KK
    RAO, PVS
    SIGNAL PROCESSING, 1982, 4 (04) : 323 - 327
  • [50] Fractal dimensions of speech sounds: Computation and application to automatic speech recognition
    Maragos, P
    Potamianos, A
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 105 (03): : 1925 - 1932