Parametric representations of bird sounds for automatic species recognition

被引:142
|
作者
Somervuo, Panu
Harma, Aki
Fagerlund, Seppo
机构
[1] Aalto Univ, Neural Networks Res Ctr, FIN-02150 Espoo, Finland
[2] Philips Res, NL-5656 AA Eindhoven, Netherlands
[3] Aalto Univ, Lab Acoust & Audio Signal Proc, FIN-02150 Espoo, Finland
基金
芬兰科学院;
关键词
bird song; dynamic time warping (DTW); feature extraction; Gaussian mixture model (GMM); hidden Markov model (HMM); sinusoidal modeling;
D O I
10.1109/TASL.2006.872624
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper is related to the development of signal processing techniques for automatic recognition of bird species. Three different parametric representations are compared. The first representation is based on sinusoidal modeling which has been earlier found useful for highly tonal bird sounds. Mel-cepstrum parameters are used since they have been found very useful in the parallel problem of speech recognition. Finally, a vector of various descriptive features is tested because such models are popular in audio classification applications, and bird song is almost like music. We briefly introduce the methods and evaluate their performance in the classification and recognition of both individual syllables and song fragments of 14 common North-European Passerine bird species.
引用
收藏
页码:2252 / 2263
页数:12
相关论文
共 50 条
  • [31] Automatic Target Recognition via Sparse Representations
    Estabridis, Katia
    AUTOMATIC TARGET RECOGNITION XX; ACQUISITION, TRACKING, POINTING, AND LASER SYSTEMS TECHNOLOGIES XXIV; AND OPTICAL PATTERN RECOGNITION XXI, 2010, 7696
  • [32] TPR parametric model in automatic target recognition
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 20 (12): : 37 - 39
  • [33] IDENTIFICATION OF BIRD SOUNDS
    WHITE, TC
    BRITISH BIRDS, 1986, 79 (08): : 406 - 407
  • [34] IDENTIFICATION OF BIRD SOUNDS
    ROGERS, MJ
    BRITISH BIRDS, 1985, 78 (04): : 188 - 188
  • [35] MODULATION IN BIRD SOUNDS
    STEIN, RC
    AUK, 1968, 85 (02): : 229 - &
  • [36] Towards Automatic Recognition of Sounds Observed in Daily Living Activity
    Shaukat, Arslan
    Younis, Ammar
    Akram, Usman
    Mohsin, Muhammad
    Mustansar, Zartasha
    PROCEEDINGS OF THE 2019 IEEE 18TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC 2019), 2019, : 66 - 74
  • [37] ANALYSIS AND AUTOMATIC RECOGNITION OF HUMAN BEATBOX SOUNDS: A COMPARATIVE STUDY
    Picart, Benjamin
    Brognaux, Sandrine
    Dupont, Stephane
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4255 - 4259
  • [38] Recognition of bird species based on spike model using bird dataset
    Mohanty, Ricky
    Mallik, Bandi Kumar
    Solanki, Sandeep Singh
    DATA IN BRIEF, 2020, 29
  • [39] On the role of audio frontends in bird species recognition
    Ghaffari, Houtan
    Devos, Paul
    ECOLOGICAL INFORMATICS, 2024, 81
  • [40] Visualization of Audio Records for Automatic Bird Species Identification
    Reyes, Angie K.
    Camargo, Jorge E.
    2015 20TH SYMPOSIUM ON SIGNAL PROCESSING, IMAGES AND COMPUTER VISION (STSIVA), 2015,