Dynamic time warping and sparse representation classification for birdsong phrase classification using limited training data

Cited by: 36
Authors
Tan, Lee N. [1 ]
Alwan, Abeer [1 ]
Kossan, George [2 ]
Cody, Martin L. [2 ]
Taylor, Charles E. [2 ]
Affiliations
[1] Univ Calif Los Angeles, Dept Elect Engn, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Ecol & Evolutionary Biol, Los Angeles, CA 90095 USA
Source
Funding
National Science Foundation (USA);
Keywords
RECOGNITION; SOUND; REVERBERATIONS; VOCALIZATIONS; RECORDINGS; FOREST;
DOI
10.1121/1.4906168
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Annotation of phrases in birdsongs can be helpful to behavioral and population studies. To reduce the need for manual annotation, an automated birdsong phrase classification algorithm for limited data is developed. Limited data occur because of limited recordings or the existence of rare phrases. In this paper, classification of up to 81 phrase classes of Cassin's Vireo is performed using one to five training samples per class. The algorithm involves dynamic time warping (DTW) and two passes of sparse representation (SR) classification. DTW improves the similarity between training and test phrases from the same class in the presence of individual bird differences and phrase segmentation inconsistencies. The SR classifier works by finding a sparse linear combination of training feature vectors from all classes that best approximates the test feature vector. When the class decisions from DTW and the first pass SR classification are different, SR classification is repeated using training samples from these two conflicting classes. Compared to DTW, support vector machines, and an SR classifier without DTW, the proposed classifier achieves the highest classification accuracies of 94% and 89% on manually segmented and automatically segmented phrases, respectively, from unseen Cassin's Vireo individuals, using five training samples per class. (C) 2015 Acoustical Society of America.
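The pipeline described in the abstract (a DTW decision, a first-pass SR decision, and a second SR pass restricted to the two conflicting classes) can be illustrated with a minimal sketch. This is not the authors' implementation. Several pieces are assumptions made only for illustration: the sequences are 1-D toy signals rather than real acoustic features, `to_vector` uses plain linear resampling to get fixed-length vectors, the sparse code is computed with a small orthogonal matching pursuit routine (`omp`) as a stand-in for whatever sparse solver the paper uses, and all function names and the sparsity level `k` are hypothetical.

```python
import numpy as np

def dtw_distance(a, b):
    """Classic DTW cost between two 1-D sequences (absolute-difference local cost)."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return cost[n, m]

def to_vector(seq, length=16):
    """Assumed feature step: resample to a fixed length and scale to unit norm."""
    grid = np.linspace(0, len(seq) - 1, length)
    x = np.interp(grid, np.arange(len(seq)), seq)
    return x / (np.linalg.norm(x) + 1e-12)

def omp(D, y, k):
    """Greedy sparse code of y over the columns of D using at most k atoms
    (a stand-in sparse solver, chosen here for brevity)."""
    residual, support, sol = y.copy(), [], np.zeros(0)
    for _ in range(k):
        idx = int(np.argmax(np.abs(D.T @ residual)))
        if idx in support:
            break
        support.append(idx)
        sol, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ sol
        if np.linalg.norm(residual) < 1e-10:
            break
    coef = np.zeros(D.shape[1])
    coef[support] = sol
    return coef

def src_classify(D, labels, y, k=3):
    """Pick the class whose training columns alone best reconstruct y."""
    coef = omp(D, y, k)
    classes = np.unique(labels)
    residuals = [np.linalg.norm(y - D[:, labels == c] @ coef[labels == c])
                 for c in classes]
    return classes[int(np.argmin(residuals))]

def two_pass_classify(train_seqs, train_labels, test_seq, k=3):
    labels = np.asarray(train_labels)
    # DTW decision: label of the nearest training sequence under DTW cost.
    dtw_costs = [dtw_distance(s, test_seq) for s in train_seqs]
    dtw_label = labels[int(np.argmin(dtw_costs))]
    # First-pass SR decision over training vectors from all classes.
    D = np.column_stack([to_vector(s) for s in train_seqs])
    y = to_vector(test_seq)
    sr_label = src_classify(D, labels, y, k)
    if sr_label == dtw_label:
        return sr_label
    # Second pass: repeat SR using only the two conflicting classes.
    keep = (labels == dtw_label) | (labels == sr_label)
    return src_classify(D[:, keep], labels[keep], y, k)

# Toy data: class "A" is one sine period, class "B" is a ramp, at two lengths each.
train_seqs = [np.sin(np.linspace(0, 2 * np.pi, 20)),
              np.sin(np.linspace(0, 2 * np.pi, 30)),
              np.linspace(-1, 1, 20),
              np.linspace(-1, 1, 30)]
train_labels = ["A", "A", "B", "B"]
test_seq = np.sin(np.linspace(0, 2 * np.pi, 25))  # unseen length, same shape as "A"
predicted = two_pass_classify(train_seqs, train_labels, test_seq)
print(predicted)
```

In this sketch the second pass only runs when the two decisions disagree, which mirrors the conflict rule stated in the abstract; how the paper combines DTW with feature extraction before SR classification is not specified in this record and is not reproduced here.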
Pages: 1069-1080
Number of pages: 12
Related papers
50 in total
  • [1] A SPARSE REPRESENTATION-BASED CLASSIFIER FOR IN-SET BIRD PHRASE VERIFICATION AND CLASSIFICATION WITH LIMITED TRAINING DATA
    Tan, Lee Ngee
    Kossan, George
    Cody, Martin L.
    Taylor, Charles E.
    Alwan, Abeer
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 763 - 767
  • [2] Evaluation of a Sparse Representation-Based Classifier For Bird Phrase Classification Under Limited Data Conditions
    Tan, Lee Ngee
    Kaewtip, Kantapon
    Cody, Martin L.
    Taylor, Charles E.
    Alwan, Abeer
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2521 - 2524
  • [3] Mammogram classification using dynamic time warping
    Syed Jamal Safdar Gardezi
    Ibrahima Faye
    Jose M. Sanchez Bornot
    Nidal Kamel
    Mohammad Hussain
    [J]. Multimedia Tools and Applications, 2018, 77 : 3941 - 3962
  • [4] Chromosome classification using dynamic time warping
    Legrand, Benoit
    Chang, C. S.
    Ong, S. H.
    Neo, Soek-Ying
    Palanisamy, Nallasivarn
    [J]. PATTERN RECOGNITION LETTERS, 2008, 29 (03) : 215 - 222
  • [5] Motion Classification Using Dynamic Time Warping
    Adistambha, Kevin
    Ritz, Christian H.
    Burnett, Ian S.
    [J]. 2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, 2008, : 626 - +
  • [6] Mammogram classification using dynamic time warping
    Gardezi, Syed Jamal Safdar
    Faye, Ibrahima
    Bornot, Jose M. Sanchez
    Kamel, Nidal
    Hussain, Mohammad
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (03) : 3941 - 3962
  • [7] Classification of temporal data using dynamic time warping and compressed learning
    Huang, Shih-Feng
    Lu, Hong-Ping
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 57
  • [8] ECG frame classification using dynamic time warping
    Huang, B
    Kinsner, W
    [J]. IEEE CCEC 2002: CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, CONFERENCE PROCEEDINGS, 2002, : 1105 - 1110
  • [9] Classification of surgical processes using dynamic time warping
    Forestier, Germain
    Lalys, Florent
    Riffaud, Laurent
    Trelhu, Brivael
    Jannin, Pierre
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2012, 45 (02) : 255 - 264
  • [10] Classification of genomic signals using dynamic time warping
    Skutkova, Helena
    Vitek, Martin
    Babula, Petr
    Kizek, Rene
    Provaznik, Ivo
    [J]. BMC BIOINFORMATICS, 2013, 14