Dynamic time warping and sparse representation classification for birdsong phrase classification using limited training data

被引:36
|
作者
Tan, Lee N. [1 ]
Alwan, Abeer [1 ]
Kossan, George [2 ]
Cody, Martin L. [2 ]
Taylor, Charles E. [2 ]
机构
[1] Univ Calif Los Angeles, Dept Elect Engn, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Ecol & Evolutionary Biol, Los Angeles, CA 90095 USA
来源
基金
美国国家科学基金会;
关键词
RECOGNITION; SOUND; REVERBERATIONS; VOCALIZATIONS; RECORDINGS; FOREST;
D O I
10.1121/1.4906168
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Annotation of phrases in birdsongs can be helpful to behavioral and population studies. To reduce the need for manual annotation, an automated birdsong phrase classification algorithm for limited data is developed. Limited data occur because of limited recordings or the existence of rare phrases. In this paper, classification of up to 81 phrase classes of Cassin's Vireo is performed using one to five training samples per class. The algorithm involves dynamic time warping (DTW) and two passes of sparse representation (SR) classification. DTW improves the similarity between training and test phrases from the same class in the presence of individual bird differences and phrase segmentation inconsistencies. The SR classifier works by finding a sparse linear combination of training feature vectors from all classes that best approximates the test feature vector. When the class decisions from DTW and the first pass SR classification are different, SR classification is repeated using training samples from these two conflicting classes. Compared to DTW, support vector machines, and an SR classifier without DTW, the proposed classifier achieves the highest classification accuracies of 94% and 89% on manually segmented and automatically segmented phrases, respectively, from unseen Cassin's Vireo individuals, using five training samples per class. (C) 2015 Acoustical Society of America.
引用
收藏
页码:1069 / 1080
页数:12
相关论文
共 50 条
  • [31] Automatic classification of killer whale vocalizations using dynamic time warping
    Brown, Judith C.
    Miller, Patrick J. O.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 122 (02): : 1201 - 1207
  • [32] Combining raw and normalized data in multivariate time series classification with dynamic time warping
    Luczak, Maciej
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (01) : 373 - 380
  • [33] Automatic classification of killer whale vocalizations using dynamic time warping
    Brown, Judith C.
    Miller, Patrick J. O.
    [J]. Journal of the Acoustical Society of America, 2007, 122 (02): : 1201 - 1207
  • [34] Automatic assessment of violin performance using dynamic time warping classification
    Giraldo, Sergio
    Ortega, Ariadna
    Perez, Alfonso
    Ramirez, Rafael
    Waddell, George
    Williamon, Aaron
    [J]. 2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [35] Accelerometer-based gesture recognition using dynamic time warping and sparse representation
    Wang, Haiying
    Li, Zhengshan
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (14) : 8637 - 8655
  • [36] Accelerometer-based gesture recognition using dynamic time warping and sparse representation
    Haiying Wang
    Zhengshan Li
    [J]. Multimedia Tools and Applications, 2016, 75 : 8637 - 8655
  • [37] Kernel sparse representation for time series classification
    Chen, Zhihua
    Zuo, Wangmeng
    Hu, Qinghua
    Lin, Liang
    [J]. INFORMATION SCIENCES, 2015, 292 : 15 - 26
  • [38] Electrocardiogram data mining based on frame classification by dynamic time warping matching
    Zhang, Gong
    Kinsner, Witold
    Huang, Bin
    [J]. COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING, 2009, 12 (06) : 701 - 707
  • [39] Enhanced Weighted Dynamic Time Warping for Time Series Classification
    Anantasech, Pichamon
    Ratanamahatana, Chotirat Ann
    [J]. THIRD INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, 2019, 797 : 655 - 664
  • [40] A Scalable Segmented Dynamic Time Warping for Time Series Classification
    Ma, Ruizhe
    Ahmadzadeh, Azim
    Boubrahimi, Soukaina Filali
    Angryk, Rafal A.
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2019, PT II, 2019, 11509 : 407 - 419