EMBEDDING TIME WARPING IN EXEMPLAR-BASED SPARSE REPRESENTATIONS OF SPEECH

被引:0
|
作者
Yilmaz, Emre [1 ]
Gemmeke, Jort F. [1 ]
Van Hamme, Hugo [1 ]
机构
[1] Katholieke Univ Leuven, Dept ESAT, Louvain, Belgium
关键词
Exemplar-based speech recognition; sparse representations; time warping;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes a new sparse representation model for speech that allows time warping as an extension to a recently proposed sparse representations-based speech recognition system. This recognition system uses exemplars to model the acoustics which are labeled speech occurrences of different length extracted from the training data. Exemplars are organized in multiple dictionaries on the basis of their class and length. Input speech segments are approximated as a sparse linear combination of the exemplars using these dictionaries and a reconstruction error-based decoding is adopted in order to find the best matching class sequence. With the current sparse representation model using a dictionary and a weight vector to approximate an input speech segment, it is not possible to compare input speech segments with exemplars of different lengths. The goal of this work is to introduce a novel sparse representation model which allows time warping using a third matrix which linearly combines consecutive frames in order to shrink or expand the approximation. Preliminary results have shown the feasibility of the proposed sparse representation model.
引用
收藏
页码:8076 / 8080
页数:5
相关论文
共 50 条
  • [21] Enhancing Exemplar-Based Posteriors for Speech Recognition Tasks
    Sainath, Tara N.
    Nahamoo, David
    Kanevsky, Dimitri
    Ramabhadran, Bhuvana
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2127 - 2130
  • [22] Coupled Dictionaries for Exemplar-Based Speech Enhancement and Automatic Speech Recognition
    Baby, Deepak
    Virtanen, Tuomas
    Gemmeke, Jort F.
    van Hamme, Hugo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1788 - 1799
  • [23] Noise Robust Exemplar Matching Using Sparse Representations of Speech
    Yilmaz, Emre
    Gemmeke, Jort Florent
    Van Hamme, Hugo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (08) : 1306 - 1319
  • [24] Exemplar-Based Sparse Representation Features: From TIMIT to LVCSR
    Sainath, Tara N.
    Ramabhadran, Bhuvana
    Picheny, Michael
    Nahamoo, David
    Kanevsky, Dimitri
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08): : 2598 - 2613
  • [25] Multipitch Estimation of Piano Music by Exemplar-Based Sparse Representation
    Lee, Cheng-Te
    Yang, Yi-Hsuan
    Chen, Homer H.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (03) : 608 - 618
  • [26] MULTIPITCH ESTIMATION AND INSTRUMENT RECOGNITION BY EXEMPLAR-BASED SPARSE REPRESENTATION
    Degawa, Ikuo
    Sato, Kei
    Ikehara, Masaaki
    2013 ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2013, : 560 - 564
  • [27] EXEMPLAR-BASED SPARSE REPRESENTATION OF TIMBRE AND PROSODY FOR VOICE CONVERSION
    Ming, Huaiping
    Huang, Dongyan
    Xie, Lei
    Zhang, Shaofei
    Dong, Minghui
    Li, Haizhou
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5175 - 5179
  • [28] Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion
    Wu, Zhizheng
    Virtanen, Tuomas
    Chng, Eng Siong
    Li, Haizhou
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (10) : 1506 - 1521
  • [29] Exemplar-based phonology: It's about time
    Kirchner, R
    WCCFL 23: PROCEEDINGS OF THE 23RD WEST COAST CONFERENCE ON FORMAL LINGUISTICS, 2004, : 464 - 474
  • [30] COMBINING EXEMPLAR-BASED CATEGORY REPRESENTATIONS AND CONNECTIONIST LEARNING RULES
    NOSOFSKY, RM
    KRUSCHKE, JK
    MCKINLEY, SC
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 1992, 18 (02) : 211 - 233