EMBEDDING TIME WARPING IN EXEMPLAR-BASED SPARSE REPRESENTATIONS OF SPEECH

Cited by: 0
Authors
Yilmaz, Emre [1 ]
Gemmeke, Jort F. [1 ]
Van Hamme, Hugo [1 ]
Affiliations
[1] Katholieke Univ Leuven, Dept ESAT, Louvain, Belgium
Keywords
Exemplar-based speech recognition; sparse representations; time warping
DOI
Not available
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper describes a new sparse representation model for speech that allows time warping, as an extension to a recently proposed speech recognition system based on sparse representations. This recognition system models the acoustics with exemplars, which are labeled speech occurrences of different lengths extracted from the training data. Exemplars are organized in multiple dictionaries on the basis of their class and length. Input speech segments are approximated as a sparse linear combination of the exemplars in these dictionaries, and decoding based on the reconstruction error is adopted to find the best-matching class sequence. The current sparse representation model, which uses a dictionary and a weight vector to approximate an input speech segment, cannot compare input speech segments with exemplars of different lengths. The goal of this work is to introduce a novel sparse representation model that allows time warping by means of a third matrix, which linearly combines consecutive frames in order to shrink or expand the approximation. Preliminary results show the feasibility of the proposed sparse representation model.
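The three-factor idea in the abstract (dictionary, weight vector, and a third matrix that combines consecutive frames) can be illustrated with a small sketch. The abstract does not give the structure of the third matrix, so everything below is an assumption for illustration only: the warping matrix is built by plain linear interpolation, the function names `linear_warp_matrix`, `matmul`, and `combine` are hypothetical, and the toy exemplars and weights are made up. It is not the authors' formulation.

```python
def linear_warp_matrix(src_len, dst_len):
    """Return W (src_len x dst_len); each column mixes at most two
    consecutive source frames with linear-interpolation weights.
    ASSUMPTION: linear interpolation stands in for the paper's third matrix."""
    W = [[0.0] * dst_len for _ in range(src_len)]
    for j in range(dst_len):
        # Position of target frame j on the source time axis.
        p = 0.0 if dst_len == 1 else j * (src_len - 1) / (dst_len - 1)
        lo = min(int(p), src_len - 1)
        hi = min(lo + 1, src_len - 1)
        frac = p - lo
        W[lo][j] += 1.0 - frac
        W[hi][j] += frac
    return W


def matmul(X, Y):
    """Plain-Python matrix product of two list-of-lists matrices."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def combine(exemplars, weights):
    """Weighted sum of equally sized exemplars (features x frames)."""
    n_feat, n_frames = len(exemplars[0]), len(exemplars[0][0])
    return [[sum(w * ex[f][t] for w, ex in zip(weights, exemplars))
             for t in range(n_frames)] for f in range(n_feat)]


# Toy data (hypothetical): two exemplars, 2 features x 3 frames each.
exemplars = [
    [[0.0, 2.0, 4.0], [1.0, 1.0, 1.0]],
    [[4.0, 2.0, 0.0], [0.0, 2.0, 0.0]],
]
weights = [0.5, 0.5]             # weight vector over the dictionary
combo = combine(exemplars, weights)
W = linear_warp_matrix(3, 5)     # expand 3 frames to 5 frames
approx = matmul(combo, W)        # 2 features x 5 frames
```

Here the same weighted combination of 3-frame exemplars is stretched to approximate a 5-frame input segment; choosing `dst_len` smaller than `src_len` would shrink it instead. In a real system the weights (and possibly the warping) would be estimated by minimizing a reconstruction error, which this sketch does not attempt.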
Pages: 8076-8080
Page count: 5
Related papers
50 in total
  • [41] Towards exemplar-based polysemy
    Rais-Ghasem, M
    Corriveau, JP
    PROCEEDINGS OF THE TWENTY FIRST ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 1999, : 566 - 571
  • [42] MANY-TO-ONE VOICE CONVERSION USING EXEMPLAR-BASED SPARSE REPRESENTATION
    Aihara, Ryo
    Takiguchi, Tetsuya
    Ariki, Yasuo
    2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
  • [43] Exemplar-Based Colour Constancy
    Joze, Hamid Reza Vaezi
    Drew, Mark S.
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
  • [44] Improvement of the exemplar-based inpainting
    Huang, Weijie
    Zhang, Guoshan
    JOURNAL OF ELECTRONIC IMAGING, 2016, 25 (05)
  • [45] Exemplar-Based Face Parsing
    Smith, Brandon M.
    Zhang, Li
    Brandt, Jonathan
    Lin, Zhe
    Yang, Jianchao
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 3484 - 3491
  • [46] Real-Time Exemplar-Based Face Sketch Synthesis
    Song, Yibing
    Bao, Linchao
    Yang, Qingxiong
    Yang, Ming-Hsuan
    COMPUTER VISION - ECCV 2014, PT VI, 2014, 8694 : 800 - 813
  • [47] Integrated exemplar-based template matching and statistical modeling for continuous speech recognition
    Xie Sun
    Yunxin Zhao
    EURASIP Journal on Audio, Speech, and Music Processing, 2014
  • [48] Integrated exemplar-based template matching and statistical modeling for continuous speech recognition
    Sun, Xie
    Zhao, Yunxin
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014,
  • [49] Noise robust Automatic Speech Recognition system by integrating Robust Principal Component Analysis (RPCA) and Exemplar-based Sparse Representation
    Gavrilescu, Mihai
    PROCEEDINGS OF THE 2015 7TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTERS AND ARTIFICIAL INTELLIGENCE (ECAI), 2015, : S29 - S33
  • [50] Multi-level Exemplar-Based Duration Generation for Expressive Speech Synthesis
    Abou-Zleikha, Mohamed
    Szekely, Eva
    Cahill, Peter
    Carson-Berndsen, Julie
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SPEECH PROSODY, VOLS I AND II, 2012, : 59 - 62