Exemplar-Based Processing for Speech Recognition

被引:35
|
作者
Sainath, Tara N. [1 ]
Ramabhadran, Bhuvana [2 ,3 ]
Nahamoo, David
Kanevsky, Dimitri [4 ,5 ]
Van Compernolle, Dirk [6 ,7 ]
Demuynck, Kris [8 ]
Gemmeke, Jort Florent
Bellegarda, Jerome R.
Sundaram, Shiva [9 ]
机构
[1] IBM TJ Watson Ctr, Speech & Language Algorithms Grp, Yorktown Hts, NY USA
[2] IBM TJ Watson Ctr, Speech Transcript & Synth Res Grp, Yorktown Hts, NY USA
[3] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
[4] IBM TJ Watson Ctr, Dept Speech & Language Algorithms, Yorktown Hts, NY USA
[5] Inst Adv Studies, Princeton, NJ USA
[6] Katholieke Univ Leuven, Dept Elect Engn, Louvain, Belgium
[7] INTERSPEECH, Antwerp, Belgium
[8] Katholieke Univ Leuven, Dept Elect Engn ESAT, Louvain, Belgium
[9] Tech Univ Berlin, Berlin, Germany
关键词
SPARSE IMPUTATION; FACE RECOGNITION; CLASSIFICATION; RETRIEVAL; ENTROPY;
D O I
10.1109/MSP.2012.2208663
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Solving real-world classification and recognition problems requires a principled way of modeling the physical phenomena generating the observed data and the uncertainty in it. The uncertainty originates from the fact that many data generation aspects are influenced by nondirectly measurable variables or are too complex to model and hence are treated as random fluctuations. For example, in speech production, uncertainty could arise from vocal tract variations among different people or corruption by noise. The goal of modeling is to establish a generalization from the set of observed data such that accurate inference (classification, decision, recognition) can be made about the data yet to be observed, which we refer to as unseen data. © 2012 IEEE.
引用
收藏
页码:98 / 113
页数:16
相关论文
共 50 条
  • [31] ON EXEMPLAR-BASED EXEMPLAR REPRESENTATIONS - REPLY
    NOSOFSKY, RM
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 1988, 117 (04) : 412 - 414
  • [32] EMBEDDING TIME WARPING IN EXEMPLAR-BASED SPARSE REPRESENTATIONS OF SPEECH
    Yilmaz, Emre
    Gemmeke, Jort F.
    Van Hamme, Hugo
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8076 - 8080
  • [33] SPEECH SEGMENT CLUSTERING FOR REAL-TIME EXEMPLAR-BASED SPEECH ENHANCEMENT
    Nesbitt, David
    Crookes, Danny
    Ming, Ji
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5419 - 5423
  • [34] An Analysis of Sparseness and Regularization in Exemplar-Based Methods for Speech Classification
    Kanevsky, Dimitri
    Sainath, Tara N.
    Ramabhadran, Bhuvana
    Nahamoo, David
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2846 - 2849
  • [35] Temporal Exemplar-based Bayesian Networks for Facial Expression Recognition
    Shang, Lifeng
    Chan, Kwok-Ping
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 16 - 22
  • [37] MULTIPITCH ESTIMATION AND INSTRUMENT RECOGNITION BY EXEMPLAR-BASED SPARSE REPRESENTATION
    Degawa, Ikuo
    Sato, Kei
    Ikehara, Masaaki
    2013 ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2013, : 560 - 564
  • [38] Deep Exemplar-based Colorization
    He, Mingming
    Chen, Dongdong
    Liao, Jing
    Sander, Pedro, V
    Yuan, Lu
    ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (04):
  • [39] NOISE-ROBUST SPEECH RECOGNITION WITH EXEMPLAR-BASED SPARSE REPRESENTATIONS USING ALPHA-BETA DIVERGENCE
    Yilmaz, Emre
    Gemmeke, Jort F.
    Van Hamme, Hugo
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [40] Towards exemplar-based polysemy
    Rais-Ghasem, M
    Corriveau, JP
    PROCEEDINGS OF THE TWENTY FIRST ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 1999, : 566 - 571