Exemplar-Based Processing for Speech Recognition

Cited by: 35
Authors
Sainath, Tara N. [1]
Ramabhadran, Bhuvana [2, 3]
Nahamoo, David
Kanevsky, Dimitri [4, 5]
Van Compernolle, Dirk [6, 7]
Demuynck, Kris [8]
Gemmeke, Jort Florent
Bellegarda, Jerome R.
Sundaram, Shiva [9]
Affiliations
[1] IBM TJ Watson Ctr, Speech & Language Algorithms Grp, Yorktown Hts, NY USA
[2] IBM TJ Watson Ctr, Speech Transcript & Synth Res Grp, Yorktown Hts, NY USA
[3] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
[4] IBM TJ Watson Ctr, Dept Speech & Language Algorithms, Yorktown Hts, NY USA
[5] Inst Adv Studies, Princeton, NJ USA
[6] Katholieke Univ Leuven, Dept Elect Engn, Louvain, Belgium
[7] INTERSPEECH, Antwerp, Belgium
[8] Katholieke Univ Leuven, Dept Elect Engn ESAT, Louvain, Belgium
[9] Tech Univ Berlin, Berlin, Germany
Keywords
SPARSE IMPUTATION; FACE RECOGNITION; CLASSIFICATION; RETRIEVAL; ENTROPY;
DOI
10.1109/MSP.2012.2208663
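Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology];
Discipline classification code
0808; 0809;
Abstract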
Solving real-world classification and recognition problems requires a principled way of modeling the physical phenomena generating the observed data and the uncertainty in it. The uncertainty originates from the fact that many data generation aspects are influenced by nondirectly measurable variables or are too complex to model and hence are treated as random fluctuations. For example, in speech production, uncertainty could arise from vocal tract variations among different people or corruption by noise. The goal of modeling is to establish a generalization from the set of observed data such that accurate inference (classification, decision, recognition) can be made about the data yet to be observed, which we refer to as unseen data. © 2012 IEEE.
Pages: 98-113
Page count: 16
Related papers
50 entries in total
  • [21] Action recognition using exemplar-based embedding
    Weinland, Daniel
    Boyer, Edmond
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 3033 - 3039
  • [22] EXEMPLAR-BASED NOISE ROBUST AUTOMATIC SPEECH RECOGNITION USING MODULATION SPECTROGRAM FEATURES
    Baby, Deepak
    Virtanen, Tuomas
    Gemmeke, Jort F.
    Barker, Tom
    Van Hamme, Hugo
    2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 519 - 524
  • [23] COUPLED DICTIONARY TRAINING FOR EXEMPLAR-BASED SPEECH ENHANCEMENT
    Baby, Deepak
    Virtanen, Tuomas
    Barker, Tom
    Van Hamme, Hugo
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [24] INTEGRATING META-INFORMATION INTO EXEMPLAR-BASED SPEECH RECOGNITION WITH SEGMENTAL CONDITIONAL RANDOM FIELDS
    Demuynck, Kris
    Seppi, Dino
    Van Compernolle, Dirk
    Nguyen, Patrick
    Zweig, Geoffrey
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5048 - 5051
  • [25] Reducing Computational Complexities of Exemplar-Based Sparse Representations With Applications to Large Vocabulary Speech Recognition
    Sainath, Tara N.
    Ramabhadran, Bhuvana
    Nahamoo, David
    Kanevsky, Dimitri
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 792 - 795
  • [26] HYBRID INPUT SPACES FOR EXEMPLAR-BASED NOISE ROBUST SPEECH RECOGNITION USING COUPLED DICTIONARIES
    Baby, Deepak
    Van Hamme, Hugo
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 1676 - 1680
  • [27] Exemplar-Based Recognition of Human-Object Interactions
    Hu, Jian-Fang
    Zheng, Wei-Shi
    Lai, Jianhuang
    Gong, Shaogang
    Xiang, Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (04) : 647 - 660
  • [28] NOISE ROBUST EXEMPLAR-BASED CONNECTED DIGIT RECOGNITION
    Gemmeke, Jort F.
    Virtanen, Tuomas
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4546 - 4549
  • [29] INVESTIGATIONS ON EXEMPLAR-BASED FEATURES FOR SPEECH RECOGNITION TOWARDS THOUSANDS OF HOURS OF UNSUPERVISED, NOISY DATA
    Heigold, Georg
    Nguyen, Patrick
    Weintraub, Mitchel
    Vanhoucke, Vincent
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4437 - 4440
  • [30] SEMI-SUPERVISED NOISE DICTIONARY ADAPTATION FOR EXEMPLAR-BASED NOISE ROBUST SPEECH RECOGNITION
    Luan, Yi
    Saito, Daisuke
    Kashiwagi, Yosuke
    Minematsu, Nobuaki
    Hirose, Keikichi
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,