A probabilistic framework for feature-based speech recognition

Authors
Glass, J
Chang, J
McCandless, M
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Most current speech recognizers use an observation space based on a temporal sequence of "frames" (e.g., Mel-cepstra). Another class of recognizer further processes these frames to produce a segment-based network and represents each segment by fixed-dimensional "features." In such feature-based recognizers, the observation space takes the form of a temporal network of feature vectors, so that a single segmentation of an utterance uses only a subset of all possible feature vectors. In this work we examine a maximum a posteriori decoding strategy for feature-based recognizers and develop a normalization criterion useful for a segment-based Viterbi or A* search. We report experimental results for the task of phonetic recognition on the TIMIT corpus, where we achieved context-independent and context-dependent (using diphones) results on the core test set of 64.1% and 69.5%, respectively.
Pages: 2277-2280
Page count: 4