A feature-based hierarchical speech recognition system for Hindi

被引:0
|
作者
K Samudravijaya
R Ahuja
N Bondale
T Jose
S Krishnan
P Poddar
xxPVS Rao
R Raveendran
机构
[1] Tata Institute of Fundamental Research,Computer Systems and Communications Group
来源
Sadhana | 1998年 / 23卷
关键词
Speech recognition; hierarchical approach; Hindi; knowledge integration; natural language processing;
D O I
暂无
中图分类号
学科分类号
摘要
This paper presents a description of a speech recognition system forHindi. The system follows a hierarchic approach to speech recognition and integrates multiple knowledge sources within statistical pattern recognition paradigms at various stages of signal decoding. Rather than make hard decisions at the level of each processing unit, relative confidence scores of individual units are propagated to higher levels. Phoneme recognition is achieved in two stages: broad acoustic classification of a frame is followed by fine acoustic classification. A semi-Markov model processes the frame level outputs of a broad acoustic maximum likelihood classifier to yield a sequence of segments with broad acoustic labels. The phonemic identities of selected classes of segments are decoded by class-dependent neural nets which are trained with class-specific feature vectors as input. Lexical access is achieved by string matching using a dynamic programming technique. A novel language processor disambiguates between multiple choices given by the acoustic recognizer to recognize the spoken sentence.
引用
收藏
页码:313 / 340
页数:27
相关论文
共 50 条
  • [1] A feature-based hierarchical speech recognition system for Hindi
    Samudravijaya, K
    Ahuja, R
    Bondale, N
    Jose, T
    Krishnan, S
    Poddar, P
    Rao, PVS
    Raveendran, R
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1998, 23 (4): : 313 - 340
  • [2] Feature-based Noise Robust Speech Recognition on an Indonesian Language Automatic Speech Recognition System
    Satriawan, Cil Hardianto
    Lestari, Dessi Puji
    [J]. 2014 International Conference on Electrical Engineering and Computer Science (ICEECS), 2014, : 42 - 46
  • [3] A probabilistic framework for feature-based speech recognition
    Glass, J
    Chang, J
    McCandless, M
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2277 - 2280
  • [4] CLSTM: Deep Feature-Based Speech Emotion Recognition Using the Hierarchical ConvLSTM Network
    Mustageem
    Kwon, Soonil
    [J]. MATHEMATICS, 2020, 8 (12) : 1 - 19
  • [5] A FEATURE-BASED SEMIVOWEL RECOGNITION SYSTEM
    ESPYWILSON, CY
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 96 (01): : 65 - 72
  • [6] A feature-based face recognition system
    Campadelli, P
    Lanzarotti, R
    Savazzi, C
    [J]. 12TH INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND PROCESSING, PROCEEDINGS, 2003, : 68 - 73
  • [7] Landmark detection for distinctive feature-based speech recognition
    Liu, SA
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 100 (05): : 3417 - 3430
  • [8] A Multichannel Feature-Based Processing for Robust Speech Recognition
    Souden, Mehrez
    Kinoshita, Keisuke
    Delcroix, Marc
    Nakatani, Tomohiro
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 696 - 699
  • [9] A dynamic, feature-based approach to speech modeling and recognition
    Deng, L
    [J]. 1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 107 - 114
  • [10] Prosodic Feature-Based Discriminatively Trained Low Resource Speech Recognition System
    Hasija, Taniya
    Kadyan, Virender
    Guleria, Kalpna
    Alharbi, Abdullah
    Alyami, Hashem
    Goyal, Nitin
    [J]. SUSTAINABILITY, 2022, 14 (02)