POSTERIOR FEATURES APPLIED TO SPEECH RECOGNITION TASKS WITH USER-DEFINED VOCABULARY

Cited by: 10
Authors
Aradilla, Guillermo [1]
Bourlard, Herve [1]
Magimai-Doss, Mathew [1]
Affiliations
[1] Idiap Research Institute, Martigny, Switzerland
Keywords
Speech recognition; template matching; posterior features; Kullback-Leibler divergence
DOI
10.1109/ICASSP.2009.4960457
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
This paper presents a novel approach for applications in which the vocabulary is defined by a set of acoustic samples. In this approach, the acoustic samples are used as reference templates in a template matching framework. The features used to describe the reference templates and the test utterances are estimates of phoneme posterior probabilities. These posteriors are obtained from an MLP trained on an auxiliary database. Thus, the speech variability present in the features is reduced by applying the speech knowledge that the MLP has captured from the auxiliary database. Moreover, information-theoretic dissimilarity measures can be used as local distances between features. When compared to state-of-the-art systems, this approach outperforms acoustic-based techniques and obtains results comparable to those of orthography-based methods. The proposed method can also be directly combined with other posterior-based HMM systems. This combination successfully exploits the complementarity between templates and parametric models.
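The matching idea in the abstract can be illustrated with a minimal sketch. It assumes that template matching is done with plain dynamic time warping (DTW) and that the local distance is the Kullback-Leibler divergence KL(p || q) = sum_k p_k log(p_k / q_k) between posterior vectors. The abstract does not specify the alignment constraints, the direction of the divergence, or any smoothing, so the function names (`kl_divergence`, `dtw_distance`, `recognize`), the `eps` floor, and the length normalization below are illustrative assumptions, not the authors' implementation. The posterior vectors would come from the MLP; here they are hand-written toy values.

```python
import numpy as np

def kl_divergence(p, q, eps=1e-10):
    """KL(p || q) between two discrete distributions, e.g. phoneme posteriors.

    eps is an assumed floor that avoids log(0); it is not taken from the paper.
    """
    p = np.clip(p, eps, None)
    q = np.clip(q, eps, None)
    p = p / p.sum()
    q = q / q.sum()
    return float(np.sum(p * np.log(p / q)))

def dtw_distance(template, test, local_dist=kl_divergence):
    """Length-normalized DTW cost between two posterior sequences.

    Both inputs have shape (frames, classes). The three-way step pattern
    (vertical, horizontal, diagonal) is an assumption.
    """
    n, m = len(template), len(test)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = local_dist(template[i - 1], test[j - 1])
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return cost[n, m] / (n + m)

def recognize(templates, test):
    """Return the label whose closest template has the lowest DTW cost.

    templates maps each user-defined word to a list of posterior sequences.
    """
    return min(
        templates,
        key=lambda label: min(dtw_distance(t, test) for t in templates[label]),
    )

# Toy posteriors over three phoneme classes (hypothetical values).
yes_template = np.array([[0.9, 0.05, 0.05]] * 3 + [[0.05, 0.9, 0.05]] * 3)
no_template = np.array([[0.05, 0.05, 0.9]] * 4 + [[0.9, 0.05, 0.05]] * 2)
test_utterance = np.array([[0.8, 0.1, 0.1]] * 4 + [[0.1, 0.8, 0.1]] * 2)

templates = {"yes": [yes_template], "no": [no_template]}
print(recognize(templates, test_utterance))
```

Because the vocabulary is just the set of keys in `templates`, adding a new word in this sketch only requires adding its recorded posterior sequences, with no retraining of the MLP.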
Pages: 3809-3812
Number of pages: 4