Compact Acoustic Models for Embedded Speech Recognition

被引:0
|
作者
Christophe Lévy
Georges Linarès
Jean-François Bonastre
机构
关键词
Speech Recognition; Gaussian Component; Acoustic Model; Relative Gain; Subspace Cluster;
D O I
暂无
中图分类号
学科分类号
摘要
Speech recognition applications are known to require a significant amount of resources. However, embedded speech recognition only authorizes few KB of memory, few MIPS, and small amount of training data. In order to fit the resource constraints of embedded applications, an approach based on a semicontinuous HMM system using state-independent acoustic modelling is proposed. A transformation is computed and applied to the global model in order to obtain each HMM state-dependent probability density functions, authorizing to store only the transformation parameters. This approach is evaluated on two tasks: digit and voice-command recognition. A fast adaptation technique of acoustic models is also proposed. In order to significantly reduce computational costs, the adaptation is performed only on the global model (using related speaker recognition adaptation techniques) with no need for state-dependent data. The whole approach results in a relative gain of more than 20% compared to a basic HMM-based system fitting the constraints.
引用
收藏
相关论文
共 50 条
  • [1] Compact Acoustic Models for Embedded Speech Recognition
    Levy, Christophe
    Linares, Georges
    Bonastre, Jean-Francois
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2009,
  • [2] A SIMPLIFIED SUBSPACE GAUSSIAN MIXTURE TO COMPACT ACOUSTIC MODELS FOR SPEECH RECOGNITION
    Bouallegue, Mohamed
    Matrouf, Driss
    Linares, Georges
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4896 - 4899
  • [3] Interpolation of Acoustic Models for Speech Recognition
    Fraga-Silva, Thiago
    Gauvain, Jean-Luc
    Lamel, Lori
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3346 - 3350
  • [4] Compact and robust speech recognition for embedded use on microprocessors
    Hataoka, N
    Kokubo, H
    Obuchi, Y
    Amano, A
    PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2002, : 288 - 291
  • [5] Multilingual acoustic models for speech recognition and synthesis
    Kunzmann, S
    Fischer, V
    Gonzalez, J
    Emam, O
    Günther, C
    Janke, E
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 745 - 748
  • [6] Dynamically configurable acoustic models for speech recognition
    Hwang, MY
    Huang, XD
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 669 - 672
  • [7] Acoustic-to-Phrase Models for Speech Recognition
    Gaur, Yashesh
    Li, Jinyu
    Meng, Zhong
    Gong, Yifan
    INTERSPEECH 2019, 2019, : 2240 - 2244
  • [8] Achieving a reliable compact acoustic model for embedded speech recognition system with high confusion frequency model handling
    Park, Junho
    Ko, Hanseok
    SPEECH COMMUNICATION, 2006, 48 (06) : 737 - 745
  • [9] Acoustic Coprocessor for HMM based Embedded Speech Recognition Systems
    Bapat, Ojas A.
    Fastow, Richard M.
    Olson, Jens
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2013, 59 (03) : 629 - 633
  • [10] GMM-BASED ACOUSTIC MODELING FOR EMBEDDED SPEECH RECOGNITION
    Levy, Christophe
    Linares, Georges
    Bonastre, Jean-Francois
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1726 - 1729