Compact Acoustic Models for Embedded Speech Recognition

被引:0
|
作者
Christophe Lévy
Georges Linarès
Jean-François Bonastre
机构
关键词
Speech Recognition; Gaussian Component; Acoustic Model; Relative Gain; Subspace Cluster;
D O I
暂无
中图分类号
学科分类号
摘要
Speech recognition applications are known to require a significant amount of resources. However, embedded speech recognition only authorizes few KB of memory, few MIPS, and small amount of training data. In order to fit the resource constraints of embedded applications, an approach based on a semicontinuous HMM system using state-independent acoustic modelling is proposed. A transformation is computed and applied to the global model in order to obtain each HMM state-dependent probability density functions, authorizing to store only the transformation parameters. This approach is evaluated on two tasks: digit and voice-command recognition. A fast adaptation technique of acoustic models is also proposed. In order to significantly reduce computational costs, the adaptation is performed only on the global model (using related speaker recognition adaptation techniques) with no need for state-dependent data. The whole approach results in a relative gain of more than 20% compared to a basic HMM-based system fitting the constraints.
引用
收藏
相关论文
共 50 条
  • [31] Boosting Thai Syllable Speech Recognition Using Acoustic Models Combination
    Tangwongsan, Supachai
    Phoophuangpairoj, Rong
    ICCEE 2008: PROCEEDINGS OF THE 2008 INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING, 2008, : 568 - 572
  • [32] Emotional Speech Recognition Using Acoustic Models of Decomposed Component Words
    Kaveeta, Vivatchai
    Patanukhom, Karn
    2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 115 - 119
  • [33] Development & evaluation of different acoustic models for Malayalam continuous speech recognition
    Kurian, Cini
    Balakrishnan, Kannan
    INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND SYSTEM DESIGN 2011, 2012, 30 : 1081 - 1088
  • [34] Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition
    Sak, Hasim
    Senior, Andrew
    Rao, Kanishka
    Beaufays, Francoise
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1468 - 1472
  • [35] RETRIEVING SPEAKER INFORMATION FROM PERSONALIZED ACOUSTIC MODELS FOR SPEECH RECOGNITION
    Mdhaffar, Salima
    Bonastre, Jean-Francois
    Tommasi, Marc
    Tomashenko, Natalia
    Esteve, Yannick
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6767 - 6771
  • [36] The development of acoustic models for command and control arabic speech recognition system
    Nofal, M
    Reheem, EA
    El Henawy, H
    Abdel Kader, N
    ICEEC'04: 2004 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONIC AND COMPUTER ENGINEERING, PROCEEDINGS, 2004, : 702 - 705
  • [37] Language Independent and Unsupervised Acoustic Models for Speech Recognition and Keyword Spotting
    Knill, Kate M.
    Gales, Mark J. F.
    Ragni, Anton
    Rath, Shakti P.
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 16 - 20
  • [38] Acoustic models of the elderly for large-vocabulary continuous speech recognition
    Baba, A
    Yoshizawa, S
    Yamada, M
    Lee, A
    Shikano, K
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2004, 87 (07): : 49 - 57
  • [39] Lecture Speech Recognition by Combining Word Graphs of Various Acoustic Models
    Kosaka, Tetsuo
    Goto, Keisuke
    Ito, Takashi
    Kato, Masaharu
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2978 - 2981
  • [40] Unsupervised training of acoustic models for large vocabulary continuous speech recognition
    Wessel, F
    Ney, H
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 307 - 310