Speech recognition based on unified model of acoustic and language aspects of speech

被引:0
|
作者
机构
[1] Kubo, Yotaro
[2] Ogawa, Atsunori
[3] Hori, Takaaki
[4] Nakamura, Atsushi
来源
| 1600年 / Nippon Telegraph and Telephone Corp.卷 / 11期
关键词
Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
Automatic speech recognition has been attracting a lot of attention recently and is considered an important technique to achieve natural interaction between humans and machines. However, recognizing spontaneous speech is still considered to be difficult owing to the wide variety of patterns in spontaneous speech. We have been researching ways to overcome this problem and have developed a method to express both the acoustic and linguistic aspects of speech recognizers in a unified representation by integrating powerful frameworks of deep learning and a weighted finite-state transducer. We evaluated the proposed method ill an experiment to recognize a lecture speech dataset, which is coilsidered as a spontaneous speech dataset, and confirmed that the proposed method is promising for recognizing spontaneous speech.
引用
收藏
相关论文
共 50 条
  • [41] An Empirical Study of Language Model Integration for Transducer based Speech Recognition
    Zheng, Huahuan
    An, Keyu
    Ou, Zhijian
    Huang, Chen
    Ding, Ke
    Wan, Guanglu
    INTERSPEECH 2022, 2022, : 3904 - 3908
  • [42] Research on Syllable-Based Language Model in Malay Speech Recognition
    Wei, Xiangfeng
    Zhang, Quan
    Yuan, Yi
    2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 150 - 155
  • [43] A speech recognition model based on tri-phones for the arabic language
    Al-Diri, B.
    Sharieh, A.
    Qutiashat, M.
    Advances in Modelling and Analysis B, 2007, 50 (1-2): : 49 - 63
  • [44] Attention-based Contextual Language Model Adaptation for Speech Recognition
    Martinez, Richard Diehl
    Novotney, Scott
    Bulyko, Ivan
    Rastrow, Ariya
    Stolcke, Andreas
    Gandhe, Ankur
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 1994 - 2003
  • [45] Language model switching based on topic detection for dialog speech recognition
    Lane, IR
    Kawahara, T
    Matsui, T
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 616 - 619
  • [46] Document filtering based on spectral clustering for speech recognition language model
    Takahashi, Shinya
    Morimoto, Tsuyoshi
    Tsuruta, Naoyuki
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 393 - +
  • [47] A study of speech recognition based on RNN-RBM language model
    Li, Yaxiong, 1936, Science Press (51):
  • [48] Succeeding word prediction for speech recognition based on stochastic language model
    Zhou, M
    Nakagawa, S
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1996, E79D (04) : 333 - 342
  • [49] Language-independent and language-adaptive acoustic modeling for speech recognition
    Schultz, T
    Waibel, A
    SPEECH COMMUNICATION, 2001, 35 (1-2) : 31 - 51
  • [50] ACOUSTIC ASPECTS OF HUMAN SPEECH
    Ustinov, Yuri
    Volkov, Nikolay
    Degtev, Dmitry
    Nikitin, Sergey
    Tyunin, Vitaly
    Stoianova, Natalya
    7TH INTERNATIONAL CONFERENCE ON EDUCATION AND SOCIAL SCIENCES (INTCESS 2020), 2020, : 81 - 84