Speech recognition based on unified model of acoustic and language aspects of speech

Cited by: 0
Authors
Affiliations
[1] Kubo, Yotaro
[2] Ogawa, Atsunori
[3] Hori, Takaaki
[4] Nakamura, Atsushi
Source
Year: 1600 / Volume: Nippon Telegraph and Telephone Corp. / Issue: 11
Keywords
Deep learning;
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
Automatic speech recognition has been attracting a lot of attention recently and is considered an important technique for achieving natural interaction between humans and machines. However, recognizing spontaneous speech is still considered difficult owing to the wide variety of patterns it contains. We have been researching ways to overcome this problem and have developed a method that expresses both the acoustic and linguistic aspects of a speech recognizer in a unified representation by integrating two powerful frameworks: deep learning and the weighted finite-state transducer. We evaluated the proposed method in an experiment on a lecture speech dataset, which is regarded as a spontaneous speech dataset, and confirmed that the proposed method is promising for recognizing spontaneous speech.