Speech recognition based on unified model of acoustic and language aspects of speech

被引:0
|
作者
机构
[1] Kubo, Yotaro
[2] Ogawa, Atsunori
[3] Hori, Takaaki
[4] Nakamura, Atsushi
来源
| 1600年 / Nippon Telegraph and Telephone Corp.卷 / 11期
关键词
Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
Automatic speech recognition has been attracting a lot of attention recently and is considered an important technique to achieve natural interaction between humans and machines. However, recognizing spontaneous speech is still considered to be difficult owing to the wide variety of patterns in spontaneous speech. We have been researching ways to overcome this problem and have developed a method to express both the acoustic and linguistic aspects of speech recognizers in a unified representation by integrating powerful frameworks of deep learning and a weighted finite-state transducer. We evaluated the proposed method ill an experiment to recognize a lecture speech dataset, which is coilsidered as a spontaneous speech dataset, and confirmed that the proposed method is promising for recognizing spontaneous speech.
引用
收藏
相关论文
共 50 条
  • [31] LANGUAGE MODEL VERBALIZATION FOR AUTOMATIC SPEECH RECOGNITION
    Sak, Hasim
    Beaufays, Francoise
    Nakajima, Kaisuke
    Allauzen, Cyril
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8262 - 8266
  • [32] Acoustic Model Adaptation for Emotional Speech Recognition Using Twitter-Based Emotional Speech Corpus
    Kosaka, Tetsuo
    Aizawa, Yoshitaka
    Kato, Masaharu
    Nose, Takashi
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1747 - 1751
  • [33] An Acoustic Recognition Model for English Speech Based on Improved HMM Algorithm
    Ting, Wu
    2019 11TH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2019), 2019, : 729 - 732
  • [34] Comparison on Neural Network Based Acoustic Model in Mongolian Speech Recognition
    Zhang, Hongwei
    Bao, Feilong
    Gao, Guanglai
    Zhang, Hui
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 1 - 5
  • [35] SPEECH RECOGNITION - ACOUSTIC, PHONETIC AND FORMAL-LANGUAGE MODELS
    MERMELSTEIN, P
    LEVINSON, S
    BIOTELEMETRY, 1975, 2 (1-2) : 121 - 123
  • [36] Acoustic and Language Models Adaptation for Indonesian Spontaneous Speech Recognition
    Lestari, Dessi Puji
    Irfani, Angela
    2015 2ND INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS: CONCEPTS, THEORY AND APPLICATIONS ICAICTA, 2015,
  • [37] Evaluating Spoken Language Model Based on Filler Prediction Model in Speech Recognition
    Ohta, Kengo
    Tsuchiya, Masatoshi
    Nakagawa, Seiichi
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1558 - +
  • [38] Chameleon: A Language Model Adaptation Toolkit for Automatic Speech Recognition of Conversational Speech
    Song, Yuanfeng
    Jiang, Di
    Zhao, Weiwei
    Xu, Qian
    Wong, Raymond Chi-Wing
    Yang, Qiang
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, 2019, : 37 - 42
  • [39] Language Model for Speech Recognition of Power Grid Dispatching Based on BERT
    Chen L.
    Zheng W.
    Yu H.
    Fu J.
    Liu H.
    Xia J.
    Dianwang Jishu/Power System Technology, 2021, 45 (08): : 2955 - 2961
  • [40] A CACHE-BASED NATURAL-LANGUAGE MODEL FOR SPEECH RECOGNITION
    KUHN, R
    DEMORI, R
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (06) : 570 - 583