Speech recognition based on unified model of acoustic and language aspects of speech

被引：0

作者：

机构：

[1] Kubo, Yotaro

[2] Ogawa, Atsunori

[3] Hori, Takaaki

[4] Nakamura, Atsushi

来源：

| 1600年 / Nippon Telegraph and Telephone Corp.卷 / 11期

关键词：

Deep learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Automatic speech recognition has been attracting a lot of attention recently and is considered an important technique to achieve natural interaction between humans and machines. However, recognizing spontaneous speech is still considered to be difficult owing to the wide variety of patterns in spontaneous speech. We have been researching ways to overcome this problem and have developed a method to express both the acoustic and linguistic aspects of speech recognizers in a unified representation by integrating powerful frameworks of deep learning and a weighted finite-state transducer. We evaluated the proposed method ill an experiment to recognize a lecture speech dataset, which is coilsidered as a spontaneous speech dataset, and confirmed that the proposed method is promising for recognizing spontaneous speech.

引用

共 50 条

[21] Development of Hausa Acoustic Model for Speech Recognition
Ibrahim, Umar Adam
Boukar, Moussa Mahamat
Suleiman, Muhammad Aliyu
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (05) : 503 - 508
[22] Integration of Metamodel and Acoustic Model for Speech Recognition
Matsumasa, Hironori
Takiguchi, Tetsuya
Ariki, Yasuo
Li, Ichao
Nakabayashi, Toshitaka
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2234 - +
[23] SPEECH AND LANGUAGE ASPECTS
BRADLEY, DP
CLEFT PALATE JOURNAL, 1977, 14 (04): : 321 - 328
[24] LATENT DIRICHLIET LANGUAGE MODEL FOR SPEECH RECOGNITION
Chien, Jen-Tzung
Chueh, Chuang-Hua
2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 201 - 204
[25] Topic tracking language model for speech recognition
Watanabe, Shinji
Iwata, Tomoharu
Hori, Takaaki
Sako, Atsushi
Ariki, Yasuo
COMPUTER SPEECH AND LANGUAGE, 2011, 25 (02): : 440 - 461
[26] Language Model Score Regularization for Speech Recognition
ZHANG Yike
ZHANG Pengyuan
YAN Yonghong
Chinese Journal of Electronics, 2019, 28 (03) : 604 - 609
[27] TOPIC CACHE LANGUAGE MODEL FOR SPEECH RECOGNITION
Chueh, Chuang-Hua
Chien, Jen-Tzung
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5194 - 5197
[28] Language Model Score Regularization for Speech Recognition
Zhang Yike
Zhang Pengyuan
Yan Yonghong
CHINESE JOURNAL OF ELECTRONICS, 2019, 28 (03) : 604 - 609
[29] Topic cache language model for speech recognition
Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, 70101, Taiwan
ICASSP IEEE Int Conf Acoust Speech Signal Process Proc, 2010, (5194-5197):
[30] A language model for Amdo Tibetan speech recognition
Suan, Taiben
Cai, Rangzhuoma
Cai, Zhijie
Zu, Ba
Gong, Baojia
2020 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE COMMUNICATION AND NETWORK SECURITY (CSCNS2020), 2021, 336

← 1 2 3 4 5 →