English Speech Recognition System on Chip

Cited by: 0

Authors:

刘鸿 (Liu Hong) [1]

钱彦旻 (Qian Yanmin) [1]

刘加 (Liu Jia) [1]

Affiliation:

[1] Tsinghua National Laboratory for Information Science and Technology, Department of Electronic Engineering, Tsinghua University

Source:

Tsinghua Science and Technology | 2011, Vol. 16, No. 01

Funding:

National Natural Science Foundation of China

Keywords:

non-specific human voice-consciousness; system-on-chip; mel-frequency cepstral coefficients (MFCC)

DOI:

Not available

Chinese Library Classification number:

TN912.34 [Speech recognition and equipment]

Discipline classification code:

0711

Abstract:

An English speech recognition system was implemented on a chip, called a speech system-on-chip (SoC). The SoC included an application-specific integrated circuit with a vector accelerator to improve performance. The sub-word model, based on a continuous density hidden Markov model recognition algorithm, ran on a very cheap speech chip. The algorithm was a two-stage fixed-width beam-search baseline system with a variable beam-width pruning strategy and a frame-synchronous word-level pruning strategy to significantly reduce the recognition time. Tests show that this method reduces the recognition time nearly 6-fold and the memory size nearly 2-fold compared to the original system, with less than 1% accuracy degradation for a 600-word recognition task and a recognition accuracy rate of about 98%.

Pages: 95-99

Page count: 5

