English Speech Recognition System on Chip

被引:0
|
作者
刘鸿 [1 ]
钱彦旻 [1 ]
刘加 [1 ]
机构
[1] Tsinghua National Laboratory for Information Science and Technology,Department of Electronic Engineering,Tsinghua University
基金
中国国家自然科学基金;
关键词
non-specific human voice-consciousness; system-on-chip; mel-frequency cepstral coefficients(MFCC);
D O I
暂无
中图分类号
TN912.34 [语音识别与设备];
学科分类号
0711 ;
摘要
An English speech recognition system was implemented on a chip,called speech system-on-chip (SoC).The SoC included an application specific integrated circuit with a vector accelerator to improve performance.The sub-word model based on a continuous density hidden Markov model recognition algorithm ran on a very cheap speech chip.The algorithm was a two-stage fixed-width beam-search baseline system with a variable beam-width pruning strategy and a frame-synchronous word-level pruning strategy to significantly reduce the recognition time.Tests show that this method reduces the recognition time nearly 6 fold and the memory size nearly 2 fold compared to the original system,with less than 1% accuracy degradation for a 600 word recognition task and recognition accuracy rate of about 98%.
引用
收藏
页码:95 / 99
页数:5
相关论文
共 50 条
  • [41] Inequity in Popular Speech Recognition Systems for Accented English Speech
    Ike, Chinaemere
    Polsley, Seth
    Hammond, Tracy
    [J]. COMPANION PROCEEDINGS OF THE 27TH INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2022 COMPANION, 2022, : 66 - 68
  • [42] Lithuanian Speech Recognition Using the English Recognizer
    Kasparaitis, Pijus
    [J]. INFORMATICA, 2008, 19 (04) : 505 - 516
  • [43] Chinese-English bilingual speech recognition
    Yu, SM
    Hu, S
    Zhang, SW
    Xu, B
    [J]. 2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 603 - 609
  • [44] Automatic Speech Recognition in Diverse English Accents
    Mohyuddin, Hashir
    Kwak, Daehan
    [J]. 2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 714 - 718
  • [45] LIMITED RESOURCE SPEECH RECOGNITION FOR NIGERIAN ENGLISH
    Amuda, Sulyman
    Boril, Hynek
    Sangwan, Abhijeet
    Hansen, John H. L.
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5090 - 5093
  • [46] English Speech Recognition Based on Artificial Intelligence
    Bai, Tana
    [J]. AGRO FOOD INDUSTRY HI-TECH, 2017, 28 (03): : 2259 - 2263
  • [47] Improving English Conversational Telephone Speech Recognition
    Medennikov, Ivan
    Prudnikov, Alexey
    Zatvornitskiy, Alexander
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2 - 6
  • [48] Multimodal English corpus for automatic speech recognition
    Kunka, Bartosz
    Kupryjanow, Adam
    Dalka, Piotr
    Bratoszewski, Piotr
    Szczodrak, Maciej
    Spaleniak, Pawel
    Szykulski, Marcin
    Czyzewski, Andrzej
    [J]. 2013 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2013, : 106 - 111
  • [49] SPEECH RECOGNITION IN ENGLISH - AN EXAMINATION OF THE PSYCHOLINGUISTIC BASIS
    WOTSCHKE, I
    [J]. ZEITSCHRIFT FUR ANGLISTIK UND AMERIKANISTIK, 1982, 30 (01): : 50 - 65
  • [50] Research on Oral English Learning System Integrating AI Speech Data Recognition and Speech Quality Evaluation Algorithm
    Wang, Xue
    [J]. JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (05) : 2466 - 2477