Construction of an Online Cloud Platform for Zhuang Speech Recognition and Translation with Edge-Computing-Based Deep Learning Algorithm

被引:1
|
作者
Fan, Zeping [1 ,2 ]
Huang, Min [1 ,2 ]
Zhang, Xuejun [1 ,2 ,3 ]
Liu, Rongqi [1 ,2 ]
Lyu, Xinyi [1 ]
Duan, Taisen [1 ,2 ]
Bu, Zhaohui [4 ]
Liang, Jianghua [5 ]
机构
[1] Guangxi Univ, Sch Comp & Elect & Informat, Nanning 530004, Peoples R China
[2] Guangxi Univ, Guangxi Key Lab Multimedia Commun & Network Techno, Nanning 530004, Peoples R China
[3] Guangxi Big White & Little Black Robots Co Ltd, Nanning 530007, Peoples R China
[4] Guangxi Univ, Sch Foreign Language, Nanning 530004, Peoples R China
[5] Guangxi Univ, Sch Journalism & Commun, Nanning 530004, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 22期
关键词
automatic speech recognition; natural language processing; neural machine translation; transformer; cloud edge computing; network programming;
D O I
10.3390/app132212184
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The Zhuang ethnic minority in China possesses its own ethnic language and no ethnic script. Cultural exchange and transmission encounter hurdles as the Zhuang rely exclusively on oral communication. An online cloud-based platform was required to enhance linguistic communication. First, a database of 200 h of annotated Zhuang speech was created by collecting standard Zhuang speeches and improving database quality by removing transcription inconsistencies and text normalization. Second, SAformerNet, a more efficient and accurate transformer-based automatic speech recognition (ASR) network, is achieved by inserting additional downsampling modules. Subsequently, a Neural Machine Translation (NMT) model for translating Zhuang into other languages is constructed by fine-tuning the BART model and corpus filtering strategy. Finally, for the network's responsiveness to real-world needs, edge-computing techniques are applied to relieve network bandwidth pressure. An edge-computing private cloud system based on FPGA acceleration is proposed to improve model operation efficiency. Experiments show that the most critical metric of the system, model accuracy, is above 93%, and inference time is reduced by 29%. The computational delay for multi-head self-attention (MHSA) and feed-forward network (FFN) modules has been reduced by 7.1 and 1.9 times, respectively, and terminal response time is accelerated by 20% on average. Generally, the scheme provides a prototype tool for small-scale Zhuang remote natural language tasks in mountainous areas.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Construction of a Smart Supply Chain for Sand Factory Using the Edge-Computing-Based Deep Learning Algorithm
    Li, Bin
    Zhang, Xuejun
    Ban, Yanjiao
    Xu, Xianfu
    Su, Wenjun
    Chen, Jingxian
    Zhang, Shan
    Li, Feng
    Liang, Zuopeng
    Zhou, Shengkai
    [J]. Scientific Programming, 2022, 2022
  • [2] Construction of a Smart Supply Chain for Sand Factory Using the Edge-Computing-Based Deep Learning Algorithm
    Li, Bin
    Zhang, Xuejun
    Ban, Yanjiao
    Xu, Xianfu
    Su, Wenjun
    Chen, Jingxian
    Zhang, Shan
    Li, Feng
    Liang, Zuopeng
    Zhou, Shengkai
    [J]. SCIENTIFIC PROGRAMMING, 2022, 2022
  • [3] A Deep Learning Image Recognition Method Based on Edge Cloud Computing
    Wei, Rui
    [J]. Engineering Intelligent Systems, 2023, 31 (01): : 5 - 12
  • [4] Edge-Computing-Based Knowledge Distillation and Multitask Learning for Partial Discharge Recognition
    Ji, Jinsheng
    Shu, Zhou
    Li, Hongqun
    Lai, Kai Xian
    Lu, Minshan
    Jiang, Guanlin
    Wang, Wensong
    Zheng, Yuanjin
    Jiang, Xudong
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [5] A Nephogram Recognition Algorithm Based on Cloud Computing Platform
    Li, Tao
    Wang, Lei
    Ren, Yongjun
    Li, Xiang
    [J]. 2019 INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2019, : 482 - 487
  • [6] Immersive online biometric authentication algorithm for online guiding based on face recognition and cloud-based mobile edge computing
    Su, Peng
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2023, 41 (1-2) : 133 - 154
  • [7] Immersive online biometric authentication algorithm for online guiding based on face recognition and cloud-based mobile edge computing
    Peng Su
    [J]. Distributed and Parallel Databases, 2023, 41 : 133 - 154
  • [9] Deep Learning Video Analytics Through Online Learning Based Edge Computing
    Liu, Heting
    Cao, Guohong
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (10) : 8193 - 8204
  • [10] Construction of Art Resource Platform Based on Distributed Pattern Recognition SoC Deep Learning Algorithm
    Qin, Yashuang
    [J]. IEEE CONSUMER ELECTRONICS MAGAZINE, 2024, 13 (04) : 81 - 89