Automatic Correction of Speech Recognized Mathematical Equations using Encoder-Decoder Attention Model

被引:1
|
作者
Mounika, Y. [1 ]
Tarakaram, Y. [1 ]
Prasanna, Y. Lakshmi [1 ]
Gupta, Deepa [1 ]
Pati, Peeta Basa [1 ]
机构
[1] Amrita Vishwa Vidyapeetham, Amrita Sch Comp, Dept Comp Sci & Engn, Bangalore, Karnataka, India
关键词
Phonetic errors; Automatic Speech Recognition (ASR); Mathematical equations; Encoder-Decoder; Attention; Word Error Rate (WER); Character Error Rate (CER);
D O I
10.1109/INDICON56171.2022.10039926
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Automated correction of common and systematic errors caused by the ASR system is part of the post-processing of automatic speech recognition (ASR). The ASR system's output is prone to grammatical, spelling and phonetic problems. The sentences which are the outputs from Automated Speech Recognition (ASR) models have been refined using error correction approaches to obtain a reduced word error rate (WER) and Character Error Rate (CER) than the initial ASR outputs. In this paper we propose a model that reduces the word error rate and character error rate of the speech recognized mathematical equations. The proposed model is the encoder-decoder with attention model.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Using FCOS and an Encoder-Decoder Model to Detect and Recognize Visual Mathematical Equations
    Wheelwright, Angel Jo
    Ng, Yiu-Kai
    [J]. 9TH INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING, ICMIP 2024, 2024, : 44 - 51
  • [2] Automatic Generation of Chinese Couplets with Attention Based Encoder-Decoder Model
    Yuan, Shengqiong
    Zhong, Luo
    Li, Lin
    Zhang, Rui
    [J]. 2019 2ND IEEE CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2019), 2019, : 65 - 70
  • [3] Embedding Encoder-Decoder With Attention Mechanism for Monaural Speech Enhancement
    Lan, Tian
    Ye, Wenzheng
    Lyu, Yilan
    Zhang, Junyi
    Liu, Qiao
    [J]. IEEE ACCESS, 2020, 8 : 96677 - 96685
  • [4] Encoder-Decoder Model for Automatic Video Captioning Using Yolo Algorithm
    Alkalouti, Hanan Nasser
    Al Masre, Mayada Ahmed
    [J]. 2021 IEEE INTERNATIONAL IOT, ELECTRONICS AND MECHATRONICS CONFERENCE (IEMTRONICS), 2021, : 718 - 721
  • [5] Dynamic-attention based Encoder-decoder model for Speaker Extraction with Anchor speech
    Li, Hao
    Zhang, Xueliang
    Gao, Guanglai
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 297 - 301
  • [6] A Dual Attention Encoder-Decoder Text Summarization Model
    Hakami, Nada Ali
    Mahmoud, Hanan Ahmed Hosni
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (02): : 3697 - 3710
  • [7] Automatic colon polyp detection using Convolutional Encoder-Decoder model
    Bardhi, Ornela
    Sierra-Sosa, Daniel
    Garcia-Zapirain, Begonya
    Elmaghraby, Adel
    [J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2017, : 445 - 448
  • [8] An Automatic Grammar Error Correction Model Based on Encoder-Decoder Structure for English Texts
    Wang, Jiahao
    Huang, Guimin
    Wang, Yabing
    [J]. EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2022, 10 (01)
  • [9] Segmental Encoder-Decoder Models for Large Vocabulary Automatic Speech Recognition
    Beck, Eugen
    Hannemann, Mirko
    Doetsch, Patrick
    Schlueter, Ralf
    Ney, Hermann
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 766 - 770
  • [10] Arabic Machine Transliteration using an Attention-based Encoder-decoder Model
    Ameur, Mohamed Seghir Hadj
    Meziane, Farid
    Guessoum, Ahmed
    [J]. ARABIC COMPUTATIONAL LINGUISTICS (ACLING 2017), 2017, 117 : 287 - 297