Automatic Correction of Speech Recognized Mathematical Equations using Encoder-Decoder Attention Model

被引：1

作者：

Mounika, Y. ^{[1
]}

Tarakaram, Y. ^{[1
]}

Prasanna, Y. Lakshmi ^{[1
]}

Gupta, Deepa ^{[1
]}

Pati, Peeta Basa ^{[1
]}

机构：

[1] Amrita Vishwa Vidyapeetham, Amrita Sch Comp, Dept Comp Sci & Engn, Bangalore, Karnataka, India

来源：

2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON | 2022年

关键词：

Phonetic errors; Automatic Speech Recognition (ASR); Mathematical equations; Encoder-Decoder; Attention; Word Error Rate (WER); Character Error Rate (CER);

D O I：

10.1109/INDICON56171.2022.10039926

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Automated correction of common and systematic errors caused by the ASR system is part of the post-processing of automatic speech recognition (ASR). The ASR system's output is prone to grammatical, spelling and phonetic problems. The sentences which are the outputs from Automated Speech Recognition (ASR) models have been refined using error correction approaches to obtain a reduced word error rate (WER) and Character Error Rate (CER) than the initial ASR outputs. In this paper we propose a model that reduces the word error rate and character error rate of the speech recognized mathematical equations. The proposed model is the encoder-decoder with attention model.

引用

页数：6

共 50 条

[1] Using FCOS and an Encoder-Decoder Model to Detect and Recognize Visual Mathematical Equations
Wheelwright, Angel Jo
Ng, Yiu-Kai
[J]. 9TH INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING, ICMIP 2024, 2024, : 44 - 51
[2] Automatic Generation of Chinese Couplets with Attention Based Encoder-Decoder Model
Yuan, Shengqiong
Zhong, Luo
Li, Lin
Zhang, Rui
[J]. 2019 2ND IEEE CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2019), 2019, : 65 - 70
[3] Embedding Encoder-Decoder With Attention Mechanism for Monaural Speech Enhancement
Lan, Tian
Ye, Wenzheng
Lyu, Yilan
Zhang, Junyi
Liu, Qiao
[J]. IEEE ACCESS, 2020, 8 : 96677 - 96685
[4] Encoder-Decoder Model for Automatic Video Captioning Using Yolo Algorithm
Alkalouti, Hanan Nasser
Al Masre, Mayada Ahmed
[J]. 2021 IEEE INTERNATIONAL IOT, ELECTRONICS AND MECHATRONICS CONFERENCE (IEMTRONICS), 2021, : 718 - 721
[5] Dynamic-attention based Encoder-decoder model for Speaker Extraction with Anchor speech
Li, Hao
Zhang, Xueliang
Gao, Guanglai
[J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 297 - 301
[6] A Dual Attention Encoder-Decoder Text Summarization Model
Hakami, Nada Ali
Mahmoud, Hanan Ahmed Hosni
[J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (02): : 3697 - 3710
[7] Automatic colon polyp detection using Convolutional Encoder-Decoder model
Bardhi, Ornela
Sierra-Sosa, Daniel
Garcia-Zapirain, Begonya
Elmaghraby, Adel
[J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2017, : 445 - 448
[8] An Automatic Grammar Error Correction Model Based on Encoder-Decoder Structure for English Texts
Wang, Jiahao
Huang, Guimin
Wang, Yabing
[J]. EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2022, 10 (01)
[9] Segmental Encoder-Decoder Models for Large Vocabulary Automatic Speech Recognition
Beck, Eugen
Hannemann, Mirko
Doetsch, Patrick
Schlueter, Ralf
Ney, Hermann
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 766 - 770
[10] Arabic Machine Transliteration using an Attention-based Encoder-decoder Model
Ameur, Mohamed Seghir Hadj
Meziane, Farid
Guessoum, Ahmed
[J]. ARABIC COMPUTATIONAL LINGUISTICS (ACLING 2017), 2017, 117 : 287 - 297

← 1 2 3 4 5 →