Multi-Level Improvement for a Transcription Generated by Automatic Speech Recognition System for Arabic

被引：0

作者：

Amich, Heithem ^{[1
]}

Ben Mohamed, Mohamed ^{[1
]}

Zrigui, Mounir ^{[1
]}

机构：

[1] Monastir Fac Sci, LaTICE Lab, Monastir, Tunisia

来源：

INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY | 2019年 / 16卷 / 03期

关键词：

Automatic speech recognition; multi-level improvement; language model; semantic similarity; phonetic pruning;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we will propose a novel approach to improving an automatic speech recognition system. The proposed method constructs a search space based on the relations of semantic dependence of the output of a recognition system. Then, it applies syntactic and phonetic filters so as to choose the most probable hypotheses. To achieve this objective, different techniques are deployed, such as the word2vec or the language model Recurrent Neural Networks Language Models (RNNLM) or ever the language model tagged in addition to a phonetic pruning system. The obtained results showed that the proposed approach allowed to improve the accuracy of the system especially for the recognition of mispronounced words and irrelevant words.

引用

页码：460 / 466

页数：7

共 50 条

[21] Arabic Automatic Speech Recognition: A Systematic Literature Review
Dhouib, Amira
Othman, Achraf
El Ghoul, Oussama
Khribi, Mohamed Koutheair
Al Sinani, Aisha
APPLIED SCIENCES-BASEL, 2022, 12 (17):
[22] Lexical and Phonetic Modeling for Arabic Automatic Speech Recognition
Nguyen, Long
Ng, Tim
Nguyen, Kham
Zbib, Rabih
Makhoul, John
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 708 - +
[23] Development of Multi-Level Speech based Person Authentication System
Rohan Kumar Das
Sarfaraz Jelil
S. R. Mahadeva Prasanna
Journal of Signal Processing Systems, 2017, 88 : 259 - 271
[24] Multi-level annotation in the Emu speech database management system
Cassidy, S
Harrington, J
SPEECH COMMUNICATION, 2001, 33 (1-2) : 61 - 77
[25] Development of Multi-Level Speech based Person Authentication System
Das, Rohan Kumar
Jelil, Sarfaraz
Prasanna, S. R. Mahadeva
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2017, 88 (03): : 259 - 271
[26] Multi-Dialect Arabic Speech Recognition
Ali, Abbas Raza
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[27] Automatic Speech Recognition System for Malay Speaking Children Automatic Speech Recognition system
Rahman, Feisal Dani
Mohamed, Noraini
Mustafa, Mumtaz Begum
Salim, Siti Salwah
2014 THIRD ICT INTERNATIONAL STUDENT PROJECT CONFERENCE (ICT-ISPC), 2014, : 79 - 82
[28] THE ROYALFLUSH AUTOMATIC SPEECH DIARIZATION AND RECOGNITION SYSTEM FOR IN-CAR MULTI-CHANNEL AUTOMATIC SPEECH RECOGNITION CHALLENGE
Tian, Jingguang
Ye, Shuaishuai
Chen, Shunfei
Xiang, Yang
Yin, Zhaohui
Hu, Xinhui
Xu, Xinkang
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 1 - 2
[29] Improving Readability for Automatic Speech Recognition Transcription
Liao, Junwei
Eskimez, Sefik
Lu, Liyang
Shi, Yu
Gong, Ming
Shou, Linjun
Qu, Hong
Zeng, Michael
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (05)
[30] Evaluating the effect of using different transcription schemes in building a speech recognition system for Arabic
Eiman Alsharhan
Allan Ramsay
Hanady Ahmed
International Journal of Speech Technology, 2022, 25 : 43 - 56

← 1 2 3 4 5 →