Multi-Level Improvement for a Transcription Generated by Automatic Speech Recognition System for Arabic

Authors
Amich, Heithem [1]
Ben Mohamed, Mohamed [1]
Zrigui, Mounir [1]
Affiliation
[1] Monastir Fac Sci, LaTICE Lab, Monastir, Tunisia
Keywords
Automatic speech recognition; multi-level improvement; language model; semantic similarity; phonetic pruning
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose a novel approach to improving an automatic speech recognition system. The proposed method constructs a search space based on the semantic dependency relations found in the output of a recognition system. It then applies syntactic and phonetic filters to select the most probable hypotheses. To achieve this objective, different techniques are deployed, such as word2vec, the Recurrent Neural Network Language Model (RNNLM), or even a tagged language model, in addition to a phonetic pruning system. The results show that the proposed approach improves the accuracy of the system, especially for the recognition of mispronounced words and irrelevant words.
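The three stages named in the abstract (semantic search space, phonetic pruning, language-model selection) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy vectors stand in for word2vec embeddings, spelling-level edit distance stands in for a real phonetic comparison, a hand-written bigram table stands in for the RNNLM, and every name below (`EMBEDDINGS`, `BIGRAM_PROB`, `correct_word`, the transliterated tokens) is hypothetical.

```python
from math import sqrt

# Hypothetical 2-D vectors standing in for trained word2vec embeddings.
EMBEDDINGS = {
    "qaraa": (1.0, 0.0),
    "kitab": (0.9, 0.1),
    "maktaba": (0.8, 0.3),
    "kalb": (0.1, 0.9),
}

# Hypothetical bigram probabilities standing in for a language model score.
BIGRAM_PROB = {("qaraa", "kitab"): 0.6, ("qaraa", "maktaba"): 0.1}


def cosine(u, v):
    """Cosine similarity between two vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v)))


def edit_distance(a, b):
    """Levenshtein distance, used here as a crude proxy for phonetic distance."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]


def correct_word(context_word, recognized_word, min_sim=0.9, max_dist=2):
    """Propose a replacement for a suspicious recognized word."""
    ref = EMBEDDINGS[context_word]
    # 1) Search space: vocabulary words semantically close to the context word.
    candidates = [
        w for w, vec in EMBEDDINGS.items()
        if w != context_word and cosine(ref, vec) >= min_sim
    ]
    # 2) Phonetic pruning: keep candidates that resemble the recognized word.
    candidates = [w for w in candidates if edit_distance(w, recognized_word) <= max_dist]
    if not candidates:
        return recognized_word  # nothing plausible; keep the recognizer output
    # 3) Selection: pick the candidate the language model scores highest.
    return max(candidates, key=lambda w: BIGRAM_PROB.get((context_word, w), 0.0))


print(correct_word("qaraa", "katab"))
```

With these toy values, the recognized token `katab` following `qaraa` is replaced by `kitab`: `maktaba` passes the semantic filter but is dropped by the pruning step, and `kalb` never enters the search space. A real system would compare phoneme sequences rather than spellings and would score whole hypotheses rather than a single bigram.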
Pages: 460-466
Number of pages: 7