Deep Learning Framework with Confused Sub-Set Resolution Architecture for Automatic Arabic Diacritization

被引:29
|
作者
Rashwan, Mohsen A. A. [1 ,2 ]
Al Sallab, Ahmad A. [3 ,4 ]
Raafat, Hazem M. [5 ]
Rafea, Ahmed [6 ]
机构
[1] Engn Co Dev Comp Syst RDI, Giza 12613, Egypt
[2] Cairo Univ, Fac Engn, Dept Elect & Elect Commun, Giza 00202, Egypt
[3] Valeo Interbranch Automot Software, Giza, Egypt
[4] Cairo Univ, Fac Engn, Dept Elect & Elect Commun, Giza 12613, Egypt
[5] Kuwait Univ, Dept Comp Sci, Safat 13060, Kuwait
[6] Amer Univ Cairo, Dept Comp Sci, Cairo 11835, Egypt
关键词
Arabic diacritization; classifier design; deep networks; part-of-speech (PoS) tagging;
D O I
10.1109/TASLP.2015.2395255
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The Arabic language belongs to a group of languages that require diacritization over their characters. Modern Standard Arabic (MSA) transcripts omit the diacritics, which are essential for many machine learning tasks like Text-To-Speech (TTS) systems. In this work Arabic diacritics restoration is tackled under a deep learning framework that includes the Confused Sub-set Resolution (CSR) method to improve the classification accuracy, in addition to an Arabic Part-of-Speech (PoS) tagging framework using deep neural nets. Special focus is given to syntactic diacritization, which still suffers low accuracy as indicated in prior works. Evaluation is done versus state-of-the-art systems reported in literature, with quite challenging datasets collected from different domains. Standard datasets like the LDC Arabic Tree Bank are used in addition to custom ones we have made available online to allow other researchers to replicate these results. Results show significant improvement of the proposed techniques over other approaches, reducing the syntactic classification error to 9.9% and morphological classification error to 3% compared to 12.7% and 3.8% of the best reported results in literature, improving the error by 22% over the best reported systems.
引用
收藏
页码:505 / 516
页数:12
相关论文
共 50 条
  • [31] Modern Architecture for Deep learning-based Automatic Optical Inspection
    Richter, Johannes
    Streitferdt, Detlef
    [J]. 2019 IEEE 43RD ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 2, 2019, : 141 - 145
  • [32] Training and validation of a deep learning architecture for the automatic analysis of coronary angiography
    Du, Tianming
    Xie, Lihua
    Zhang, Honggang
    Liu, Xuqing
    Wang, Xiaofei
    Chen, Donghao
    Xu, Yang
    Sun, Zhongwei
    Zhou, Wenhui
    Song, Lei
    Guan, Changdong
    Lansky, Alexandra J.
    Xu, Bo
    [J]. EUROINTERVENTION, 2021, 17 (01) : 32 - +
  • [33] Sch-net: a deep learning architecture for automatic detection of schizophrenia
    Jia Fu
    Sen Yang
    Fei He
    Ling He
    Yuanyuan Li
    Jing Zhang
    Xi Xiong
    [J]. BioMedical Engineering OnLine, 20
  • [34] MalDozer: Automatic framework for android malware detection using deep learning
    Karbab, ElMouatez Billah
    Debbabi, Mourad
    Derhab, Abdelouahid
    Mouheb, Djedjiga
    [J]. DIGITAL INVESTIGATION, 2018, 24 : S48 - S59
  • [35] A Deep Learning Framework Design for Automatic Blastocyst Evaluation With Multifocal Images
    Wang, Shanshan
    Zhou, Cong
    Zhang, Dan
    Chen, Lei
    Sun, Haixiang
    [J]. IEEE ACCESS, 2021, 9 : 18927 - 18934
  • [36] Multistage Framework for Automatic Face Mask Detection Using Deep Learning
    Sowmya, K. N.
    Rekha, P. M.
    Kumari, Trishala
    Debtera, Baru
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [37] A generalized deep learning framework for automatic rheumatoid arthritis severity grading
    More, Sujeet
    Singla, Jimmy
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 7603 - 7614
  • [38] Deep IVUS: A machine learning framework for fully automatic IVUS segmentation
    Molony, David
    Hosseini, Hossein
    Samady, Habib
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2018, 72 (13) : B1 - B1
  • [39] ArRASA: Channel Optimization for Deep Learning-Based Arabic NLU Chatbot Framework
    Alruily, Meshrif
    [J]. ELECTRONICS, 2022, 11 (22)
  • [40] A Bidirectional Arabic Sign Language Framework Using Deep Learning and Fuzzy Matching Score
    Mosleh, Mogeeb A. A.
    Assiri, Adel
    Gumaei, Abdu H.
    Alkhamees, Bader Fahad
    Al-Qahtani, Manal
    [J]. MATHEMATICS, 2024, 12 (08)