A Novel Deep-learning based Approach for Automatic Diacritization of Arabic Poems using Sequence-to-Sequence Model

被引:0
|
作者
Mahmoud, Mohamed S. [1 ]
Negied, Nermin [2 ,3 ]
机构
[1] Tech Univ Munich, Sch Informat, Munich, Germany
[2] Sch Commun & Informat Engn, Giza, Egypt
[3] Nile Univ, Sch Engn & Appl Sci, Giza, Egypt
关键词
Text diacritization; deep learning; sequence-to-sequence; regex; tokenization; ANLP;
D O I
10.14569/IJACSA.2023.0140105
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Over the last 10 years, Arabic language have attracted researchers in the area of Natural Language Processing (NLP). A lot of research papers suddenly emerged in which the main work was the processing of Arabic language and its dialects too. Arabic language processing has been given a special name ANLP (Arabic Natural Language Processing). A lot of ANLP work can be found in literature including almost all NLP applications. Many researchers have been attracted also to Arabic linguistic knowledge. The work expands from Basic Language Analysis to Semantic Level Analysis. But Arabic text semantic analysis cannot be held without considering diacritization, which can greatly affect the meaning. Many Arabic texts are written without diacritization, and Diacritizing them manually is a very tiresome process that may need an expert. Automatic diacritization systems became a demand as an initial step for processing Arabic text for any Arabic Language Processing application as Arabic diacritization is very important to get a readable and understandable Arabic text. For this reason, many researchers recently worked on building systems and tools that automatically diacritize un-diacritized Arabic texts. This work presents a novel deep learning-based sequence -to-sequence model to diacritize un-diacritized Arabic poems. The proposed model was tested and achieved high diacritization accuracy rate.
引用
收藏
页码:42 / 46
页数:5
相关论文
共 50 条
  • [1] Masdar: A Novel Sequence-to-Sequence Deep Learning Model for Arabic Stemming
    Fouad, Mohammed M.
    Mahany, Ahmed
    Katib, Iyad
    [J]. INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, 2020, 1038 : 363 - 373
  • [2] Simple Extensible Deep Learning Model for Automatic Arabic Diacritization
    Abbad, Hamza
    Xiong, Shengwu
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (02)
  • [3] A novel sequence-to-sequence based deep learning model for satellite cloud image time series prediction
    Lian, Jie
    Wu, Shixin
    Huang, Sirong
    Zhao, Qin
    [J]. ATMOSPHERIC RESEARCH, 2024, 306
  • [4] DYNAMIC FORECASTING OF PATIENT-SPECIFIC KIDNEY TRANSPLANT FUNCTION WITH A SEQUENCE-TO-SEQUENCE DEEP-LEARNING MODEL
    van Loon, Elisabet
    Zhang, Wanqiu
    van Craenenbroeck, Amaryllis
    de Moor, Bart
    Naesens, Maarten
    [J]. TRANSPLANT INTERNATIONAL, 2021, 34 : 62 - 62
  • [5] A sequence-to-sequence model-based deep learning approach for recognizing activity of daily living for senior care
    Zhu, Hongyi
    Chen, Hsinchun
    Brown, Randall
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 84 : 148 - 158
  • [6] A Novel Deep Learning Attention Based Sequence to Sequence Model for Automatic Abstractive Text Summarization
    Yousef Methkal Abd Algani
    [J]. International Journal of Information Technology, 2024, 16 (6) : 3597 - 3603
  • [7] Time Series Forecasting using Sequence-to-Sequence Deep Learning Framework
    Du, Shengdong
    Li, Tianrui
    Horng, Shi-Jinn
    [J]. 2018 9TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP 2018), 2018, : 171 - 176
  • [8] A deep trajectory clustering method based on sequence-to-sequence autoencoder model
    Wang, Chao
    Lyu, Fangzheng
    Wu, Sensen
    Wang, Yuanyuan
    Xu, Liuchang
    Zhang, Feng
    Wang, Shaowen
    Wang, Yongheng
    Du, Zhenhong
    [J]. TRANSACTIONS IN GIS, 2022, 26 (04) : 1801 - 1820
  • [9] Regional prediction of ground-level ozone using a hybrid sequence-to-sequence deep learning approach
    Wang, Hong-Wei
    Li, Xiao-Bing
    Wang, Dongsheng
    Zhao, Juanhao
    He, Hong-di
    Peng, Zhong-Ren
    [J]. JOURNAL OF CLEANER PRODUCTION, 2020, 253
  • [10] Sequence-to-Sequence Acoustic-to-Phonetic Conversion Using Spectrograms and Deep Learning
    Qamhan, Mustafa A.
    Alotaibi, Yousef Ajami
    Seddiq, Yasser Mohammad
    Meftah, Ali Hamid
    Selouani, Sid Ahmed
    [J]. IEEE Access, 2021, 9 : 80209 - 80220