Arabic Text Generation Using Recurrent Neural Networks

被引:5
|
作者
Souri, Adnan [1 ]
El Maazouzi, Zakaria [1 ]
Al Achhab, Mohammed [1 ]
Eddine El Mohajir, Badr [1 ]
机构
[1] Abdelmalek Essaadi Univ, New Trend Technol Team, Natl Sch Appl Sci, Tetouan, Morocco
关键词
Arabic NLP; Recurrent Neural Networks; Text generation;
D O I
10.1007/978-3-319-96292-4_41
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we applied Recurrent Neural Networks (RNNs) Language Model on Arabic Language by training and testing it on "Arab World Books" and "Hindawi" free Arabic text datasets. While the standard architecture of RNNs does not match ideally with Arabic, we adapted a RNN model to deal with Arabic features. Our proposition in this paper is a gated Long-Short Term Memory (LSTM) model responding to some Arabic language criteria. As originality of the paper, we demonstrate the power of our LSTM model in generating Arabic text comparing to the standard LSTM model. Our results, comparing to English and Chinese text generation, have been promising and gave sufficient accuracy.
引用
收藏
页码:523 / 533
页数:11
相关论文
共 50 条
  • [1] Automatic diacritization of Arabic text using recurrent neural networks
    Gheith A. Abandah
    Alex Graves
    Balkees Al-Shagoor
    Alaa Arabiyat
    Fuad Jamour
    Majid Al-Taee
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2015, 18 : 183 - 197
  • [2] Automatic diacritization of Arabic text using recurrent neural networks
    Abandah, Gheith A.
    Graves, Alex
    Al-Shagoor, Balkees
    Arabiyat, Alaa
    Jamour, Fuad
    Al-Taee, Majid
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2015, 18 (02) : 183 - 197
  • [3] Automatic Text Generation in Macedonian Using Recurrent Neural Networks
    Milanova, Ivona
    Sarvanoska, Ksenija
    Srbinoski, Viktor
    Gjoreski, Hristijan
    [J]. ICT INNOVATIONS 2019: BIG DATA PROCESSING AND MINING, 2019, 1110 : 1 - 12
  • [4] Automatic dottization of Arabic text (Rasms) using deep recurrent neural networks
    Alhathloul, Zainab
    Ahmad, Irfan
    [J]. PATTERN RECOGNITION LETTERS, 2022, 162 : 47 - 55
  • [5] Robust Arabic Text Categorization by Combining Convolutional and Recurrent Neural Networks
    Ameur, Mohamed Seghir Hadj
    Belkebir, Riadh
    Guessoum, Ahmed
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (05)
  • [6] Investigating Hybrid Approaches for Arabic Text Diacritization with Recurrent Neural Networks
    Alqudah, Saba'
    Abandah, Gheith
    Arabiyat, Alaa
    [J]. 2017 IEEE JORDAN CONFERENCE ON APPLIED ELECTRICAL ENGINEERING AND COMPUTING TECHNOLOGIES (AEECT), 2017,
  • [7] Recognition of printed Arabic text using neural networks
    Amin, A
    Mansoor, W
    [J]. PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, 1997, : 612 - 615
  • [8] Arabic Text Diacritization Using Deep Neural Networks
    Fadel, Ali
    Tuffaha, Ibraheem
    Al-Jawarneh, Bara
    Al-Ayyoub, Mahmoud
    [J]. 2019 2ND INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS & INFORMATION SECURITY (ICCAIS), 2019,
  • [9] High Performance Urdu and Arabic Video Text Recognition Using Convolutional Recurrent Neural Networks
    Rehman, Abdul
    Ul-Hasan, Adnan
    Shafait, Faisal
    [J]. DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021 WORKSHOPS, PT I, 2021, 12916 : 336 - 352
  • [10] Arabic speech recognition using recurrent neural networks
    El Choubassi, MM
    El Khoury, HE
    Alagha, CEJ
    Skaf, JA
    Al-Alaoui, MA
    [J]. PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 543 - 547