Arabic text summarization using deep learning approach

被引:22
|
作者
Al-Maleh, Molham [1 ]
Desouki, Said [2 ]
机构
[1] Higher Inst Appl Sci & Technol, Fac Informat Technol, Damascus, Syria
[2] Arab Int Univ, Fac Informat & Commun Engn, Damascus, Syria
关键词
Natural language processing; Text summarization; Deep learning; Big data; Sequence-to-sequence framework;
D O I
10.1186/s40537-020-00386-7
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Natural language processing has witnessed remarkable progress with the advent of deep learning techniques. Text summarization, along other tasks like text translation and sentiment analysis, used deep neural network models to enhance results. The new methods of text summarization are subject to a sequence-to-sequence framework of encoder-decoder model, which is composed of neural networks trained jointly on both input and output. Deep neural networks take advantage of big datasets to improve their results. These networks are supported by the attention mechanism, which can deal with long texts more efficiently by identifying focus points in the text. They are also supported by the copy mechanism that allows the model to copy words from the source to the summary directly. In this research, we are re-implementing the basic summarization model that applies the sequence-to-sequence framework on the Arabic language, which has not witnessed the employment of this model in the text summarization before. Initially, we build an Arabic data set of summarized article headlines. This data set consists of approximately 300 thousand entries, each consisting of an article introduction and the headline corresponding to this introduction. We then apply baseline summarization models to the previous data set and compare the results using the ROUGE scale.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Bilingual Automatic Text Summarization Using Unsupervised Deep Learning
    Singh, Shashi Pal
    Kumar, Ajai
    Mangal, Abhilasha
    Singhal, Shikha
    [J]. 2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), 2016, : 1195 - 1200
  • [22] Automatic Text Summarization Using Deep Reinforcement Learning and Beyond
    Sun, Gang
    Wang, Zhongxin
    Zhao, Jia
    [J]. INFORMATION TECHNOLOGY AND CONTROL, 2021, 50 (03): : 458 - 469
  • [23] A Hybrid Approach for Arabic Text Summarization Using Domain Knowledge and Genetic Algorithms
    Qasem A. Al-Radaideh
    Dareen Q. Bataineh
    [J]. Cognitive Computation, 2018, 10 : 651 - 669
  • [24] A Hybrid Approach for Arabic Text Summarization Using Domain Knowledge and Genetic Algorithms
    Al-Radaideh, Qasem A.
    Bataineh, Dareen Q.
    [J]. COGNITIVE COMPUTATION, 2018, 10 (04) : 651 - 669
  • [25] Abstractive text summarization using deep learning with a new Turkish summarization benchmark dataset
    Ertam, Fatih
    Aydin, Galip
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (09):
  • [26] Automatic Arabic Text Summarization Using Analogical Proportions
    Bilel Elayeb
    Amina Chouigui
    Myriam Bounhas
    Oussama Ben Khiroun
    [J]. Cognitive Computation, 2020, 12 : 1043 - 1069
  • [27] Automatic Arabic Text Summarization Using Analogical Proportions
    Elayeb, Bilel
    Chouigui, Amina
    Bounhas, Myriam
    Ben Khiroun, Oussama
    [J]. COGNITIVE COMPUTATION, 2020, 12 (05) : 1043 - 1069
  • [28] Cyberbullying Detection Model for Arabic Text Using Deep Learning
    Albayari, Reem
    Abdallah, Sherief
    Shaalan, Khaled
    [J]. JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2024,
  • [29] Sarcasm Detection in Arabic Short Text using Deep Learning
    Al-Jamal, Wafa' Q.
    Mustafa, Ahmad M.
    Ali, Mostafa Z.
    [J]. 2022 13TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2022, : 362 - 366
  • [30] Extractive Arabic Text Summarization-Graph-Based Approach
    AL-Khassawneh, Yazan Alaya
    Hanandeh, Essam Said
    [J]. ELECTRONICS, 2023, 12 (02)