Abstractive text summarization using deep learning with a new Turkish summarization benchmark dataset

被引:3
|
作者
Ertam, Fatih [1 ]
Aydin, Galip [2 ]
机构
[1] Firat Univ, Technol Fac, Dept Digital Forens Engn, Elazig, Turkey
[2] Firat Univ, Engn Fac, Dept Comp Engn, Elazig, Turkey
来源
关键词
abstract summarization; deep learning; information retrieval; text summarization; web scraping; FRAMEWORK; MODELS;
D O I
10.1002/cpe.6482
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Exponential increase in the amount of textual data made available on the Internet results in new challenges in terms of accessing information accurately and quickly. Text summarization can be defined as reducing the dimensions of the expressions to be summarized without spoiling the meaning. Summarization can be performed as extractive and abstractive or using both together. In this study, we focus on abstractive summarization which can produce more human-like summarization results. For the study we created a Turkish news summarization benchmark dataset from various news agency web portals by crawling the news title, short news, news content, and keywords for the last 5 years. The dataset is made publicly available for researchers. The deep learning network training was carried out by using the news headlines and short news contents from the prepared dataset and then the network was expected to create the news headline as the short news summary. To evaluate the performance of this study, Rouge-1, Rouge-2, and Rouge-L were compared using precision, sensitivity and F1 measure scores. Performance values for the study were presented for each sentence as well as by averaging the results for 50 randomly selected sentences. The F1 Measure values are 0.4317, 0.2194, and 0.4334 for Rouge-1, Rouge-2, and Rouge-L respectively. Performance results show that the approach is promising for Turkish text summarization studies and the prepared dataset will add value to the literature.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] An abstractive text summarization using deep learning in Assamese
    Goutom P.J.
    Baruah N.
    Sonowal P.
    [J]. International Journal of Information Technology, 2023, 15 (5) : 2365 - 2372
  • [2] Deep Learning Based Abstractive Turkish News Summarization
    Karakoc, Enise
    Yilmaz, Burcu
    [J]. 2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [3] Abstractive Arabic Text Summarization Based on Deep Learning
    Wazery, Y. M.
    Saleh, Marwa E.
    Alharbi, Abdullah
    Ali, Abdelmgeid A.
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [4] Abstractive text summarization using adversarial learning and deep neural network
    Meenaxi Tank
    Priyank Thakkar
    [J]. Multimedia Tools and Applications, 2024, 83 : 50849 - 50870
  • [5] Abstractive text summarization using adversarial learning and deep neural network
    Tank, Meenaxi
    Thakkar, Priyank
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (17) : 50849 - 50870
  • [6] Abstractive Text Summarization Using Hybrid Technique of Summarization
    Liaqat, Muhammad Irfan
    Hamid, Isma
    Nawaz, Qamar
    Shafique, Nida
    [J]. 2022 14TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2022), 2022, : 141 - 144
  • [7] WikiLingua: A New Benchmark Dataset for Cross-Lingual Abstractive Summarization
    Ladhak, Faisal
    Durmus, Esin
    Cardie, Claire
    McKeown, Kathleen
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4034 - 4048
  • [8] INDOSUM: A New Benchmark Dataset for Indonesian Text Summarization
    Kurniawan, Kemal
    Louvan, Samuel
    [J]. 2018 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2018, : 215 - 220
  • [9] Turkish abstractive text document summarization using text to text transfer transformer
    Ay, Betul
    Ertam, Fatih
    Fidan, Guven
    Aydin, Galip
    [J]. ALEXANDRIA ENGINEERING JOURNAL, 2023, 68 : 1 - 13
  • [10] Deep reinforcement and transfer learning for abstractive text summarization: A review
    Alomari, Ayham
    Idris, Norisma
    Sabri, Aznul Qalid Md
    Alsmadi, Izzat
    [J]. COMPUTER SPEECH AND LANGUAGE, 2022, 71