Abstractive text summarization using deep learning with a new Turkish summarization benchmark dataset

Cited by: 3
Authors
Ertam, Fatih [1]
Aydin, Galip [2]
Affiliations
[1] Firat Univ, Technol Fac, Dept Digital Forens Engn, Elazig, Turkey
[2] Firat Univ, Engn Fac, Dept Comp Engn, Elazig, Turkey
Keywords
abstract summarization; deep learning; information retrieval; text summarization; web scraping; FRAMEWORK; MODELS
DOI
10.1002/cpe.6482
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
The exponential growth of textual data available on the Internet creates new challenges for accessing information accurately and quickly. Text summarization can be defined as shortening a text without distorting its meaning. Summarization can be extractive, abstractive, or a combination of both. In this study, we focus on abstractive summarization, which can produce more human-like summaries. For the study, we created a Turkish news summarization benchmark dataset by crawling the news title, short news, news content, and keywords from various news agency web portals for the last 5 years. The dataset is publicly available to researchers. The deep learning network was trained on the news headlines and short news contents from the prepared dataset, and was then expected to generate the news headline as the summary of the short news. To evaluate performance, Rouge-1, Rouge-2, and Rouge-L were compared using precision, recall (sensitivity), and F1 scores. Performance values are reported for each sentence and as averages over 50 randomly selected sentences. The F1 scores are 0.4317, 0.2194, and 0.4334 for Rouge-1, Rouge-2, and Rouge-L, respectively. The results show that the approach is promising for Turkish text summarization and that the prepared dataset will add value to the literature.
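As an illustration of the metrics named in the abstract, the following is a minimal sketch of how Rouge-N and Rouge-L precision, recall, and F1 can be computed for one candidate/reference pair. It is not the authors' evaluation code: the function names are hypothetical, and lowercasing plus whitespace tokenization is an assumption (published ROUGE tools may also apply stemming or other preprocessing, which would change the scores).

```python
from collections import Counter


def _ngrams(tokens, n):
    """Count the n-grams in a token list (empty Counter if too short)."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def _prf(overlap, cand_total, ref_total):
    """Turn an overlap count into (precision, recall, F1)."""
    p = overlap / cand_total if cand_total else 0.0
    r = overlap / ref_total if ref_total else 0.0
    f = 2 * p * r / (p + r) if p + r else 0.0
    return p, r, f


def rouge_n(candidate, reference, n):
    """Rouge-N: clipped n-gram overlap between candidate and reference."""
    # Assumption: lowercase + whitespace tokenization, no stemming.
    cand = _ngrams(candidate.lower().split(), n)
    ref = _ngrams(reference.lower().split(), n)
    overlap = sum((cand & ref).values())  # multiset intersection clips counts
    return _prf(overlap, sum(cand.values()), sum(ref.values()))


def rouge_l(candidate, reference):
    """Rouge-L: based on the longest common subsequence (LCS) of tokens."""
    a = candidate.lower().split()
    b = reference.lower().split()
    prev = [0] * (len(b) + 1)  # one row of the LCS dynamic-programming table
    for x in a:
        cur = [0]
        for j, y in enumerate(b, 1):
            cur.append(prev[j - 1] + 1 if x == y else max(prev[j], cur[j - 1]))
        prev = cur
    return _prf(prev[-1], len(a), len(b))


if __name__ == "__main__":
    generated = "the cat sat on the mat"
    target = "the cat is on the mat"
    for name, score in [
        ("Rouge-1", rouge_n(generated, target, 1)),
        ("Rouge-2", rouge_n(generated, target, 2)),
        ("Rouge-L", rouge_l(generated, target)),
    ]:
        print(name, "P=%.4f R=%.4f F1=%.4f" % score)
```

Averaging the per-pair F1 values over a sample of pairs would give corpus-level figures of the kind the abstract reports, although the exact averaging procedure used in the study is not stated here.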
Pages: 10
Related papers
50 in total
  • [41] Abstractive summarization with deep reinforcement learning using semantic similarity rewards
    Fikri, Figen Beken
    Oflazer, Kemal
    Yanikoglu, Berrin
    NATURAL LANGUAGE ENGINEERING, 2024, 30 (03) : 554 - 576
  • [43] Deep Learning Based Abstractive Text Summarization: Approaches, Datasets, Evaluation Measures, and Challenges
    Suleiman, Dima
    Awajan, Arafat
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [44] A Multi-Task Learning Framework for Abstractive Text Summarization
    Lu, Yao
    Liu, Linqing
    Jiang, Zhile
    Yang, Min
    Goebel, Randy
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019 : 9987 - 9988
  • [45] Text analysis for Bengali Text Summarization using Deep Learning
    Al Munzir, Abdullah
    Rahman, Md. Lutfor
    Abujar, Sheikh
    Ohidujjaman
    Hossain, Syed Akhter
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019
  • [46] End to End Urdu Abstractive Text Summarization With Dataset and Improvement in Evaluation Metric
    Raza, Hassan
    Shahzad, Waseem
    IEEE ACCESS, 2024, 12 : 40311 - 40324
  • [48] Abstractive text summarization and new large-scale datasets for agglutinative languages Turkish and Hungarian
    Baykara, Batuhan
    Gungor, Tunga
    LANGUAGE RESOURCES AND EVALUATION, 2022, 56 (03) : 973 - 1007
  • [49] Extractive text summarization using deep learning approach
    Yadav A.K.
    Singh A.
    Dhiman M.
    Vineet
    Kaundal R.
    Verma A.
    Yadav D.
    International Journal of Information Technology, 2022, 14 (5) : 2407 - 2415
  • [50] Arabic text summarization using deep learning approach
    Al-Maleh, Molham
    Desouki, Said
    JOURNAL OF BIG DATA, 2020, 7 (01)