Comparison of feature-based sentence ranking methods for extractive summarization of Turkish news texts

被引:0
|
作者
Erdagi, Erturk [1 ]
Tunali, Volkan [2 ]
机构
[1] Maltepe Univ, Grad Sch Educ, Dept Comp Engn, TR-34857 Istanbul, Turkiye
[2] Maltepe Univ, Fac Engn & Nat Sci, Dept Software Engn, TR-34857 Istanbul, Turkiye
关键词
Summarization; Extractive; Sentence Ranking; Major Vowel; Minor Vowel; MODELS;
D O I
10.14744/sigma.2023.00076
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Document summarization is the task of generating a shorter form of document with important information content. Automatic text summarization has been developed for this process and is still widely used. It is divided into two main parts as extractive summarization and abstractive summarization. In this study, we used sentence ranking methods for extractive summarization for Turkish news text within the scope of the experimental study. We used different summarization rates, 20%, 30%, 40%, 50% and 60%. Summarization results were evaluated with the ROUGE ve BLEU metrics. We proposed new methods based on major vowel harmony and minor vowel harmony features. We obtained high evaluation results in both ROUGE ve BLEU metrics with major vowel harmony and minor vowel harmony features. Additionally, we studied a hybrid model using major vowel harmony and minor vowel harmony rules together. We obtained the best results with major vowel harmony, minor vowel harmony, and hybrid model (major vowel harmony and minor vowel harmony together). We compared the three proposed methods with the BERTurk model prepared for Turkish based on Google BERT. The results obtained gave very close results to this state-of-the-art method and showed that it is worth developing.
引用
收藏
页码:321 / 334
页数:14
相关论文
共 50 条
  • [1] Feature-based Unsupervised Method for Salient Sentence Ranking in Text Summarization Task
    Nguyen Minh Phuong
    Le The Anh
    [J]. PROCEEDINGS OF THE 2024 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION TECHNOLOGY, ICIIT 2024, 2024, : 346 - 351
  • [2] Discovering Chinese sentence patterns for feature-based opinion summarization
    Huang, Shiu-Li
    Cheng, Wen-Chi
    [J]. ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2015, 14 (06) : 582 - 591
  • [3] Feature Priority Based Sentence Filtering Method for Extractive Automatic Text Summarization
    Meena, Yogesh Kumar
    Gopalani, Dinesh
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND CONVERGENCE (ICCC 2015), 2015, 48 : 728 - 734
  • [4] Extractive text summarization based on selectivity ranking
    University of Rijeka, Department of Informatics, Rijeka, Croatia
    不详
    [J]. Int. Conf. INnov. Intell. Syst. Appl., INISTA - Proc., 2021,
  • [5] Word-sentence co-ranking for automatic extractive text summarization
    Fang, Changjian
    Mu, Dejun
    Deng, Zhenghong
    Wu, Zhiang
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 72 : 189 - 195
  • [6] A Comparison of Multiple Approaches for the Extractive Summarization of Portuguese Texts
    Costa, Miguel
    Martins, Bruno
    [J]. LINGUAMATICA, 2015, 7 (01): : 23 - 40
  • [7] Extractive summarization based on word information and sentence position
    Cruz, CM
    Urrea, AM
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 653 - 656
  • [8] EXTRACTIVE TEXT SUMMARIZATION BY FEATURE- BASED SENTENCE EXTRACTION USING RULE-BASED CONCEPT
    Naik, Siya Sadashiv
    Gaonkar, Manisha Naik
    [J]. 2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2017, : 1364 - 1368
  • [9] Feature-Based Customer Review Summarization
    Maisto, Alessandro
    Pelosi, Serena
    [J]. ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2014 WORKSHOPS, 2014, 8842 : 299 - 308
  • [10] AREDSUM: Adaptive Redundancy-Aware Iterative Sentence Ranking for Extractive Document Summarization
    Bi, Keping
    Jha, Rahul
    Croft, W. Bruce
    Celikyilmaz, Asli
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 281 - 291