Feature-based Unsupervised Method for Salient Sentence Ranking in Text Summarization Task

被引:0
|
作者
Nguyen Minh Phuong [1 ]
Le The Anh [2 ]
机构
[1] Japan Adv Inst Sci & Technol, Nomi, Ishikawa, Japan
[2] FPT Univ, Can Tho, Vietnam
关键词
Unsupervised sentence scoring; salient sentence extraction; unsupervised multi-document summarization (mds);
D O I
10.1145/3654522.3654556
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Salient Sentence Ranking is an essential task that plays a vital role in Data Mining, especially in unsupervised document summarization tasks. In this paper, we introduce a simple yet effective unsupervised method to extract the salient sentences from a cluster of documents. Our method synthesizes the sentence scoring from various feature-based information containing position, topic, keyword, semantic, entity, sentence centroid -scores. The proposed method has the potential to generate large-scale pseudo-summary, which supports the tasks of summarization. To this end, our approach is able to incorporate pre-trained objectives used in pre-trained language models to diminish the problems of the lack of annotated datasets in low-resource languages like Vietnamese. We also conducted experiments to verify the effectiveness of various feature-based scoring methods and their combinations. Our experimental results on two well-known benchmark datasets, MultiNews and NewSHead, show the superiority of our proposed method compared with the previous unsupervised approaches.
引用
收藏
页码:346 / 351
页数:6
相关论文
共 50 条
  • [41] Feature-based Product Review Summarization Utilizing User Score
    Yang, Jung-Yeon
    Kim, Han-Joon
    Lee, Sang-Goo
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2010, 26 (06) : 1973 - 1990
  • [42] Chinese text classification method based on sentence information enhancement and feature fusion
    Zhu, Binglin
    Pan, Wei
    HELIYON, 2024, 10 (17)
  • [43] Feature based cluster ranking approach for single document summarization
    Sharaff A.
    Jain M.
    Modugula G.
    International Journal of Information Technology, 2022, 14 (4) : 2057 - 2065
  • [44] Feature-Based Sentence Extraction Using Fuzzy Inference rules
    Suanmali, Ladda
    Salim, Naomie
    Binwahlan, Mohammed Salem
    PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING SYSTEMS, 2009, : 511 - +
  • [45] A feature-based account of the relations signalled by sentence and clause connectives
    Knott, A
    Mellish, C
    LANGUAGE AND SPEECH, 1996, 39 : 143 - 183
  • [46] A FEATURE-BASED SENTENCE MODEL FOR EVALUATION OF SIMILAR ONLINE PRODUCTS
    Xu, Haiping
    Zhang, Yuhan
    Degroof, Richard
    JOURNAL OF ELECTRONIC COMMERCE RESEARCH, 2018, 19 (04): : 320 - 335
  • [47] Evaluation of a Sentence Ranker for Text Summarization Based on Roget's Thesaurus
    Kennedy, Alistair
    Szpakowicz, Stan
    TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 101 - 108
  • [48] Unsupervised Video Summarization Based on the Diffusion Model of Feature Fusion
    Yu, Qinghao
    Yu, Hui
    Sun, Ying
    Ding, Derui
    Jian, Muwei
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, : 1 - 12
  • [49] Endoscopy Video Summarization based on Unsupervised Learning and Feature Discrimination
    Ben Ismail, M. Maher
    Bchir, Ouiem
    Emam, Ahmed Z.
    2013 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP 2013), 2013,
  • [50] A NEW ENSEMBLE METHOD FOR FEATURE RANKING IN TEXT MINING
    Sadeghi, Sabereh
    Beigy, Hamid
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2013, 22 (03)