Feature-based Unsupervised Method for Salient Sentence Ranking in Text Summarization Task

Authors
Nguyen Minh Phuong [1]
Le The Anh [2]
Affiliations
[1] Japan Adv Inst Sci & Technol, Nomi, Ishikawa, Japan
[2] FPT Univ, Can Tho, Vietnam
Keywords
Unsupervised sentence scoring; salient sentence extraction; unsupervised multi-document summarization (MDS)
DOI
10.1145/3654522.3654556
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Salient sentence ranking is an essential task in data mining, especially in unsupervised document summarization. In this paper, we introduce a simple yet effective unsupervised method for extracting salient sentences from a cluster of documents. Our method combines sentence scores derived from several kinds of feature-based information: position, topic, keyword, semantic, entity, and sentence-centroid scores. The proposed method has the potential to generate large-scale pseudo-summaries that support summarization tasks. In this way, our approach can be combined with the pre-training objectives used in pre-trained language models to mitigate the lack of annotated datasets in low-resource languages such as Vietnamese. We also conducted experiments to verify the effectiveness of the individual feature-based scoring methods and their combinations. Experimental results on two well-known benchmark datasets, MultiNews and NewSHead, show that our method outperforms previous unsupervised approaches.
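The abstract does not give the scoring formulas, so the following is only a minimal sketch of the general idea of combining per-feature sentence scores, not the authors' method. It covers three of the six listed features (position, keyword, sentence centroid) with simple stand-ins. The function names, the bag-of-words centroid, the top-5 keyword cutoff, and the equal weights are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a string and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def position_score(index, total):
    """Assumed position feature: earlier sentences score higher (1.0 for the first)."""
    return 1.0 - index / total


def keyword_score(tokens, keywords):
    """Assumed keyword feature: fraction of a sentence's tokens that are keywords."""
    if not tokens:
        return 0.0
    return sum(1 for t in tokens if t in keywords) / len(tokens)


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_sentences(sentences, weights=(1 / 3, 1 / 3, 1 / 3), top_k_keywords=5):
    """Return sentence indices ordered from most to least salient.

    The final score is a weighted sum of the three feature scores;
    the equal default weights are an assumption, not a tuned setting.
    """
    tokenized = [tokenize(s) for s in sentences]
    # Centroid stand-in: term counts over the whole cluster of sentences.
    centroid = Counter(t for toks in tokenized for t in toks)
    # Keyword stand-in: the most frequent terms in the cluster.
    keywords = {t for t, _ in centroid.most_common(top_k_keywords)}
    scored = []
    for i, toks in enumerate(tokenized):
        features = (
            position_score(i, len(sentences)),
            keyword_score(toks, keywords),
            cosine(Counter(toks), centroid),
        )
        score = sum(w * f for w, f in zip(weights, features))
        scored.append((score, i))
    # Highest score first; ties are broken by original sentence order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [i for _, i in scored]
```

Taking the first few indices returned by `rank_sentences` would give an extractive pseudo-summary in the sense the abstract describes. The topic, semantic, and entity scores would need a topic model, sentence embeddings, and an entity recognizer respectively, which are left out here.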
Pages: 346-351
Number of pages: 6