Sentiment Lossless Summarization

被引:0
|
作者
Li, Xiaodong [1 ]
Wu, Pangjing [1 ]
Zou, Chenxin [1 ]
Xie, Haoran [2 ]
Wang, Fu Lee [3 ]
机构
[1] Hohai Univ, Coll Comp & Informat, Nanjing, Peoples R China
[2] Lingnan Univ, Dept Comp & Decis Sci, Hong Kong, Peoples R China
[3] Open Univ Hong Kong, Sch Sci & Technol, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph-based summarization; Extractive summarization; Sentiment analysis; TEXT; MODEL;
D O I
10.1016/j.knosys.2021.107170
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The aim of automatic text summarization (ATS) is to extract representative texts from documents and keep major points of the extracted texts consistent with the original documents. However, most existing studies ignore sentimental information loss in the summarization process, which leads to sentiment loss summarization. To address the sentiment loss issue during summarization, we introduce a sentiment compensation mechanism into document summarization and propose a graph-based extractive summarization approach named Sentiment Lossless Summarization (SLS). SLS first creates a graph representation for a document to obtain the importance score (i.e., literal indicator) of each sentence. Second, sentiment dictionaries are leveraged to analyze the sentence sentiments. Third, during each summarization iteration, the sentences with the lowest scores are iteratively removed, and the sentiment compensation weights of the remaining sentences are updated. With the help of sentiment compensation during the summarization process, sentiment consistencies between candidate summaries and the original documents are maintained. Intrinsic evaluations conducted on the DUC2001, DUC2002, DUC2004, and Multi-News datasets demonstrate that our approach outperforms baselines and state-of-the-art summarization methods in terms of Recall-Oriented Understudy for Gisting Evaluation (ROUGE) scores. Additionally, to further evaluate SLS performance in sentiment retention, extrinsic evaluations are introduced, and summary quality in terms of sentiment loss is evaluated by measuring the prediction accuracy for sentiment polarities of either movie (IMDb dataset) or product (Amazon dataset) review summaries. The experimental results demonstrate that our approach can improve prediction accuracy by at most 6% compared to the baseline. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Incremental Lossless Graph Summarization
    Ko, Jihoon
    Kook, Yunbum
    Shin, Kijung
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 317 - 327
  • [2] SLUGGER: Lossless Hierarchical Summarization of Massive Graphs
    Lee, Kyuhan
    Ko, Jihoon
    Shin, Kijung
    [J]. 2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 472 - 484
  • [3] Sentiment Analysis and Summarization of Twitter Data
    Bahrainian, Seyed-Ali
    Dengel, Andreas
    [J]. 2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 227 - 234
  • [4] A survey on review summarization and sentiment classification
    Nagsen Komwad
    Paras Tiwari
    Banoth Praveen
    C. Ravindranath Chowdary
    [J]. Knowledge and Information Systems, 2022, 64 : 2289 - 2327
  • [5] Visual Sentiment Summarization of Movie Reviews
    Na, Jin-Cheon
    Thet, Tun Thura
    Khoo, Christopher S. G.
    Kyaing, Wai Yan Min
    [J]. DIGITAL LIBRARIES: FOR CULTURAL HERITAGE, KNOWLEDGE DISSEMINATION, AND FUTURE CREATION: ICADL 2011, 2011, 7008 : 277 - 287
  • [6] Sentiment Diversification for Short Review Summarization
    Al-Dhelaan, Mohammed
    Al-Suhaim, Abeer
    [J]. 2017 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2017), 2017, : 723 - 729
  • [7] A survey on review summarization and sentiment classification
    Komwad, Nagsen
    Tiwari, Paras
    Praveen, Banoth
    Chowdary, C. Ravindranath
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (09) : 2289 - 2327
  • [8] Lossless Graph Summarization using Dense Subgraphs Discovery
    Khan, Kifayat Ullah
    Nawaz, Waqas
    Lee, Young-Koo
    [J]. ACM IMCOM 2015, PROCEEDINGS, 2015,
  • [9] Topic and sentiment aware microblog summarization for twitter
    Ali, Syed Muhammad
    Noorian, Zeinab
    Bagheri, Ebrahim
    Ding, Chen
    Al-Obeidat, Feras
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2020, 54 (01) : 129 - 156
  • [10] Application of Summarization and Sentiment Analysis in the Tourism domain
    Premakumara, Nilantha
    Shiranthika, C.
    Welideniya, Praneeth
    Bandara, Chamath
    Prasad, Ishanka
    Sumathipala, Sagara
    [J]. 2019 IEEE 5TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2019,