Opinion Summarization via Submodular Information Measures

被引:0
|
作者
Zhao, Yang [1 ]
Chow, Tommy W. S. [1 ]
机构
[1] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Mutual information; Feature extraction; Optimization; Measurement; Data mining; Entropy; Transformers; Opinion mining; opinion summarization; sentiment analysis; submodular information measures; submodular optimization; SET;
D O I
10.1109/TKDE.2023.3235337
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on opinion summarization for constructing subjective and concise summaries representing essential opinions of online text reviews. As previous works rarely focus on the relationship between opinions, topics, and sentences, we propose a set of new requirements for Opinion-Topic-Sentence, which are essential for performing opinion summarization. We prove that Opinion-Topic-Sentence can be theoretically analyzed by submodular information measures. Thus, our proposed method can reduce redundant information, strengthen the relevance to given topics, and informatively represent the underlying emotional variations. While conventional methods require human-labeled topics for extractive summarization, we use unsupervised topic modeling methods to generate topic features. We propose four submodular functions and two optimization algorithms with proven performance bounds that can maximize opinion summarization's utility. An automatic evaluation metric, Topic-based Opinion Variance, is also derived to compensate for ROUGE-based metrics of opinion summarization evaluation. Four large, diversified, and representative corpora, OPOSUM, Opinosis, Yelp, and Amazon reviews, are used in our study. The results on these online review texts corroborate the efficacy of our proposed metric and framework.
引用
收藏
页码:11708 / 11721
页数:14
相关论文
共 50 条
  • [21] Fast Constrained Submodular Maximization: Personalized Data Summarization
    Mirzasoleiman, Baharan
    Badanidiyuru, Ashwinkumar
    Karbasi, Amin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [22] apricot: Submodular selection for data summarization in Python']Python
    Schreiber, Jacob
    Bilmes, Jeffrey
    Noble, William Stafford
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [23] Streaming Submodular Maximization: Massive Data Summarization on the Fly
    Badanidiyuru, Ashwinkumar
    Mirzasoleiman, Baharan
    Karbasi, Amin
    Krause, Andreas
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 671 - 680
  • [24] DIAGNOSE: Avoiding Out-of-Distribution Data Using Submodular Information Measures
    Kothawade, Suraj
    Shrivastava, Akshit
    Iyer, Venkat
    Ramakrishnan, Ganesh
    Iyer, Rishabh
    MEDICAL IMAGE LEARNING WITH LIMITED AND NOISY DATA (MILLAND 2022), 2022, 13559 : 141 - 150
  • [25] Learning Mixtures of Submodular Functions for Image Collection Summarization
    Tschiatschek, Sebastian
    Iyer, Rishabh
    Wei, Haochen
    Bilmes, Jeff
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [26] Exploiting comments information to improve legal public opinion news abstractive summarization
    HUANG Yuxin
    YU Zhengtao
    XIANG Yan
    YU Zhiqiang
    GUO Junjun
    Frontiers of Computer Science, 2022, 16 (06)
  • [27] Exploiting comments information to improve legal public opinion news abstractive summarization
    Yuxin Huang
    Zhengtao Yu
    Yan Xiang
    Zhiqiang Yu
    Junjun Guo
    Frontiers of Computer Science, 2022, 16
  • [28] Exploiting comments information to improve legal public opinion news abstractive summarization
    Huang, Yuxin
    Yu, Zhengtao
    Xiang, Yan
    Yu, Zhiqiang
    Guo, Junjun
    FRONTIERS OF COMPUTER SCIENCE, 2022, 16 (06)
  • [29] Opinion summarization on spontaneous conversations
    Wang, Dong
    Liu, Yang
    COMPUTER SPEECH AND LANGUAGE, 2015, 34 (01): : 61 - 82
  • [30] Evaluating Opinion Summarization in Ranking
    Singh, Anil Kumar
    Thawani, Avijit
    Gupta, Anubhav
    Mundotiya, Rajesh Kumar
    INFORMATION RETRIEVAL TECHNOLOGY, AIRS 2017, 2017, 10648 : 222 - 234