Opinion Summarization via Submodular Information Measures

被引:0
|
作者
Zhao, Yang [1 ]
Chow, Tommy W. S. [1 ]
机构
[1] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Mutual information; Feature extraction; Optimization; Measurement; Data mining; Entropy; Transformers; Opinion mining; opinion summarization; sentiment analysis; submodular information measures; submodular optimization; SET;
D O I
10.1109/TKDE.2023.3235337
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on opinion summarization for constructing subjective and concise summaries representing essential opinions of online text reviews. As previous works rarely focus on the relationship between opinions, topics, and sentences, we propose a set of new requirements for Opinion-Topic-Sentence, which are essential for performing opinion summarization. We prove that Opinion-Topic-Sentence can be theoretically analyzed by submodular information measures. Thus, our proposed method can reduce redundant information, strengthen the relevance to given topics, and informatively represent the underlying emotional variations. While conventional methods require human-labeled topics for extractive summarization, we use unsupervised topic modeling methods to generate topic features. We propose four submodular functions and two optimization algorithms with proven performance bounds that can maximize opinion summarization's utility. An automatic evaluation metric, Topic-based Opinion Variance, is also derived to compensate for ROUGE-based metrics of opinion summarization evaluation. Four large, diversified, and representative corpora, OPOSUM, Opinosis, Yelp, and Amazon reviews, are used in our study. The results on these online review texts corroborate the efficacy of our proposed metric and framework.
引用
收藏
页码:11708 / 11721
页数:14
相关论文
共 50 条
  • [31] Opinion Summarization of Web Comments
    Potthast, Martin
    Becker, Steffen
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2010, 5993 : 668 - 669
  • [32] Convex Aggregation for Opinion Summarization
    Iso, Hayate
    Wang, Xiaolan
    Suhara, Yoshihiko
    Angelidis, Stefanos
    Tan, Wang-Chiew
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 3885 - 3903
  • [33] Attributable and Scalable Opinion Summarization
    Hosking, Tom
    Tang, Hao
    Lapata, Mirella
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 8488 - 8505
  • [34] Compact Explanatory Opinion Summarization
    Kim, Hyun Duk
    Castellanos, Malu
    Hsu, Meichun
    Zhai, ChengXiang
    Dayal, Umeshwar
    Ghosh, Riddhiman
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1697 - 1702
  • [35] Opinion on the Measures of Protecting the Personal Information Right of Consumers
    Jin, Jin
    2015 4th International Conference on Social Sciences and Society (ICSSS 2015), Pt 3, 2015, 72 : 119 - 122
  • [36] Informative and Controllable Opinion Summarization
    Amplayo, Reinald Kim
    Lapata, Mirella
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2662 - 2672
  • [37] PRISM: A Rich Class of Parameterized Submodular Information Measures for Guided Data Subset Selection
    Kothawade, Suraj
    Kaushal, Vishal
    Ramakrishnan, Ganesh
    Bilmes, Jeff
    Iyer, Rishabh
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 10238 - 10246
  • [38] ORIENT: Submodular Mutual Information Measures for Data Subset Selection under Distribution Shift
    Karanam, Athresh
    Killamsetty, Krishnateja
    Kokel, Harsha
    Iyer, Rishabh K.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [39] Data Summarization at Scale: A Two-Stage Submodular Approach
    Mitrovic, Marko
    Kazemi, Ehsan
    Zadimoghaddam, Morteza
    Karbasi, Amin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [40] Deep submodular network: An application to multi-document summarization
    Ghadimi, Alireza
    Beigy, Hamid
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 152 (152)