MULTI-DOCUMENT SUMMARIZATION OF EVALUATIVE TEXT

被引:44
|
作者
Carenini, Giuseppe [1 ]
Cheung, Jackie Chi Kit [2 ]
Pauls, Adam [3 ]
机构
[1] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1W5, Canada
[2] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
[3] Univ Calif Berkeley, Div Comp Sci, Berkeley, CA 94720 USA
基金
加拿大自然科学与工程研究理事会;
关键词
evaluative text; natural language processing; summarization; web intelligence;
D O I
10.1111/j.1467-8640.2012.00417.x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many decision-making scenarios, people can benefit from knowing what other people's opinions are. As more and more evaluative documents are posted on the Web, summarizing these useful resources becomes a critical task for many organizations and individuals. This paper presents a framework for summarizing a corpus of evaluative documents about a single entity by a natural language summary. We propose two summarizers: an extractive summarizer and an abstractive one. As an additional contribution, we show how our abstractive summarizer can be modified to generate summaries tailored to a model of the user preferences that is solidly grounded in decision theory and can be effectively elicited from users. We have tested our framework in three user studies. In the first one, we compared the two summarizers. They performed equally well relative to each other quantitatively, while significantly outperforming a baseline standard approach to multidocument summarization. Trends in the results as well as qualitative comments from participants suggest that the summarizers have different strengths and weaknesses. After this initial user study, we realized that the diversity of opinions expressed in the corpus (i.e., its controversiality) might play a critical role in comparing abstraction versus extraction. To clearly pinpoint the role of controversiality, we ran a second user study in which we controlled for the degree of controversiality of the corpora that were summarized for the participants. The outcome of this study indicates that for evaluative text abstraction tends to be more effective than extraction, particularly when the corpus is controversial. In the third user study we assessed the effectiveness of our user tailoring strategy. The results of this experiment confirm that user tailored summaries are more informative than untailored ones.
引用
收藏
页码:545 / 576
页数:32
相关论文
共 50 条
  • [1] Multi-document text summarization - A survey
    Tandel, Amol
    Modi, Brijesh
    Gupta, Priyasha
    Wagle, Shreya
    Khedkar, Sujata
    [J]. PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON DATA MINING AND ADVANCED COMPUTING (SAPIENCE), 2016, : 336 - 339
  • [2] Study on Multi-document Summarization Based on Text Segmentation
    Wang, Meng
    Tang, Xinlai
    Wang, Xiaorong
    [J]. JOURNAL OF COMPUTERS, 2014, 9 (05) : 1241 - 1246
  • [3] Literature Study on Multi-document Text Summarization Techniques
    Shah, Chintan
    Jivani, Anjali
    [J]. SMART TRENDS IN INFORMATION TECHNOLOGY AND COMPUTER COMMUNICATIONS, SMARTCOM 2016, 2016, 628 : 442 - 451
  • [4] Improving Multi-Document Summarization via Text Classification
    Cao, Ziqiang
    Li, Wenjie
    Li, Sujian
    Wei, Furu
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3053 - 3059
  • [5] Multi-document Text Summarization Using Sentence Extraction
    Ahuja, Ravinder
    Anand, Willson
    [J]. ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY COMPUTATIONS IN ENGINEERING SYSTEMS, ICAIECES 2016, 2017, 517 : 235 - 242
  • [6] Multi-Document Text Summarization for Competitor Intelligence : A Methodology
    Chakraborti, Swapnajit
    Dey, Shubhamoy
    [J]. PROCEEDINGS OF 2014 2ND INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL AND BUSINESS INTELLIGENCE (ISCBI), 2014, : 97 - 100
  • [7] Deep Learning in the Domain of Multi-Document Text Summarization
    Roul, Rajendra Kumar
    Sahoo, Jajati Keshari
    Goel, Rohan
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 575 - 581
  • [8] A NEW MODEL FOR ARABIC MULTI-DOCUMENT TEXT SUMMARIZATION
    Abu Maria, Khulood
    Jaber, Khalid Mohammad
    Ibrahim, Mossab Nabil
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2018, 14 (04): : 1443 - 1452
  • [9] A Multi-Document Coverage Reward for RELAXed Multi-Document Summarization
    Parnell, Jacob
    Unanue, Inigo Jauregi
    Piccardi, Massimo
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 5112 - 5128
  • [10] Multi-document summarization based on link analysis and text classification
    Wu, JQ
    Wu, YZ
    Liu, J
    Zhuang, YT
    [J]. DIGITAL LIBRARIES: INTERNATIONAL COLLABORATION AND CROSS-FERTILIZATION, PROCEEDINGS, 2004, 3334 : 649 - 649