Event graphs for information retrieval and multi-document summarization

被引:62
|
作者
Glavas, Goran [1 ]
Snajder, Jan [1 ]
机构
[1] Univ Zagreb, Fac Elect Engn & Comp, Text Anal & Knowledge Engn Lab, Zagreb 10000, Croatia
关键词
Event extraction; Information extraction; Information retrieval; Multi-document summarization; Natural language processing; VECTOR-SPACE MODEL;
D O I
10.1016/j.eswa.2014.04.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the number of documents describing real-world events and event-oriented information needs rapidly growing on a daily basis, the need for efficient retrieval and concise presentation of event-related information is becoming apparent. Nonetheless, the majority of information retrieval and text summarization methods rely on shallow document representations that do not account for the semantics of events. In this article, we present event graphs, a novel event-based document representation model that filters and structures the information about events described in text. To construct the event graphs, we combine machine learning and rule-based models to extract sentence-level event mentions and determine the temporal relations between them. Building on event graphs, we present novel models for information retrieval and multi-document summarization. The information retrieval model measures the similarity between queries and documents by computing graph kernels over event graphs. The extractive multi-document summarization model selects sentences based on the relevance of the individual event mentions and the temporal structure of events. Experimental evaluation shows that our retrieval model significantly outperforms well-established retrieval models on event-oriented test collections, while the summarization model outperforms competitive models from shared multi-document summarization tasks. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:6904 / 6916
页数:13
相关论文
共 50 条
  • [1] Personalized Multi-Document Summarization in information retrieval
    Yang, Xiao-Peng
    Liu, Xiao-Rong
    [J]. PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 4108 - +
  • [2] Multi-document summarization as applied in information retrieval
    Zhou, Dan
    Li, Lei
    [J]. PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (NLP-KE'07), 2007, : 203 - +
  • [3] Two-phase Multi-document Event Summarization on Core Event Graphs
    Chen, Zengjian
    Xu, Jin
    Liao, Meng
    Xue, Tong
    He, Kun
    [J]. Journal of Artificial Intelligence Research, 2022, 74 : 1037 - 1057
  • [4] Two-phase Multi-document Event Summarization on Core Event Graphs
    Chen, Zengjian
    Xu, Jin
    Liao, Meng
    Xue, Tong
    He, Kun
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 74 : 1037 - 1057
  • [5] Multi-Document Summarization by Information Distance
    Long, Chong
    Huang, Minlie
    Zhu, Xiaoyan
    Li, Ming
    [J]. 2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 866 - +
  • [6] Identification of Event and Topic for Multi-document Summarization
    Fukumoto, Fumiyo
    Suzuki, Yoshimi
    Takasu, Atsuhiro
    Matsuyoshi, Suguru
    [J]. HUMAN LANGUAGE TECHNOLOGY: CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, 2016, 9561 : 304 - 316
  • [7] Multi-document summarization for terrorism information extraction
    Wang, Fu Lee
    Yang, Christopher C.
    Shi, Xiaodong
    [J]. INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2006, 3975 : 602 - 608
  • [8] A Multi-Document Coverage Reward for RELAXed Multi-Document Summarization
    Parnell, Jacob
    Unanue, Inigo Jauregi
    Piccardi, Massimo
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 5112 - 5128
  • [9] MULTI-DOCUMENT VIDEO SUMMARIZATION
    Wang, Feng
    Merialdo, Bernard
    [J]. ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 1326 - 1329
  • [10] On redundancy in multi-document summarization
    Calvo, Hiram
    Carrillo-Mendoza, Pabel
    Gelbukh, Alexander
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (05) : 3245 - 3255