Intertopic Information Mining for Query-Based Summarization

Citations: 10
Authors
Ouyang, You [1]
Li, Wenjie [1]
Li, Sujian [2]
Lu, Qin [1]
Affiliations
[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
[2] Peking Univ, Minist Educ, Key Lab Computat Linguist, Beijing, Peoples R China
DOI
10.1002/asi.21299
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
In this article, the authors address the problem of sentence ranking in summarization. Most existing summarization approaches rank sentences using only the information embodied in a particular topic (a set of documents and an associated query). In contrast, the authors propose a novel ranking approach that incorporates intertopic information mining. Intertopic information, unlike intratopic information, reveals pairwise topic relationships and can thus serve as a bridge across different topics. Here, it is used to transfer word importance learned from known topics to unknown topics under a learning-based summarization framework. To mine this information, the authors model topic relationships by clustering all the words in both known and unknown topics according to various kinds of word conceptual labels, which indicate the roles of the words in the topic. Based on the mined relationships, the authors develop a probabilistic model that uses manually generated summaries provided for known topics to predict ranking scores for sentences in unknown topics. A series of experiments was conducted on the Document Understanding Conference (DUC) 2006 data set. The evaluation results show that intertopic information is indeed effective for sentence ranking and that the resulting summarization system performs comparably to the best-performing DUC participating systems on the same data set.
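The transfer idea in the abstract can be illustrated with a toy Python sketch. It is not the authors' model. The three labels (`query`, `frequent`, `other`), the frequency threshold, the topic dictionaries, and all function names are assumptions made up for illustration. The sketch estimates, from known topics with human summaries, how likely a word with a given label is to appear in a summary, then scores sentences of an unknown topic by the average of those label-level estimates.

```python
from collections import defaultdict

def label_of(word, query_words, doc_freq):
    """Hypothetical conceptual label: the role a word plays in its topic."""
    if word in query_words:
        return "query"
    if doc_freq.get(word, 0) >= 2:  # threshold chosen arbitrarily
        return "frequent"
    return "other"

def learn_label_importance(known_topics):
    """Estimate P(word appears in a human summary | label) over known topics."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for topic in known_topics:
        for word in topic["doc_freq"]:
            label = label_of(word, topic["query"], topic["doc_freq"])
            totals[label] += 1
            if word in topic["summary_words"]:
                hits[label] += 1
    return {label: hits[label] / totals[label] for label in totals}

def score_sentence(sentence, topic, importance):
    """Average the transferred label importance over the sentence's words."""
    words = sentence.lower().split()
    if not words:
        return 0.0
    total = sum(
        importance.get(label_of(w, topic["query"], topic["doc_freq"]), 0.0)
        for w in words
    )
    return total / len(words)

# Toy known topic with a manually written summary (invented data).
known_topics = [{
    "query": {"flood"},
    "doc_freq": {"flood": 3, "river": 2, "damage": 2, "tuesday": 1, "said": 1},
    "summary_words": {"flood", "river"},
}]

# Toy unknown topic: no summary is available, only documents and a query.
unknown_topic = {
    "query": {"earthquake"},
    "doc_freq": {"earthquake": 4, "rescue": 2, "officials": 1, "reported": 1},
}

importance = learn_label_importance(known_topics)
sentences = ["officials reported", "earthquake rescue"]
ranked = sorted(
    sentences,
    key=lambda s: score_sentence(s, unknown_topic, importance),
    reverse=True,
)
print(importance)
print(ranked)
```

Because importance is attached to labels rather than to the words themselves, the word "earthquake" can receive a score even though it never occurs in the known topic. That is the sense in which labels act as a bridge between topics in this sketch.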
Pages: 1062-1072
Number of pages: 11