Intertopic Information Mining for Query-Based Summarization

被引:10
|
作者
Ouyang, You [1 ]
Li, Wenjie [1 ]
Li, Sujian [2 ]
Lu, Qin [1 ]
机构
[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
[2] Peking Univ, Minist Educ, Key Lab Computat Linguist, Beijing, Peoples R China
关键词
D O I
10.1002/asi.21299
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, the authors address the problem of sentence ranking in summarization. Although most existing summarization approaches are concerned with the information embodied in a particular topic (including a set of documents and an associated query) for sentence ranking, they propose a novel ranking approach that incorporates intertopic information mining. Intertopic information, in contrast to intratopic information, is able to reveal pairwise topic relationships and thus can be considered as the bridge across different topics. In this article, the intertopic information is used for transferring word importance learned from known topics to unknown topics under a learning-based summarization framework. To mine this information, the authors model the topic relationship by clustering all the words in both known and unknown topics according to various kinds of word conceptual labels, which indicate the roles of the words in the topic. Based on the mined relationships, we develop a probabilistic model using manually generated summaries provided for known topics to predict ranking scores for sentences in unknown topics. A series of experiments have been conducted on the Document Understanding Conference (DUC) 2006 data set. The evaluation results show that intertopic information is indeed effective for sentence ranking and the resultant summarization system performs comparably well to the best-performing DUC participating systems on the same data set.
引用
收藏
页码:1062 / 1072
页数:11
相关论文
共 50 条
  • [31] CoMSum and SIBERT: A Dataset and Neural Model for Query-Based Multi-document Summarization
    Kulkarni, Sayali
    Chammas, Sheide
    Zhu, Wan
    Sha, Fei
    Ie, Eugene
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II, 2021, 12822 : 84 - 98
  • [32] A Simple, Concise, Query-based Approach to News Article Summarization Using Sentence Scoring
    Thornton, Megan
    Gao, Sophie
    Ng, Yiu-Kai
    2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 951 - 958
  • [33] Query-Based Automatic Multi-document Summarization Extraction Method for Web Pages
    He, Qi
    Hao, Hong-Wei
    Yin, Xu-Cheng
    PROCEEDINGS OF THE 2011 2ND INTERNATIONAL CONGRESS ON COMPUTER APPLICATIONS AND COMPUTATIONAL SCIENCE, VOL 1, 2012, 144 : 107 - 112
  • [34] QVI: Query-based virtual index for distributed information retrieval
    Kim, DG
    Lee, SG
    INTERNATIONAL SOCIETY FOR COMPUTERS AND THEIR APPLICATIONS 13TH INTERNATIONAL CONFERENCE ON COMPUTERS AND THEIR APPLICATIONS, 1998, : 152 - 155
  • [35] URREF for Veracity Assessment in Query-Based Information Fusion Systems
    Blasch, Erik
    Aved, Alex
    2015 18TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2015, : 58 - 65
  • [36] Query-Based Extractive Text Summarization Using Sense-Oriented Semantic Relatedness Measure
    Rahman, Nazreena
    Borah, Bhogeswar
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, 49 (03) : 3751 - 3792
  • [37] Query-based multi-documents summarization using linguistic knowledge and content word expansion
    Abdi, Asad
    Idris, Norisma
    Alguliyev, Rasim M.
    Aliguliyev, Ramiz M.
    SOFT COMPUTING, 2017, 21 (07) : 1785 - 1801
  • [38] Query-Based Extractive Text Summarization Using Sense-Oriented Semantic Relatedness Measure
    Nazreena Rahman
    Bhogeswar Borah
    Arabian Journal for Science and Engineering, 2024, 49 : 3751 - 3792
  • [39] Query-based multi-documents summarization using linguistic knowledge and content word expansion
    Asad Abdi
    Norisma Idris
    Rasim M. Alguliyev
    Ramiz M. Aliguliyev
    Soft Computing, 2017, 21 : 1785 - 1801
  • [40] Query-Based Data Pricing
    Koutris, Paraschos
    Upadhyaya, Prasang
    Balazinska, Magdalena
    Howe, Bill
    Suciu, Dan
    JOURNAL OF THE ACM, 2015, 62 (05)