A Scoring Model Assisted by Frequency for Multi-Document Summarization

被引:0
|
作者
Yu, Yue [1 ,3 ]
Wu, Mutong [1 ]
Su, Weifeng [1 ,2 ]
Cheung, Yiu-ming [3 ]
机构
[1] Div Sci & Technol, Comp Sci & Technol Programme, Hefei, Peoples R China
[2] BNU HKBU United Int Coll, Guangdong Key Lab AI & Multimodal Data Proc, Zhuhai, Guangdong, Peoples R China
[3] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
关键词
Multiple document summarization; Position information; Frequency; Graph;
D O I
10.1007/978-3-030-86383-8_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While position information plays a significant role in sentence scoring of single document summarization, the repetition of content among different documents greatly impacts the salience scores of sentences in multi-document summarization. Introducing frequencies information can help identify important sentences which are generally ignored when only considering position information before. Therefore, in this paper, we propose a scoring model, SAFA (Self-Attention with Frequency Graph) which combines position information with frequency to identify the salience of sentences. The SAFA model constructs a frequency graph at the multi-document level based on the repetition of content of sentences, and assigns initial score values to each sentence based on the graph. The model then uses the position-aware gold scores to train a self-attention mechanism, obtaining the sentence significance at its single document level. The score of each sentence is updated by combing position and frequency information together. We train and test the SAFA model on the large-scale multi-document dataset Multi-News. The extensive experimental results show that the model incorporating frequency information in sentence scoring outperforms the other state-of-the-art extractive models.
引用
收藏
页码:309 / 320
页数:12
相关论文
共 50 条
  • [31] A novel approach to multi-document summarization
    Qiu, Li-Qing
    Pang, Bin
    Lin, Sai-Qun
    Chen, Peng
    DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 187 - +
  • [32] Hierarchical Transformers for Multi-Document Summarization
    Liu, Yang
    Lapata, Mirella
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5070 - 5081
  • [33] Tiered sentence based topic model for multi-document summarization
    Akhtar, Nadeem
    Beg, M. M. Sufyan
    Javed, Hira
    Hussain, Md Muzakkir
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2022, 43 (08): : 2131 - 2141
  • [34] A hybrid model for sentence ordering in extractive multi-document summarization
    Liu, Dexi
    Zhang, Zengchang
    He, Yanxiang
    Ji, Donghong
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2006, 4182 : 588 - 592
  • [35] An Improved LDA Multi-Document Summarization Model Based on TensorFlow
    Zhong, Ying
    Tang, Zhuo
    Ding, Xiaofei
    Zhu, Li
    Le, Yuquan
    Li, Kenli
    Li, Keqin
    2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 255 - 259
  • [36] MeanSum : A Neural Model for Unsupervised Multi-Document Abstractive Summarization
    Chu, Eric
    Liu, Peter J.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [37] Research On Multi-document Summarization Based On LDA Topic Model
    Bian, Jinqiang
    Jiang, Zengru
    Chen, Qian
    2014 SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL 2, 2014, : 113 - 116
  • [38] Multi-document Summarization by Creating Synthetic Document Vector Based on Language Model
    Kim, Dahae
    Lee, Jee-Hyoung
    2016 JOINT 8TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 17TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2016, : 605 - 609
  • [39] HierMDS: a hierarchical multi-document summarization model with global–local document dependencies
    Shuaimin Li
    Jungang Xu
    Neural Computing and Applications, 2023, 35 : 18553 - 18570
  • [40] Multi-document summarization based on lexical chains
    Chen, YM
    Wang, XL
    Liu, BQ
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 1937 - 1942