A Scoring Model Assisted by Frequency for Multi-Document Summarization

Cited by: 0
Authors
Yu, Yue [1, 3]
Wu, Mutong [1]
Su, Weifeng [1, 2]
Cheung, Yiu-ming [3]
Affiliations
[1] Div Sci & Technol, Comp Sci & Technol Programme, Hefei, Peoples R China
[2] BNU HKBU United Int Coll, Guangdong Key Lab AI & Multimodal Data Proc, Zhuhai, Guangdong, Peoples R China
[3] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
Keywords
Multiple document summarization; Position information; Frequency; Graph;
DOI
10.1007/978-3-030-86383-8_25
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
While position information plays a significant role in sentence scoring for single-document summarization, the repetition of content across documents strongly affects the salience scores of sentences in multi-document summarization. Introducing frequency information can help identify important sentences that are generally overlooked when only position information is considered. Therefore, in this paper, we propose a scoring model, SAFA (Self-Attention with Frequency Graph), which combines position information with frequency to identify the salience of sentences. The SAFA model constructs a frequency graph at the multi-document level based on the repetition of content among sentences, and assigns an initial score to each sentence based on the graph. The model then uses position-aware gold scores to train a self-attention mechanism, obtaining the significance of each sentence at the single-document level. The score of each sentence is updated by combining position and frequency information. We train and test the SAFA model on the large-scale multi-document dataset Multi-News. Extensive experimental results show that the model incorporating frequency information in sentence scoring outperforms other state-of-the-art extractive models.
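The idea of combining a cross-document frequency graph with a position signal can be illustrated with a small sketch. This is not the SAFA implementation: the abstract does not specify how the graph is built or how the scores are merged, and the trained self-attention component is replaced here by a simple position prior. The Jaccard similarity, the `threshold` cutoff, the `1 / (index + 1)` position prior, the `alpha` mixing weight, and all function names are assumptions introduced only for illustration.

```python
import re
from itertools import combinations


def tokenize(sentence):
    """Lowercase a sentence and return its set of word tokens."""
    return set(re.findall(r"[a-z0-9]+", sentence.lower()))


def frequency_scores(documents, threshold=0.5):
    """Score sentences by how strongly their content recurs in other documents.

    documents: list of documents, each a list of sentence strings.
    Returns a dict mapping (doc_index, sent_index) to a score in [0, 1].
    Assumption: an edge links two sentences from different documents when
    their Jaccard token overlap reaches `threshold`.
    """
    nodes = [(d, s) for d, doc in enumerate(documents) for s in range(len(doc))]
    tokens = {(d, s): tokenize(documents[d][s]) for d, s in nodes}
    degree = {node: 0.0 for node in nodes}
    for a, b in combinations(nodes, 2):
        if a[0] == b[0]:
            continue  # only count repetition across different documents
        union = tokens[a] | tokens[b]
        if not union:
            continue  # both sentences have no tokens
        similarity = len(tokens[a] & tokens[b]) / len(union)
        if similarity >= threshold:
            degree[a] += similarity
            degree[b] += similarity
    top = max(degree.values(), default=0.0)
    # Normalize the weighted degree so the most repeated sentence scores 1.0.
    return {n: (v / top if top > 0 else 0.0) for n, v in degree.items()}


def position_scores(documents):
    """Hypothetical position prior: earlier sentences in a document score higher."""
    return {
        (d, s): 1.0 / (s + 1)
        for d, doc in enumerate(documents)
        for s in range(len(doc))
    }


def combined_scores(documents, alpha=0.5, threshold=0.5):
    """Linearly mix position and frequency scores (alpha weights position)."""
    freq = frequency_scores(documents, threshold)
    pos = position_scores(documents)
    return {n: alpha * pos[n] + (1 - alpha) * freq[n] for n in freq}
```

With a position prior alone, a sentence that appears late in its document always ranks below that document's lead sentence. In this sketch, a late sentence whose content is repeated in other documents can overtake an unrepeated lead sentence, which is the effect the abstract attributes to frequency information.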
Pages: 309-320
Number of pages: 12