Multi-granularity adaptive extractive document summarization with heterogeneous graph neural networks

被引:0
|
作者
Su, Wu [1 ]
Jiang, Jin [1 ]
Huang, Kaihui [1 ]
机构
[1] School of Automation and Electronic Information, Xiangtan University, Hunan Province, Xiangtan, China
关键词
Backpropagation - Benchmarking - Data mining - Probability distributions - Semantics;
D O I
10.7717/PEERJ-CS.1737
中图分类号
学科分类号
摘要
The crucial aspect of extractive document summarization lies in understanding the interrelations between sentences. Documents inherently comprise a multitude of sentences, and sentence-level models frequently fail to consider the relationships between distantly-placed sentences, resulting in the omission of significant information in the summary. Moreover, information within documents tends to be distributed sparsely, challenging the efficacy of sentence-level models. In the realm of heterogeneous graph neural networks, it has been observed that semantic nodes with varying levels of granularity encapsulate distinct semantic connections. Initially, the incorporation of edge features into the computation of dynamic graph attention networks is performed to account for node relationships. Subsequently, given the multiplicity of topics in a document or a set of documents, a topic model is employed to extract topic-specific features and the probability distribution linking these topics with sentence nodes. Last but not least, the model defines nodes with different levels of granularity-ranging from documents and topics to sentences-and these various nodes necessitate different propagation widths and depths for capturing intricate relationships in the information being disseminated. Adaptive measures are taken to learn the importance and correlation between nodes of different granularities in terms of both width and depth. Experimental evidence from two benchmark datasets highlights the superior performance of the proposed model, as assessed by ROUGE metrics, in comparison to existing approaches, even in the absence of pre-trained language models. Additionally, an ablation study confirms the positive impact of each individual module on the model’s ROUGE scores. © 2023, Su et al. All rights reserved.
引用
收藏
相关论文
共 50 条
  • [1] Multi-granularity adaptive extractive document summarization with heterogeneous graph neural networks
    Su, Wu
    Jiang, Jin
    Huang, Kaihui
    [J]. PEERJ, 2023, 11
  • [2] Multi-granularity heterogeneous graph attention networks for extractive document summarization
    Zhao, Yu
    Wang, Leilei
    Wang, Cui
    Du, Huaming
    Wei, Shaopeng
    Feng, Huali
    Yu, Zongjian
    Li, Qing
    [J]. NEURAL NETWORKS, 2022, 155 : 340 - 347
  • [3] A Multi-Granularity Heterogeneous Graph for Extractive Text Summarization
    Zhao, Henghui
    Zhang, Wensheng
    Huang, Mengxing
    Feng, Siling
    Wu, Yuanyuan
    [J]. ELECTRONICS, 2023, 12 (10)
  • [4] MULTI-GRANULARITY HETEROGENEOUS GRAPH FOR DOCUMENT-LEVEL RELATION EXTRACTION
    Tang, Hengzhu
    Cao, Yanan
    Zhang, Zhenyu
    Jia, Ruipeng
    Fang, Fang
    Wang, Shi
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7683 - 7687
  • [5] Extractive Document Summarization Based on Convolutional Neural Networks
    Zhang, Yong
    Er, Meng Joo
    Pratama, Mahardhika
    [J]. PROCEEDINGS OF THE IECON 2016 - 42ND ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2016, : 918 - 922
  • [6] GNN-MgrPool: Enhanced graph neural networks with multi-granularity pooling for graph classification
    Sun, Haichao
    Wang, Guoyin
    Liu, Qun
    Guo, Yike
    [J]. INFORMATION SCIENCES, 2024, 680
  • [7] Multi-document extractive summarization using semantic graph
    del Camino Valle, Oleyda
    Simon-Cuevas, Alfredo
    Valladares-Valdes, Eduardo
    Olivas, Jose A.
    Romero, Francisco P.
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2019, (63): : 103 - 110
  • [8] Neural Extractive Summarization with Hierarchical Attentive Heterogeneous Graph Network
    Jia, Ruipeng
    Cao, Yanan
    Tang, Hengzhu
    Fang Fang
    Cong Cao
    Shi Wang
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 3622 - 3631
  • [9] MHGEE: Event Extraction via Multi-granularity Heterogeneous Graph
    Zhang, Mingyu
    Fang, Fang
    Li, Hao
    Liu, Qingyun
    Li, Yangchun
    Wang, Hailong
    [J]. COMPUTATIONAL SCIENCE - ICCS 2022, PT I, 2022, : 473 - 487
  • [10] Extractive multi-document summarization using multilayer networks
    Tohalino, Jorge V.
    Amancio, Diego R.
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 503 : 526 - 539