Adapting Static and Contextual Representations for Policy Gradient-Based Summarization

被引:0
|
作者
Lin, Ching-Sheng [1 ]
Jwo, Jung-Sing [1 ,2 ]
Lee, Cheng-Hsiung [1 ]
机构
[1] Tunghai Univ, Master Program Digital Innovat, Taichung 40704, Taiwan
[2] Tunghai Univ, Dept Comp Sci, Taichung 40704, Taiwan
关键词
automatic text summarization; GloVe; BERT; GPT; unsupervised training; policy gradient reinforcement learning;
D O I
10.3390/s23094513
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Considering the ever-growing volume of electronic documents made available in our daily lives, the need for an efficient tool to capture their gist increases as well. Automatic text summarization, which is a process of shortening long text and extracting valuable information, has been of great interest for decades. Due to the difficulties of semantic understanding and the requirement of large training data, the development of this research field is still challenging and worth investigating. In this paper, we propose an automated text summarization approach with the adaptation of static and contextual representations based on an extractive approach to address the research gaps. To better obtain the semantic expression of the given text, we explore the combination of static embeddings from GloVe (Global Vectors) and the contextual embeddings from BERT (Bidirectional Encoder Representations from Transformer) and GPT (Generative Pre-trained Transformer) based models. In order to reduce human annotation costs, we employ policy gradient reinforcement learning to perform unsupervised training. We conduct empirical studies on the public dataset, Gigaword. The experimental results show that our approach achieves promising performance and is competitive with various state-of-the-art approaches.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] A multiagent deep deterministic policy gradient-based distributed protection method for distribution network
    Peng Zeng
    Shijie Cui
    Chunhe Song
    Zhongfeng Wang
    Guangye Li
    Neural Computing and Applications, 2023, 35 : 2267 - 2278
  • [42] A GRADIENT-BASED METHOD FOR TEAM EVASION
    Liu, Shih-Yuan
    Zhou, Zhengyuan
    Tomlin, Claire
    Hedrick, Karl
    ASME 2013 DYNAMIC SYSTEMS AND CONTROL CONFERENCE, VOL. 3, 2013,
  • [43] The Gradient-Based Cache Partitioning Algorithm
    Hasenplaugh, William
    Ahuja, Pritpal S.
    Jaleel, Aamer
    Steely, Simon, Jr.
    Emer, Joel
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2012, 8 (04)
  • [44] Robust Gradient-Based Markov Subsampling
    Gong, Tieliang
    Xi, Quanhan
    Xu, Chen
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 4004 - 4011
  • [45] Gradient-Based Competitive Learning: Theory
    Giansalvo Cirrincione
    Vincenzo Randazzo
    Pietro Barbiero
    Gabriele Ciravegna
    Eros Pasero
    Cognitive Computation, 2024, 16 : 608 - 623
  • [46] Average Gradient-Based Adversarial Attack
    Wan, Chen
    Huang, Fangjun
    Zhao, Xianfeng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 9572 - 9585
  • [47] Gradient-Based Competitive Learning: Theory
    Cirrincione, Giansalvo
    Randazzo, Vincenzo
    Barbiero, Pietro
    Ciravegna, Gabriele
    Pasero, Eros
    COGNITIVE COMPUTATION, 2024, 16 (02) : 608 - 623
  • [48] Gradient-based adaptive importance samplers
    Elvira, Victor
    Chouzenoux, Emilie
    Akyildiz, Omer Deniz
    Martino, Luca
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (13): : 9490 - 9514
  • [49] A skeletonization algorithm for gradient-based optimization
    Menten, Martin J.
    Paetzold, Johannes C.
    Zimmer, Veronika A.
    Shit, Suprosanna
    Ezhov, Ivan
    Holland, Robbie
    Probst, Monika
    Schnabel, Julia A.
    Rueckert, Daniel
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21337 - 21346
  • [50] GRADIENT-BASED BLOCK TRUNCATION CODING
    QUWEIDER, MK
    SALARI, E
    ELECTRONICS LETTERS, 1995, 31 (05) : 353 - 355