News headline generation based on improved decoder from transformer

被引:0
|
作者
Zhengpeng Li
Jiansheng Wu
Jiawei Miao
Xinmiao Yu
机构
[1] University of Science and Technology Liaoning,
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Most of the news headline generation models that use the sequence-to-sequence model or recurrent network have two shortcomings: the lack of parallel ability of the model and easily repeated generation of words. It is difficult to select the important words in news and reproduce these expressions, resulting in the headline that inaccurately summarizes the news. In this work, we propose a TD-NHG model, which stands for news headline generation based on an improved decoder from the transformer. The TD-NHG uses masked multi-head self-attention to learn the feature information of different representation subspaces of news texts and uses decoding selection strategy of top-k, top-p, and punishment mechanisms (repetition-penalty) in the decoding stage. We conducted a comparative experiment on the LCSTS dataset and CSTS dataset. Rouge-1, Rouge-2, and Rouge-L on the LCSTS dataset and CSTS dataset are 31.28/38.73, 12.68/24.97, and 28.31/37.47, respectively. The experimental results demonstrate that the proposed method can improve the accuracy and diversity of news headlines.
引用
收藏
相关论文
共 50 条
  • [31] Korean-English bilingual videotext recognition for news headline generation based on a split-merge strategy
    Cheolkon Jung
    Licheng Jiao
    Journal of Real-Time Image Processing, 2016, 11 : 167 - 177
  • [32] Korean-English bilingual videotext recognition for news headline generation based on a split-merge strategy
    Jung, Cheolkon
    Jiao, Licheng
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2016, 11 (01) : 167 - 177
  • [33] COBART: Controlled, Optimized, Bidirectional and Auto-Regressive Transformer for Ad Headline Generation
    Kanungo, Yashal Shakti
    Das, Gyanendra
    Pooja, A.
    Negi, Sumit
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 3127 - 3136
  • [34] From headline to lifeline: does news set agenda for policy?
    Grzeslo, Jenna
    Bai, Yang
    Wang, Ryan Yang
    Min, Bumgi
    Jayakar, Krishna
    DIGITAL POLICY REGULATION AND GOVERNANCE, 2019, 21 (04) : 352 - 368
  • [35] Quantum circuit generation for amplitude encoding using a transformer decoder
    Daimon, Shunsuke
    Matsushita, Yu-ichiro
    PHYSICAL REVIEW APPLIED, 2024, 22 (04):
  • [36] MolGPT: Molecular Generation Using a Transformer-Decoder Model
    Bagal, Viraj
    Aggarwal, Rishal
    Vinod, P. K.
    Priyakumar, U. Deva
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (09) : 2064 - 2076
  • [37] A Tree-Based Structure-Aware Transformer Decoder for Image-To-Markup Generation
    Zhong, Shuhan
    Song, Sizhe
    Li, Guanyao
    Chan, S-H Gary
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5751 - 5760
  • [38] Abstractive Financial News Summarization via Transformer-BiLSTM Encoder and Graph Attention-Based Decoder
    Li, Haozhou
    Peng, Qinke
    Mou, Xu
    Wang, Ying
    Zeng, Zeyuan
    Bashir, Muhammad Fiaz
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 3190 - 3205
  • [39] A Study of Chinese News Headline Classification Based on Keyword Feature Expansion
    Kai Miao
    Xin He
    Junyang Yu
    Guanghui Wang
    Yongchao Chen
    International Journal of Computational Intelligence Systems, 16
  • [40] A Study of Chinese News Headline Classification Based on Keyword Feature Expansion
    Miao, Kai
    He, Xin
    Yu, Junyang
    Wang, Guanghui
    Chen, Yongchao
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)