News headline generation based on improved decoder from transformer

被引:0
|
作者
Zhengpeng Li
Jiansheng Wu
Jiawei Miao
Xinmiao Yu
机构
[1] University of Science and Technology Liaoning,
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Most of the news headline generation models that use the sequence-to-sequence model or recurrent network have two shortcomings: the lack of parallel ability of the model and easily repeated generation of words. It is difficult to select the important words in news and reproduce these expressions, resulting in the headline that inaccurately summarizes the news. In this work, we propose a TD-NHG model, which stands for news headline generation based on an improved decoder from the transformer. The TD-NHG uses masked multi-head self-attention to learn the feature information of different representation subspaces of news texts and uses decoding selection strategy of top-k, top-p, and punishment mechanisms (repetition-penalty) in the decoding stage. We conducted a comparative experiment on the LCSTS dataset and CSTS dataset. Rouge-1, Rouge-2, and Rouge-L on the LCSTS dataset and CSTS dataset are 31.28/38.73, 12.68/24.97, and 28.31/37.47, respectively. The experimental results demonstrate that the proposed method can improve the accuracy and diversity of news headlines.
引用
收藏
相关论文
共 50 条
  • [41] A Multiple Learning Model Based Voting System for News Headline Classification
    Zhu, Fenhong
    Dong, Xiaozheng
    Song, Rui
    Hong, Yu
    Zhu, Qiaoming
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2017, 2018, 10619 : 797 - 806
  • [42] Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation
    Liu, Dayiheng
    Gong, Yeyun
    Yan, Yu
    Fu, Jie
    Shao, Bo
    Jiang, Daxin
    Lv, Jiancheng
    Duan, Nan
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 6241 - 6250
  • [43] Recurrent Glimpse-based Decoder for Detection with Transformer
    Chen, Zhe
    Zhang, Jing
    Tao, Dacheng
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5250 - 5259
  • [44] A Study of English-Chinese News Headline Translation——from the Perspective of Memetics
    赵翠娟
    现代妇女(下旬), 2014, (04) : 240+246 - 240
  • [45] Synthetic CT generation based on CBCT using improved vision transformer CycleGAN
    Hu, Yuxin
    Zhou, Han
    Cao, Ning
    Li, Can
    Hu, Can
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [46] Improving news headline text generation quality through frequent POS-Tag patterns analysis
    Fatima, Noureen
    Daudpota, Sher Muhammad
    Kastrati, Zenun
    Imran, Ali Shariq
    Hassan, Saif
    Elmitwally, Nouh Sabri
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
  • [47] Improve Shallow Decoder Based Transformer with Structured Expert Prediction
    Wang, Zongbing
    Han, Jingru
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT VII, 2024, 15022 : 224 - 234
  • [48] TransCFD: A transformer-based decoder for flow field prediction
    Jiang, Jundou
    Li, Guanxiong
    Jiang, Yi
    Zhang, Laiping
    Deng, Xiaogang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [49] Improved Polar Decoder Based on Deep Learning
    Xu, Weihong
    Wu, Zhizhen
    Ueng, Yeong-Luh
    You, Xiaohu
    Zhang, Chuan
    2017 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2017,
  • [50] Transformer-based image generation from scene graphs
    Sortino, Renato
    Palazzo, Simone
    Rundo, Francesco
    Spampinato, Concetto
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 233