Graphmax for Text Generation

被引:0
|
作者
Liu, Bin [1 ]
Yin, Guosheng [2 ]
机构
[1] Southwestern Univ Finance & Econ, Ctr Stat Res, Sch Stat, Chengdu, Peoples R China
[2] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In text generation, a large language model (LM) makes a choice of each new word based only on the former selection of its context using the softmax function. Nevertheless, the link statistics information of concurrent words based on a scene-specific corpus is valuable in choosing the next word, which can help to ensure the topic of the generated text to be aligned with the current task. To fully explore the co-occurrence information, we propose a graphmax function for task-specific text generation. Using the graph-based regularization, graphmax enables the final word choice to be determined by both the global knowledge from the LM and the local knowledge from the scene-specific corpus. The traditional softmax function is regularized with a graph total variation (GTV) term, which incorporates the local knowledge into the LM and encourages the model to consider the statistical relationships between words in a scene-specific corpus. The proposed graphmax is versatile and can be readily plugged into any large pre-trained LM for text generation and machine translation. Through extensive experiments, we demonstrate that the new GTV-based regularization can improve performances in various natural language processing (NLP) tasks in comparison with existing methods. Moreover, through human experiments, we observe that participants can easily distinguish the text generated by graphmax or softmax.
引用
收藏
页码:823 / 848
页数:26
相关论文
共 50 条
  • [21] Probabilistic Approaches for Modeling Text Structure and Their Application to Text-to-Text Generation
    Barzilay, Regina
    EMPIRICAL METHODS IN NATURAL LANGUAGE GENERATION: DATA-ORIENTED METHODS AND EMPIRICAL EVALUATION, 2010, 5790 : 1 - 12
  • [22] Text Mining and Generation (TMG)
    CEUR Workshop Proceedings, 2023, 3438
  • [23] Text Generation From Tables
    Bao, Junwei
    Tang, Duyu
    Duan, Nan
    Yan, Zhao
    Zhou, Ming
    Zhao, Tiejun
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (02) : 311 - 320
  • [24] Text Fingerprint Key Generation
    Hassanein, Mohamed Sameh
    Ghinea, Gheorghita
    2012 INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS, 2012, : 603 - 609
  • [25] Stochastic text generation - Discussion
    Nicolov, N
    Oberlander, J
    Rosenfeld, R
    McKeown, KR
    Jones, KIBS
    Pereira, F
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2000, 358 (1769): : 1386 - 1387
  • [26] Text Generation in Discrete Space
    Hu, Ting
    Meinel, Christoph
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 721 - 732
  • [27] Video Generation from Text
    Li, Yitong
    Min, Martin Renqiang
    Shen, Dinghan
    Carlson, David
    Carin, Lawrence
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7065 - 7072
  • [28] Uniform Complexity for Text Generation
    Imperial, Joseph Marvin
    Madabushi, Harish Tayyar
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 12025 - 12046
  • [29] TEXT GENERATION FROM GRAMMARS
    MICHAELSON, G
    INFORMATION AND SOFTWARE TECHNOLOGY, 1990, 32 (08) : 566 - 568
  • [30] Pragmatically Informative Text Generation
    Shen, Sheng
    Fried, Daniel
    Andreas, Jacob
    Klein, Dan
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 4060 - 4067