A Multi-Modal Chinese Poetry Generation Model

被引:0
|
作者
Liu, Dayiheng [1 ]
Guo, Quan [1 ]
Li, Wubo [1 ]
机构
[1] Sichuan Univ, Machine Intelligence Lab, Coll Comp Sci, Chengdu 610065, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent studies in sequence-to-sequence learning demonstrate that RNN encoder-decoder structure can successfully generate Chinese poetry. However, existing methods can only generate poetry with a given first line or user's intent theme. In this paper, we proposed a three-stage multi-modal Chinese poetry generation approach. Given a picture, the first line, the title and the other lines of the poem are successively generated in three stages. According to the characteristics of Chinese poems, we propose a hierarchy-attention seq2seq model which can effectively capture character, phrase, and sentence information between contexts and improve the symmetry delivered in poems. In addition, the Latent Dirichlet allocation (LDA) model is utilized for title generation and improve the relevance of the whole poem and the title. Compared with strong baseline, the experimental results demonstrate the effectiveness of our approach, using machine evaluations as well as human judgments.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Lightweight multi-modal emotion recognition model based on modal generation
    Liu, Peisong
    Che, Manqiang
    Luo, Jiangchuan
    [J]. 2022 9TH INTERNATIONAL FORUM ON ELECTRICAL ENGINEERING AND AUTOMATION, IFEEA, 2022, : 430 - 435
  • [2] Multi-Modal Teaching Design in Learning Poetry
    Sun Nan
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL SYMPOSIUM - REFORM AND INNOVATION OF HIGHER ENGINEERING EDUCATION, 2018, : 191 - 194
  • [3] A Multi-Modal Contrastive Diffusion Model for Therapeutic Peptide Generation
    Wang, Yongkang
    Liu, Xuan
    Huang, Feng
    Xiong, Zhankun
    Zhang, Wen
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 3 - 11
  • [4] A Chinese Multi-modal Relation Extraction Model for Internet Security of Finance
    Lai, Qinghan
    Ding, Shuai
    Gong, Jinghao
    Cui, Jin'an
    Liu, Song
    [J]. 52ND ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS WORKSHOP VOLUME (DSN-W 2022), 2022, : 123 - 128
  • [5] Multi-modal Chinese Fake News Detection
    Huang, Wenxi
    Zhao, Zhangyi
    Chen, Xiaojun
    Li, Mark Junjie
    Zhang, Qin
    Fournier-Viger, Philippe
    [J]. 2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, : 109 - 117
  • [6] The role of Chinese railway in multi-modal transport
    Liu, ZY
    Zhang, Q
    Huang, QH
    [J]. TRAFFIC AND TRANSPORTATION STUDIES, 2000, : 162 - 165
  • [7] Towards a multi-modal perceptual model
    Hollier, MP
    Voelcker, R
    [J]. BT TECHNOLOGY JOURNAL, 1997, 15 (04): : 162 - 171
  • [8] Multi-modal Background Model Initialization
    Bloisi, Domenico D.
    Grillo, Alfonso
    Pennisi, Andrea
    Iocchi, Luca
    Passaretti, Claudio
    [J]. NEW TRENDS IN IMAGE ANALYSIS AND PROCESSING - ICIAP 2015 WORKSHOPS, 2015, 9281 : 485 - 492
  • [9] "textklang" - Towards a Multi-Modal Exploration Platform for German Poetry
    Schauffler, Nadja
    Bernhart, Toni
    Blessing, Andre
    Eschenbach, Gunilla
    Gaertner, Markus
    Jung, Kerstin
    Kinder, Anna
    Koch, Julia
    Richter, Sandra
    Viehhauser, Gabriel
    Vu, Thang
    Wesemann, Lorenz
    Kuhn, Jonas
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 5345 - 5355
  • [10] Multi-modal human robot interaction for map generation
    Saito, H
    Ishimura, K
    Hattori, M
    Takamori, T
    [J]. SICE 2002: PROCEEDINGS OF THE 41ST SICE ANNUAL CONFERENCE, VOLS 1-5, 2002, : 2721 - 2724