A Multi-Modal Chinese Poetry Generation Model

被引:0
|
作者
Liu, Dayiheng [1 ]
Guo, Quan [1 ]
Li, Wubo [1 ]
机构
[1] Sichuan Univ, Machine Intelligence Lab, Coll Comp Sci, Chengdu 610065, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent studies in sequence-to-sequence learning demonstrate that RNN encoder-decoder structure can successfully generate Chinese poetry. However, existing methods can only generate poetry with a given first line or user's intent theme. In this paper, we proposed a three-stage multi-modal Chinese poetry generation approach. Given a picture, the first line, the title and the other lines of the poem are successively generated in three stages. According to the characteristics of Chinese poems, we propose a hierarchy-attention seq2seq model which can effectively capture character, phrase, and sentence information between contexts and improve the symmetry delivered in poems. In addition, the Latent Dirichlet allocation (LDA) model is utilized for title generation and improve the relevance of the whole poem and the title. Compared with strong baseline, the experimental results demonstrate the effectiveness of our approach, using machine evaluations as well as human judgments.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] A Multi-modal SPM Model for Image Classification
    Zheng, Peng
    Zhao, Zhong-Qiu
    Gao, Jun
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2017, PT III, 2017, 10363 : 525 - 535
  • [22] A Multi-modal Graphical Model for Scene Analysis
    Namin, Sarah Taghavi
    Najafi, Mohammad
    Salzmann, Mathieu
    Petersson, Lars
    [J]. 2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 1006 - 1013
  • [23] Multi-Modal Summary Generation using Multi-Objective Optimization
    Jangra, Anubhav
    Saha, Sriparna
    Jatowt, Adam
    Hasanuzzaman, Mohammad
    [J]. PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1745 - 1748
  • [24] Clothing generation by multi-modal embedding: A compatibility matrix-regularized GAN model
    Liu, Linlin
    Zhang, Haijun
    Zhou, Dongliang
    [J]. IMAGE AND VISION COMPUTING, 2021, 107
  • [25] Flexible Dual Multi-Modal Hashing for Incomplete Multi-Modal Retrieval
    Wei, Yuhong
    An, Junfeng
    [J]. INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2024,
  • [26] Multi-Modal 2020: Multi-Modal Argumentation 30 Years Later
    Gilbert, Michael A.
    [J]. INFORMAL LOGIC, 2022, 42 (03): : 487 - 506
  • [27] More than Text: Multi-modal Chinese Word Segmentation
    Zhang, Dong
    Hu, Zheng
    Li, Shoushan
    Wu, Hanqian
    Zhu, Qiaoming
    Zhou, Guodong
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 550 - 557
  • [28] MillenniumDB: A Multi-modal, Multi-model Graph Database
    Vrgoc, Domagoj
    Rojas, Carlos
    Angles, Renzo
    Arenas, Marcelo
    Calisto, Vicente
    Farias, Benjamin
    Ferrada, Sebastian
    Heuer, Tristan
    Hogan, Aidan
    Navarro, Gonzalo
    Pinto, Alexander
    Reutter, Juan
    Rosales, Henry
    Toussiant, Etienne
    [J]. COMPANION OF THE 2024 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, SIGMOD-COMPANION 2024, 2024, : 496 - 499
  • [29] Response generation in multi-modal dialogues with split pre-generation and cross-modal contrasting
    Li, Linqin
    Zhang, Dong
    Zhu, Suyang
    Li, Shoushan
    Zhou, Guodong
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (01)
  • [30] Next generation chemical tools for multi-modal glycome analysis
    Gerling-Driessen, Ulla I. M.
    Driessen, Marc D.
    [J]. GLYCOBIOLOGY, 2023, 33 (11) : 1059 - 1060