Paraphrase Generation with Deep Reinforcement Learning

Cited by: 0
Authors
Li, Zichao [1]
Jiang, Xin [1]
Shang, Lifeng [1]
Li, Hang [2]
Affiliations
[1] Huawei Technologies, Noah's Ark Lab, Shenzhen, People's Republic of China
[2] Toutiao AI Lab, Beijing, People's Republic of China
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Automatic generation of paraphrases from a given sentence is an important yet challenging task in natural language processing (NLP). In this paper, we present a deep reinforcement learning approach to paraphrase generation. Specifically, we propose a new framework for the task, which consists of a generator and an evaluator, both of which are learned from data. The generator, built as a sequence-to-sequence learning model, can produce paraphrases given a sentence. The evaluator, constructed as a deep matching model, can judge whether two sentences are paraphrases of each other. The generator is first trained by deep learning and then further fine-tuned by reinforcement learning in which the reward is given by the evaluator. For the learning of the evaluator, we propose two methods based on supervised learning and inverse reinforcement learning respectively, depending on the type of available training data. Experimental results on two datasets demonstrate the proposed models (the generators) can produce more accurate paraphrases and outperform the state-of-the-art methods in paraphrase generation in both automatic evaluation and human evaluation.
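The fine-tuning step described in the abstract, where the generator is updated by reinforcement learning with the evaluator's score as the reward, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not the authors' implementation: the "generator" is reduced to a softmax policy over three fixed candidate sentences instead of a sequence-to-sequence model, `toy_evaluator` is a hypothetical word-overlap score standing in for the learned deep matching model, the update rule is plain REINFORCE with a running-mean baseline, and the supervised pre-training stage is skipped.

```python
import math
import random


def softmax(logits):
    """Convert unnormalized scores into a probability distribution."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def toy_evaluator(source, candidate):
    """Hypothetical stand-in for the learned evaluator: a reward in [0, 1].

    Word-set Jaccard overlap, with an exact copy scored 0 so that the
    generator is not rewarded for repeating its input.
    """
    if candidate == source:
        return 0.0
    a, b = set(source.split()), set(candidate.split())
    return len(a & b) / len(a | b)


def fine_tune(source, candidates, steps=500, lr=0.5, seed=0):
    """REINFORCE over a fixed candidate set; returns the final probabilities."""
    rng = random.Random(seed)
    logits = [0.0] * len(candidates)  # toy "generator": one logit per candidate
    baseline = 0.0  # running mean reward, used to reduce gradient variance
    for _ in range(steps):
        probs = softmax(logits)
        choice = rng.choices(range(len(candidates)), weights=probs)[0]
        reward = toy_evaluator(source, candidates[choice])
        advantage = reward - baseline
        # Gradient of log p(choice) w.r.t. logit i is 1[i == choice] - probs[i].
        for i in range(len(logits)):
            indicator = 1.0 if i == choice else 0.0
            logits[i] += lr * advantage * (indicator - probs[i])
        baseline = 0.9 * baseline + 0.1 * reward
    return softmax(logits)


source = "how do i learn python"
candidates = [
    "how do i learn python",      # exact copy
    "how can i learn python",     # plausible paraphrase
    "what is the weather today",  # unrelated sentence
]
probs = fine_tune(source, candidates)
for sentence, p in zip(candidates, probs):
    print(f"{p:.3f}  {sentence}")
```

In this toy setting only the second candidate earns a positive reward, so the policy shifts probability mass toward it over training. A real system would replace the candidate table with a sequence decoder and the overlap score with a trained evaluator.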
Pages: 3865 - 3878
Page count: 14
Related papers
50 in total
  • [1] An Empirical Comparison on Imitation Learning and Reinforcement Learning for Paraphrase Generation
    Du, Wanyu
    Ji, Yangfeng
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019: 6012 - 6018
  • [2] Learning to Selectively Learn for Weakly Supervised Paraphrase Generation with Model-based Reinforcement Learning
    Yin, Haiyan
    Li, Dingcheng
    Li, Ping
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022: 1385 - 1395
  • [3] Automatic View Generation with Deep Learning and Reinforcement Learning
    Yuan, Haitao
    Li, Guoliang
    Feng, Ling
    Sun, Ji
    Han, Yue
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020: 1501 - 1512
  • [4] A Deep Generative Framework for Paraphrase Generation
    Gupta, Ankush
    Agarwal, Arvind
    Singh, Prawaan
    Rai, Piyush
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018: 5149 - 5156
  • [5] A Deep Reinforcement Learning Framework for Column Generation
    Chi, Cheng
    Aboussalah, Amine Mohamed
    Khalil, Elias B.
    Wang, Juyoung
    Sherkat-Masoumi, Zoha
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
  • [6] Deep Reinforcement Learning for Automatic Thumbnail Generation
    Li, Zhuopeng
    Zhang, Xiaoyan
    MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 41 - 53
  • [7] Generation of ice states through deep reinforcement learning
    Zhao, Kai-Wen
    Kao, Wen-Han
    Wu, Kai-Hsin
    Kao, Ying-Jer
    PHYSICAL REVIEW E, 2019, 99 (06)
  • [8] Deep reinforcement learning for community architectural layout generation
    Sheng, Tao
    Xiong, Yun
    Wang, Haofen
    Zhang, Yao
    Wang, Siqi
    Zhang, Weinan
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, 67 (03) : 2453 - 2480
  • [9] Deep Reinforcement Learning for Trajectory Generation and Optimisation of UAVs
    Akhtar, Mishma
    Maqsood, Adnan
    Verbeke, Mathias
    2023 10TH INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN AIR AND SPACE TECHNOLOGIES, RAST, 2023
  • [10] A Hybrid Deep Learning Architecture for Paraphrase Identification
    Kubal, Divesh R.
    Nimkar, Anant V.
    2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018