Paraphrase Generation with Deep Reinforcement Learning

Authors
Li, Zichao [1]
Jiang, Xin [1]
Shang, Lifeng [1]
Li, Hang [2]
Affiliations
[1] Huawei Technologies, Noah's Ark Lab, Shenzhen, People's Republic of China
[2] Toutiao AI Lab, Beijing, People's Republic of China
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Automatic generation of paraphrases from a given sentence is an important yet challenging task in natural language processing (NLP). In this paper, we present a deep reinforcement learning approach to paraphrase generation. Specifically, we propose a new framework for the task, which consists of a generator and an evaluator, both of which are learned from data. The generator, built as a sequence-to-sequence learning model, can produce paraphrases given a sentence. The evaluator, constructed as a deep matching model, can judge whether two sentences are paraphrases of each other. The generator is first trained by deep learning and then further fine-tuned by reinforcement learning in which the reward is given by the evaluator. For the learning of the evaluator, we propose two methods based on supervised learning and inverse reinforcement learning respectively, depending on the type of available training data. Experimental results on two datasets demonstrate that the proposed models (the generators) can produce more accurate paraphrases and outperform the state-of-the-art methods in paraphrase generation in both automatic evaluation and human evaluation.
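The fine-tuning step described in the abstract can be illustrated with a toy policy-gradient sketch. Everything here is an assumption made for illustration and is not the authors' implementation: the generator is reduced to a softmax over a fixed candidate list (the paper uses a sequence-to-sequence model), `toy_evaluator` is a hypothetical word-overlap score standing in for the learned deep matching model, and the expected gradient is computed exactly because the candidate set is tiny, where a real system would estimate it from sampled outputs.

```python
import math


def toy_evaluator(source: str, candidate: str) -> float:
    """Hypothetical stand-in for the learned evaluator.

    Returns the Jaccard overlap of the two word sets, a value in [0, 1].
    """
    a, b = set(source.split()), set(candidate.split())
    return len(a & b) / len(a | b)


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fine_tune(source, candidates, steps=50, lr=1.0):
    """Gradient ascent on the expected evaluator reward.

    The "generator" is a categorical policy with one logit per candidate.
    For a softmax policy, d E[r] / d logit_i = p_i * (r_i - E[r]).
    """
    logits = [0.0] * len(candidates)  # start from a uniform policy
    rewards = [toy_evaluator(source, c) for c in candidates]
    for _ in range(steps):
        probs = softmax(logits)
        expected = sum(p * r for p, r in zip(probs, rewards))
        logits = [
            t + lr * p * (r - expected)
            for t, p, r in zip(logits, probs, rewards)
        ]
    return softmax(logits), rewards


# Illustrative data only; not taken from the paper's datasets.
SOURCE = "how can i learn python quickly"
CANDIDATES = [
    "how can i quickly learn python",
    "how can i cook pasta quickly",
    "where is the train station",
]

probs, rewards = fine_tune(SOURCE, CANDIDATES)
best = max(range(len(CANDIDATES)), key=probs.__getitem__)
print("evaluator rewards:", rewards)
print("preferred output:", CANDIDATES[best])
```

Under these assumptions, probability mass shifts toward the candidate that the evaluator scores highest. The names `fine_tune`, `SOURCE`, and `CANDIDATES` are illustrative only.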
Pages: 3865-3878
Page count: 14