Integrating Linguistic Knowledge to Sentence Paraphrase Generation

Citations: 0
Authors
Lin, Zibo [1, 2]
Li, Ziran [1, 2]
Ding, Ning [1, 2]
Zheng, Hai-Tao [1, 2]
Shen, Ying [3]
Wang, Wei [1, 2]
Zhao, Cong-Zhi [4]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Beijing, Peoples R China
[3] Peking Univ Shenzhen Grad Sch, Sch Elect & Comp Engn, Shenzhen 518055, Guangdong, Peoples R China
[4] Giiso Informat Technol Co Ltd, Hangzhou, Zhejiang, Peoples R China
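Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract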
Paraphrase generation aims to rewrite a text using different words while preserving its meaning. Previous work performs the task using only the given dataset and ignores available external linguistic knowledge. Intuitively, however, a model can generate more expressive and diverse paraphrases with the help of such knowledge. To fill this gap, we propose the Knowledge-Enhanced Paraphrase Network (KEPN), a transformer-based framework that leverages external linguistic knowledge to facilitate paraphrase generation. (1) The model integrates synonym information from external linguistic knowledge into the paraphrase generator, where it guides the decision on whether to generate a new word or replace a word with a synonym. (2) To locate synonym pairs more accurately, we adopt an incremental encoding scheme that incorporates the position information of each synonym. In addition, a multi-task architecture helps the framework jointly learn to select synonym pairs and to generate expressive paraphrases. Experimental results on both English and Chinese datasets show that our method significantly outperforms state-of-the-art approaches in both automatic and human evaluation.
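The abstract's decision between generating a new word and substituting a synonym can be illustrated with a gated mixture of two probability distributions. The sketch below is only an illustration under stated assumptions: the abstract does not give KEPN's equations, so the gating form, the function name `mix_distributions`, and the parameters `gate` and `synonym_scores` are all hypothetical and resemble a generic copy-style mixture rather than the authors' formulation.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of numbers."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def mix_distributions(vocab, vocab_logits, synonym_scores, gate):
    """Blend a generation distribution with a synonym distribution.

    Assumption (not taken from the paper): `gate` in [0, 1] is the
    probability of generating from the vocabulary, and 1 - gate is the
    probability of emitting one of the candidate synonyms.
    `synonym_scores` maps each candidate synonym to an unnormalized score.
    """
    if not 0.0 <= gate <= 1.0:
        raise ValueError("gate must lie in [0, 1]")
    p_gen = softmax(vocab_logits)
    if not synonym_scores:
        # No synonym candidates: all probability mass goes to generation.
        return dict(zip(vocab, p_gen))
    mixed = {word: gate * p for word, p in zip(vocab, p_gen)}
    words = list(synonym_scores)
    p_syn = softmax([synonym_scores[w] for w in words])
    for word, p in zip(words, p_syn):
        # A synonym may also be in the vocabulary, so accumulate the mass.
        mixed[word] = mixed.get(word, 0.0) + (1.0 - gate) * p
    return mixed


if __name__ == "__main__":
    dist = mix_distributions(
        vocab=["quick", "fast", "slow"],
        vocab_logits=[0.0, 0.0, 0.0],
        synonym_scores={"rapid": 1.0},
        gate=0.5,
    )
    print(dist)
```

In a trained model the gate and both score vectors would come from the decoder state at each step; here they are fixed inputs so the mixing arithmetic can be checked by hand.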
Pages: 8368-8375
Number of pages: 8
Related papers
50 in total
  • [1] Linguistic steganography with knowledge-poor paraphrase generation
    Kermanidis, Katia Lida
    LITERARY AND LINGUISTIC COMPUTING, 2011, 26 (04) : 417 - 434
  • [2] Vietnamese Sentence Paraphrase Identification using Pre-trained Model and Linguistic Knowledge
    Dien Dinh
    Nguyen Le Thanh
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (08) : 796 - 806
  • [3] Integrating Transformer and Paraphrase Rules for Sentence Simplification
    Zhao, Sanqiang
    Meng, Rui
    He, Daqing
    Saptono, Andi
    Parmanto, Bambang
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3164 - 3173
  • [4] Pushing Paraphrase Away from Original Sentence: A Multi-Round Paraphrase Generation Approach
    Lin, Zhe
    Wan, Xiaojun
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 1548 - 1557
  • [5] Enhancing Paraphrase Question Generation With Prior Knowledge
    Xie, Jiayuan
    Fang, Wenhao
    Huang, Qingbao
    Cai, Yi
    Wang, Tao
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1464 - 1475
  • [6] Towards Document-Level Paraphrase Generation with Sentence Rewriting and Reordering
    Lin, Zhe
    Cai, Yitao
    Wan, Xiaojun
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1033 - 1044
  • [7] Automatic Paraphrase Generation at Phrasal, and Sentence Level for Urdu Language: Data and Methods
    Khan, Zara
    Muneer, Iqra
    Nawab, Rao Muhammad Adeel
    Mahmood, Ahmad
    EUROPEAN JOURNAL ON ARTIFICIAL INTELLIGENCE, 2025,
  • [8] Linguistic resources for paraphrase generation in portuguese: a lexicon-grammar approach
    Anabela Barreiro
    Cristina Mota
    Jorge Baptista
    Lucília Chacoto
    Paula Carvalho
    Language Resources and Evaluation, 2022, 56 : 1 - 35
  • [9] Linguistic resources for paraphrase generation in portuguese: a lexicon-grammar approach
    Barreiro, Anabela
    Mota, Cristina
    Baptista, Jorge
    Chacoto, Lucilia
    Carvalho, Paula
    LANGUAGE RESOURCES AND EVALUATION, 2022, 56 (01) : 1 - 35
  • [10] NooJ Linguistic Resources for Paraphrase Generation of Italian Support Verb Construction
    Cirillo, Nicola
    FORMALIZING NATURAL LANGUAGES: APPLICATIONS TO NATURAL LANGUAGE PROCESSING AND DIGITAL HUMANITIES, NOOJ 2023, 2024, 1816 : 191 - 201