Paraphrase Identification Based on Weighted URAE, Unit Similarity and Context Correlation Feature

被引:4
|
作者
Zhou, Jie [1 ]
Liu, Gongshen [1 ]
Sun, Huanrong [2 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
[2] SJTU Shanghai Songheng Informat Content Anal Join, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Paraphrase identification; Recursive Autoencoders; Phrase embedding; Sentence embedding; Deep learning; Semantic feature;
D O I
10.1007/978-3-319-99501-4_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A deep learning model adaptive to both sentence-level and article-level paraphrase identification is proposed in this paper. It consists of pairwise unit similarity feature and semantic context correlation feature. In this model, sentences are represented by word and phrase embedding while articles are represented by sentence embedding. Those phrase and sentence embedding are learned from parse trees through Weighted Unfolding Recursive Autoencoders (WURAE), an unsupervised learning algorithm. Then, unit similarity matrix is calculated by matching the pairwise lists of embedding. It is used to extract the pairwise unit similarity feature through CNN and k-max pooling layers. In addition, semantic context correlation feature is taken into account, which is captured by the combination of CNN and LSTM. CNN layers learn collocation information between adjacent units while LSTM extracts the long-term dependency feature of the text based on the output of CNN. This model is experimented on a famous English sentence paraphrase corpus, MSRPC, and a Chinese article paraphrase corpus. The results show that the deep semantic feature of text could be extracted based on WURAE, unit similarity and context correlation feature. We release our code of WURAE, deep learning model for paraphrase identification and pre-trained phrase end sentence embedding data for use by the community.
引用
收藏
页码:41 / 53
页数:13
相关论文
共 50 条
  • [1] Parameter-efficient feature-based transfer for paraphrase identification
    Liu, Xiaodong
    Rzepka, Rafal
    Araki, Kenji
    NATURAL LANGUAGE ENGINEERING, 2023, 29 (04) : 1066 - 1096
  • [2] Paraphrase Identification by Using Clause-Based Similarity Features and Machine Translation Metrics
    Thenmozhi, D.
    Aravindan, Chandrabose
    COMPUTER JOURNAL, 2016, 59 (09): : 1289 - 1302
  • [3] Noninvasive Load Identification Method Based on Feature Similarity
    Li, Hongyan
    Ding, Xianfeng
    Qu, Dan
    Lin, Jiang
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2020, 2020 (2020)
  • [4] A Research on Multi-feature Word-level Paraphrase Extracting System Based on Context
    He Xian-jiang
    Yu Zhong-hua
    2012 FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION NETWORKING AND SECURITY (MINES 2012), 2012, : 441 - 444
  • [5] A Track to Track Association Algorithm Based on Weighted State Correlation Similarity
    Tokta, Aybars
    Hocaoglu, Ali Koksal
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018,
  • [6] New Feature Selection Methods Based on Context Similarity for Text Categorization
    Chen, Yifei
    Han, Bingqing
    Hou, Ping
    2014 11TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2014, : 598 - 604
  • [7] An Improved Image Inpainting Method Based on Feature Similarity Context Encoder
    Li, Zhiqiang
    Zhang, Zhouzhong
    Guo, Hongchen
    3RD ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI2018), 2018, 1069
  • [8] Assessing semantic similarity between concepts: A weighted-feature-based approach
    Wasti, Shahbaz Hassan
    Hussain, Muhammad Jawad
    Huang, Guangjian
    Akram, Aftab
    Jiang, Yuncheng
    Tang, Yong
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (07):
  • [9] Weighted Correlation Filter Tracking Algorithm Based on Context and Relocation
    Xiong C.
    Lu Y.
    Yan J.
    Guangxue Xuebao/Acta Optica Sinica, 2019, 39 (04):
  • [10] Weighted Correlation Filter Tracking Algorithm Based on Context and Relocation
    Xiong Changzhen
    Lu Yan
    Yan Jiaqing
    ACTA OPTICA SINICA, 2019, 39 (04)