Paraphrase Identification Based on Weighted URAE, Unit Similarity and Context Correlation Feature

被引:4
|
作者
Zhou, Jie [1 ]
Liu, Gongshen [1 ]
Sun, Huanrong [2 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
[2] SJTU Shanghai Songheng Informat Content Anal Join, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Paraphrase identification; Recursive Autoencoders; Phrase embedding; Sentence embedding; Deep learning; Semantic feature;
D O I
10.1007/978-3-319-99501-4_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A deep learning model adaptive to both sentence-level and article-level paraphrase identification is proposed in this paper. It consists of pairwise unit similarity feature and semantic context correlation feature. In this model, sentences are represented by word and phrase embedding while articles are represented by sentence embedding. Those phrase and sentence embedding are learned from parse trees through Weighted Unfolding Recursive Autoencoders (WURAE), an unsupervised learning algorithm. Then, unit similarity matrix is calculated by matching the pairwise lists of embedding. It is used to extract the pairwise unit similarity feature through CNN and k-max pooling layers. In addition, semantic context correlation feature is taken into account, which is captured by the combination of CNN and LSTM. CNN layers learn collocation information between adjacent units while LSTM extracts the long-term dependency feature of the text based on the output of CNN. This model is experimented on a famous English sentence paraphrase corpus, MSRPC, and a Chinese article paraphrase corpus. The results show that the deep semantic feature of text could be extracted based on WURAE, unit similarity and context correlation feature. We release our code of WURAE, deep learning model for paraphrase identification and pre-trained phrase end sentence embedding data for use by the community.
引用
收藏
页码:41 / 53
页数:13
相关论文
共 50 条
  • [41] Optimised Feature Selection for Identification of Carcinogenic Leukocytes Using Weighted Aggregation Based Transposition PSO
    Kar, Subhajit
    Das Sharma, Kaushik
    Maitra, Madhubanti
    IETE JOURNAL OF RESEARCH, 2022, 68 (03) : 1991 - 2004
  • [42] A Wavelet Based Multiscale Weighted Permutation Entropy Method for Sensor Fault Feature Extraction and Identification
    Yang, Qiaoning
    Wang, Jianlin
    JOURNAL OF SENSORS, 2016, 2016
  • [43] Feature-based multi-criteria recommendation system using a weighted approach with ranking correlation
    Zeeshan, Zeeshan
    ul Ain, Qurat
    Bhatti, Uzair Aslam
    Memon, Waqar Hussain
    Ali, Sajid
    Nawaz, Saqib Ali
    Nizamani, Mir Muhammad
    Mehmood, Anum
    Bhatti, Mughair Aslam
    Shoukat, Muhammad Usman
    INTELLIGENT DATA ANALYSIS, 2021, 25 (04) : 1013 - 1029
  • [44] Distractor-Aware Long-Term Correlation Tracking Based on Information Entropy Weighted Feature
    Yu, Ming-Xin
    Zhang, Yu-Hua
    Li, Yong-Ke
    Li, Jian-Zeng
    Wang, Chang-Long
    IEEE ACCESS, 2020, 8 : 29417 - 29429
  • [45] Remote Sensing Image Registration Based on Fuzzy Shape Context Feature and Local Space Vector Similarity Constraint
    Ma, Xinke
    Yang, Yang
    Yang, Kun
    2018 26TH INTERNATIONAL CONFERENCE ON GEOINFORMATICS (GEOINFORMATICS 2018), 2018,
  • [46] Raman spectrum identification based on the correlation score using the weighted segmental hit quality index
    Park, Jun-Kyu
    Park, Aaron
    Yang, Si Kyung
    Baek, Sung-June
    Hwang, Joonki
    Choo, Jaebum
    ANALYST, 2017, 142 (02) : 380 - 388
  • [47] A progressive learning classifier based on dynamic differential weighted network for feature identification of brain network series
    Xue, Wei
    He, Hong
    KNOWLEDGE-BASED SYSTEMS, 2023, 274
  • [48] Identification of Mitral Annulus Hinge Point Based on Local Context Feature and Additive SVM Classifier
    Zhang, Jianming
    Liu, Yangchun
    Xu, Wei
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2015, 2015
  • [49] A Flexible and Accurate Reasoning Method for Danger-Aware Services Based on Context Similarity from Feature Point of View
    Wang, Junbo
    Cheng, Zixue
    Chen, Yongping
    Jing, Lei
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (09): : 1755 - 1767
  • [50] Rank-order-correlation-based feature vector context transformation for learning to rank for information retrieval
    Yeh, Jen-Yuan
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2018, 33 (01): : 41 - 52