Deep Learning Based Semantic Similarity Detection Using Text Data

被引:13
|
作者
Mansoor, Muhammad [1 ]
Rehman, Zahoor Ur [1 ]
Shaheen, Muhammad [2 ]
Khan, Muhammad Attique [3 ]
Habib, Mohamed [4 ,5 ]
机构
[1] COMSATS Univ Islamabad, Comp Sci Dept, Attock Campus, Islamabad, Pakistan
[2] Fdn Univ Islamabad, Fac Engn & IT, Islamabad, Pakistan
[3] HITEC Univ Taxila, Dept Comp Sci, Taxila, Pakistan
[4] Saudi Elect Univ, Coll Comp & Informat, Riyadh, Saudi Arabia
[5] Port Said Univ, Fac Engn, Port Fuad City, Egypt
来源
INFORMATION TECHNOLOGY AND CONTROL | 2020年 / 49卷 / 04期
关键词
Deep Learning; Semantics; Similarity; Quora; question duplication; LSTM and CNN; CONTRAST ENHANCEMENT; NEURAL-NETWORK; RECOGNITION; SELECTION; MODEL;
D O I
10.5755/j01.itc.49.4.27118
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Similarity detection in the text is the main task for a number of Natural Language Processing (NLP) applications. As textual data are comparatively large in quantity and in volume than the numeric data, measuring textual similarity is one of the important problems. Most of the similarity detection algorithms are based upon word to word matching, sentence/paragraph matching, and matching of the whole document. In this research, a novel approach is proposed using deep learning models, combining Long Short-Term Memory Network (LSTM) with Convolutional Neural Network (CNN) for measuring semantics similarity between two questions. The proposed model takes sentence pairs as input to measure the similarity between them. The model is tested on publicly available Quora's dataset. In comparison to the existing techniques gave 87.50 % accuracy which is better than the previous approaches.
引用
收藏
页码:495 / 510
页数:16
相关论文
共 50 条
  • [1] Text similarity semantic calculation based on deep reinforcement learning
    Chen G.
    Shi X.
    Chen M.
    Zhou L.
    International Journal of Security and Networks, 2020, 15 (01) : 59 - 66
  • [2] Semantic similarity and text summarization based novelty detection
    Kumar, Sushil
    Bhatia, Komal Kumar
    SN APPLIED SCIENCES, 2020, 2 (03):
  • [3] Semantic similarity and text summarization based novelty detection
    Sushil Kumar
    Komal Kumar Bhatia
    SN Applied Sciences, 2020, 2
  • [4] SEMANTIC SEGMENTATION OF TEXT USING DEEP LEARNING
    Lattisi, Tiziano
    Farina, Davide
    Ronchetti, Marco
    COMPUTING AND INFORMATICS, 2022, 41 (01) : 78 - 97
  • [5] Semantic text similarity using corpus-based word similarity and string similarity
    University of Ottawa
    不详
    ACM Transactions on Knowledge Discovery from Data, 2008, 2 (02)
  • [6] Chinese Text Detection Using Deep Learning Model and Synthetic Data
    Gao, Wei-wei
    Zhang, Jun
    Chen, Peng
    Wang, Bing
    Xia, Yi
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, PT I, 2018, 10954 : 503 - 512
  • [7] Text Similarity Based on Semantic Analysis
    Wang, Junli
    Zhou, Qing
    Sun, Guobao
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INDUSTRIAL ENGINEERING (AIIE 2016), 2016, 133 : 303 - 307
  • [8] Tree-structured Curriculum Learning based on Semantic Similarity of Text
    Han, Sanggyu
    Myaeng, Sung-Hyon
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 971 - 976
  • [9] Semantic Based Text Similarity Computation
    Liu, Yaqi
    Li, Zhijiang
    ADVANCED GRAPHIC COMMUNICATIONS AND MEDIA TECHNOLOGIES, 2017, 417 : 343 - 348
  • [10] Detection of medical text semantic similarity based on convolutional neural network
    Zheng, Tao
    Gao, Yimei
    Wang, Fei
    Fan, Chenhao
    Fu, Xingzhi
    Li, Mei
    Zhang, Ya
    Zhang, Shaodian
    Ma, Handong
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (01)