Deep Learning Based Semantic Similarity Detection Using Text Data

被引:13
|
作者
Mansoor, Muhammad [1 ]
Rehman, Zahoor Ur [1 ]
Shaheen, Muhammad [2 ]
Khan, Muhammad Attique [3 ]
Habib, Mohamed [4 ,5 ]
机构
[1] COMSATS Univ Islamabad, Comp Sci Dept, Attock Campus, Islamabad, Pakistan
[2] Fdn Univ Islamabad, Fac Engn & IT, Islamabad, Pakistan
[3] HITEC Univ Taxila, Dept Comp Sci, Taxila, Pakistan
[4] Saudi Elect Univ, Coll Comp & Informat, Riyadh, Saudi Arabia
[5] Port Said Univ, Fac Engn, Port Fuad City, Egypt
来源
INFORMATION TECHNOLOGY AND CONTROL | 2020年 / 49卷 / 04期
关键词
Deep Learning; Semantics; Similarity; Quora; question duplication; LSTM and CNN; CONTRAST ENHANCEMENT; NEURAL-NETWORK; RECOGNITION; SELECTION; MODEL;
D O I
10.5755/j01.itc.49.4.27118
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Similarity detection in the text is the main task for a number of Natural Language Processing (NLP) applications. As textual data are comparatively large in quantity and in volume than the numeric data, measuring textual similarity is one of the important problems. Most of the similarity detection algorithms are based upon word to word matching, sentence/paragraph matching, and matching of the whole document. In this research, a novel approach is proposed using deep learning models, combining Long Short-Term Memory Network (LSTM) with Convolutional Neural Network (CNN) for measuring semantics similarity between two questions. The proposed model takes sentence pairs as input to measure the similarity between them. The model is tested on publicly available Quora's dataset. In comparison to the existing techniques gave 87.50 % accuracy which is better than the previous approaches.
引用
收藏
页码:495 / 510
页数:16
相关论文
共 50 条
  • [41] Change Detection Using Deep Learning Based Semantic Segmentation for Nuclear Activity Detection and Monitoring
    Song, Ahram
    Lee, Changhui
    Lee, Jinmin
    Han, Youkyung
    KOREAN JOURNAL OF REMOTE SENSING, 2022, 38 (06) : 991 - 1005
  • [42] Learning Relations using Semantic-based Vector Similarity
    Budai, Kinga
    Barbantan, Ioana
    Dinsoreanu, Mihaela
    Potolea, Rodica
    2016 IEEE 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP), 2016, : 69 - 75
  • [43] Learning Semantic Similarity for Multi-label Text Categorization
    Li, Li
    Wang, Mengxiang
    Zhang, Longkai
    Wang, Houfeng
    CHINESE LEXICAL SEMANTICS, 2014, 8922 : 260 - 269
  • [44] Predicting Semantic Textual Similarity of Arabic Question Pairs using Deep Learning
    Einea, Omar
    Elnagar, Ashraf
    2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,
  • [45] Short Text Semantic Similarity Measurement Approach Based on Semantic Network
    Hameed, Naamah Hussien
    Alimi, Adel M.
    Sadiq, Ahmed T.
    BAGHDAD SCIENCE JOURNAL, 2022, 19 (06) : 1581 - 1591
  • [46] Enhancing Text Clustering Performance Using Semantic Similarity
    Gad, Walaa K.
    Kamel, Mohamed S.
    ENTERPRISE INFORMATION SYSTEMS-BK, 2009, 24 : 325 - 335
  • [47] TEXT CONTENT ANALYSIS USING ONTOLOGY AND SEMANTIC SIMILARITY
    Prodanovic, Dejan
    Furlan, Bojan
    Nikolic, Bosko
    2014 22ND TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2014, : 1126 - 1129
  • [48] Similarity Search on Semantic Trajectories Using Text Processing
    de Almeida, Damiao Ribeiro
    Baptista, Claudio de Souza
    de Andrade, Fabio Gomes
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2022, 11 (07)
  • [49] Short Text Similarity Calculation Using Semantic Information
    Pu, Haoyu
    Fei, Gaolei
    Zhao, Hailin
    Hu, Guangmin
    Jiao, Chengbo
    Xu, Zhoujun
    2017 3RD INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM), 2017, : 144 - 150
  • [50] Text Information Retrieval Based on Concept Semantic Similarity
    Lv, Gang
    Zheng, Cheng
    Zhang, Li
    2009 FIFTH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRID (SKG 2009), 2009, : 356 - +