Deep Learning Based Semantic Similarity Detection Using Text Data

被引:13
|
作者
Mansoor, Muhammad [1 ]
Rehman, Zahoor Ur [1 ]
Shaheen, Muhammad [2 ]
Khan, Muhammad Attique [3 ]
Habib, Mohamed [4 ,5 ]
机构
[1] COMSATS Univ Islamabad, Comp Sci Dept, Attock Campus, Islamabad, Pakistan
[2] Fdn Univ Islamabad, Fac Engn & IT, Islamabad, Pakistan
[3] HITEC Univ Taxila, Dept Comp Sci, Taxila, Pakistan
[4] Saudi Elect Univ, Coll Comp & Informat, Riyadh, Saudi Arabia
[5] Port Said Univ, Fac Engn, Port Fuad City, Egypt
来源
INFORMATION TECHNOLOGY AND CONTROL | 2020年 / 49卷 / 04期
关键词
Deep Learning; Semantics; Similarity; Quora; question duplication; LSTM and CNN; CONTRAST ENHANCEMENT; NEURAL-NETWORK; RECOGNITION; SELECTION; MODEL;
D O I
10.5755/j01.itc.49.4.27118
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Similarity detection in the text is the main task for a number of Natural Language Processing (NLP) applications. As textual data are comparatively large in quantity and in volume than the numeric data, measuring textual similarity is one of the important problems. Most of the similarity detection algorithms are based upon word to word matching, sentence/paragraph matching, and matching of the whole document. In this research, a novel approach is proposed using deep learning models, combining Long Short-Term Memory Network (LSTM) with Convolutional Neural Network (CNN) for measuring semantics similarity between two questions. The proposed model takes sentence pairs as input to measure the similarity between them. The model is tested on publicly available Quora's dataset. In comparison to the existing techniques gave 87.50 % accuracy which is better than the previous approaches.
引用
收藏
页码:495 / 510
页数:16
相关论文
共 50 条
  • [21] Robust semantic text similarity using LSA, machine learning, and linguistic resources
    Kashyap, Abhay
    Han, Lushan
    Yus, Roberto
    Sleeman, Jennifer
    Satyapanich, Taneeya
    Gandhi, Sunil
    Finin, Tim
    LANGUAGE RESOURCES AND EVALUATION, 2016, 50 (01) : 125 - 161
  • [22] Robust semantic text similarity using LSA, machine learning, and linguistic resources
    Abhay Kashyap
    Lushan Han
    Roberto Yus
    Jennifer Sleeman
    Taneeya Satyapanich
    Sunil Gandhi
    Tim Finin
    Language Resources and Evaluation, 2016, 50 : 125 - 161
  • [23] Assessing text semantic similarity using ontology
    Liu, Hongzhe
    Wang, Pengfei
    1600, Academy Publisher (09): : 490 - 497
  • [24] A Text Semantic Similarity Approach for Arabic Paraphrase Detection
    Mahmoud, Adnen
    Zrigui, Ahmed
    Zrigui, Mounir
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2017, PT II, 2018, 10762 : 338 - 349
  • [25] Abstractive summarization with deep reinforcement learning using semantic similarity rewards
    Fikri, Figen Beken
    Oflazer, Kemal
    Yanikoglu, Berrin
    NATURAL LANGUAGE ENGINEERING, 2024, 30 (03) : 554 - 576
  • [26] A Deforestation Detection Network Using Deep Learning-Based Semantic Segmentation
    Das, Pradeep Kumar
    Sahu, Adyasha
    Xavy, Dias V.
    Meher, Sukadev
    IEEE SENSORS LETTERS, 2024, 8 (01) : 1 - 4
  • [27] Enhanced Pedestrian Detection using Deep Learning based Semantic Image Segmentation
    Liu, Tianrui
    Stathaki, Tania
    2017 22ND INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2017,
  • [28] Comparison study of unsupervised paraphrase detection: Deep learning-The key for semantic similarity detection
    Vrbanec, Tedo
    Mestrovic, Ana
    EXPERT SYSTEMS, 2023, 40 (09)
  • [29] Semantic Event Detection Using Ensemble Deep Learning
    Pouyanfar, Samira
    Chen, Shu-Ching
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 203 - 208
  • [30] Semantic information extraction and search of mineral exploration data using text mining and deep learning methods
    Qiu, Qinjun
    Tian, Miao
    Tao, Liufeng
    Xie, Zhong
    Ma, Kai
    ORE GEOLOGY REVIEWS, 2024, 165