Plagiarism detection based on semantic analysis

Cited by: 4
Authors
Mukherjee, Indrajit [1]
Kumar, Bipul [1]
Singh, Samarth [1]
Sharma, Kishan [1]
Affiliation
[1] BIT Mesra, Dept Comp Sci & Engn, Ranchi 835215, Bihar, India
Keywords
semantic similarity; plagiarism detection; documents; WordNet
DOI
10.1504/IJKL.2018.092316
Chinese Library Classification number
G40 [Education]
Discipline classification code
040101; 120403
Abstract
Plagiarism means copying and pasting text, changing some of its words, or substituting synonymous or near-synonymous words without citing the source. Plagiarism is on the rise, especially in academia and research, because digital text documents on the internet can easily be copied and pasted. Existing approaches to plagiarism detection have either ignored or made limited use of information about semantic similarity between words. We propose a method that measures the semantic similarity between documents by mapping keywords (verbs, adverbs, adjectives, descriptors, etc.) to nouns and then computing the similarity between the mapped words, which can rectify these shortcomings. The algorithm is evaluated on the Corpus of Plagiarised Short Answers (Clough and Stevenson, 2011). The experiments show that the proposed algorithm gives accurate results in detecting semantic similarity between documents and outperforms previously published methods.
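The general idea of scoring documents by word-level semantic similarity can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not give the keyword-to-noun mapping or the similarity formula, so the sketch assumes a WordNet-style path measure, 1 / (1 + shortest path length), and a best-match average over words. The `HYPERNYM` table is a hypothetical toy taxonomy standing in for WordNet, and all function names are illustrative.

```python
# Toy stand-in for WordNet: each word points to its hypernym (parent).
# This tiny taxonomy is hypothetical and exists only for illustration.
HYPERNYM = {
    "car": "vehicle",
    "automobile": "vehicle",
    "bicycle": "vehicle",
    "vehicle": "entity",
    "dog": "animal",
    "cat": "animal",
    "animal": "entity",
}


def ancestors(word):
    """Return the chain from word up to the root, starting with word itself."""
    chain = [word]
    while chain[-1] in HYPERNYM:
        chain.append(HYPERNYM[chain[-1]])
    return chain


def path_similarity(a, b):
    """1 / (1 + edges on the shortest path between a and b); 0.0 if unrelated."""
    up_a, up_b = ancestors(a), ancestors(b)
    for dist_a, node in enumerate(up_a):
        if node in up_b:
            # First shared node walking up from a is the lowest common ancestor.
            return 1.0 / (1 + dist_a + up_b.index(node))
    return 0.0


def document_similarity(words_a, words_b):
    """Symmetric average of each word's best match in the other document."""
    if not words_a or not words_b:
        return 0.0

    def directed(src, dst):
        return sum(max(path_similarity(w, v) for v in dst) for w in src) / len(src)

    return (directed(words_a, words_b) + directed(words_b, words_a)) / 2


if __name__ == "__main__":
    print(path_similarity("car", "automobile"))
    print(document_similarity(["car", "dog"], ["automobile", "cat"]))
```

With a real lexical database, the toy table would be replaced by lookups into WordNet, and the word lists would come from the noun-mapped keywords that the abstract describes.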
Pages: 242-254
Number of pages: 13