Fast Plagiarism Detection by Sentence Hashing

Cited by: 0
Authors
Ceglarek, Dariusz [1]
Haniewicz, Konstanty [2]
Affiliations
[1] Poznan Sch Banking, Poznan, Poland
[2] Poznan Univ Econ, Poznan, Poland
Keywords
plagiarism; plagiarism detection; longest common subsequence; semantic compression; SEIPro2S
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This work presents SHAPD, a Sentence Hashing Algorithm for Plagiarism Detection. To give the user the best results, the algorithm exploits a particular trait of written text, its natural fragmentation into sentences, and then applies a set of dedicated text-representation techniques. The results obtained show that the algorithm delivers a solution faster than the alternatives. Its algorithmic complexity is logarithmic, so it performs better than most dynamic-programming algorithms for finding the longest common subsequence.
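The general idea of comparing documents through per-sentence hashes can be illustrated with a minimal sketch. This is not the SHAPD algorithm itself, whose details are not given in this record. The function names, the naive punctuation-based sentence splitter, the lowercase word-token normalization, and the choice of SHA-1 are all assumptions made for illustration only.

```python
import hashlib
import re


def split_sentences(text: str) -> list[str]:
    """Split text on sentence-ending punctuation (a naive, assumed splitter)."""
    parts = re.split(r"[.!?]+", text)
    return [p.strip() for p in parts if p.strip()]


def normalize(sentence: str) -> str:
    """Lowercase the sentence and reduce it to single-spaced word tokens."""
    return " ".join(re.findall(r"\w+", sentence.lower()))


def sentence_hashes(text: str) -> set[str]:
    """Return the set of hashes of all normalized sentences in the text."""
    hashes = set()
    for sentence in split_sentences(text):
        norm = normalize(sentence)
        if norm:
            hashes.add(hashlib.sha1(norm.encode("utf-8")).hexdigest())
    return hashes


def shared_sentence_ratio(suspect: str, source: str) -> float:
    """Fraction of distinct suspect sentences whose hash also occurs in source."""
    suspect_hashes = sentence_hashes(suspect)
    if not suspect_hashes:
        return 0.0
    common = suspect_hashes & sentence_hashes(source)
    return len(common) / len(suspect_hashes)


if __name__ == "__main__":
    source = "The cat sat on the mat. Dogs bark loudly. Birds fly south."
    suspect = "Dogs bark  LOUDLY! Fish swim. The cat sat on the mat?"
    print(shared_sentence_ratio(suspect, source))
```

Because each sentence is reduced to one hash, matching is a set lookup instead of a character-by-character alignment. This sketch only detects sentences that are identical after normalization; it makes no attempt to reproduce the text-representation techniques or the complexity figure stated in the abstract.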
Pages: 30-37
Page count: 8
Related papers
50 results in total
  • [1] Plagiarism Detection in Homework Based on Image Hashing
    Chen, Ying
    Gan, Liping
    Zhang, Shiqing
    Guo, Wenping
    Chuang, Yuelong
    Zhao, Xiaoming
    DATA SCIENCE, PT II, 2017, 728 : 424 - 432
  • [2] An Innovative Similarity Measure for Sentence Plagiarism Detection
    Augello, Agnese
    Cuzzocrea, Alfredo
    Pilato, Giovanni
    Spiccia, Carmelo
    Vassallo, Giorgio
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2016, PT V, 2016, 9790 : 552 - 566
  • [3] Fast plagiarism detection system
    Mozgovoy, Maxim
    Fredriksson, Kimmo
    White, Daniel
    Joy, Mike
    Sutinen, Erkki
    String Processing and Information Retrieval, Proceedings, 2005, 3772 : 267 - 270
  • [4] Fast and reliable plagiarism detection system
    Mozgovoy, Maxim
    Karakovskiy, Sergey
    Kiyuev, Vitaly
    2007 37TH ANNUAL FRONTIERS IN EDUCATION CONFERENCE, GLOBAL ENGINEERING : KNOWLEDGE WITHOUT BORDERS - OPPORTUNITIES WITHOUT PASSPORTS, VOLS 1- 4, 2007, : 1718 - +
  • [5] An Improved SRL based Plagiarism Detection Technique using Sentence Ranking
    Paul, Merin
    Jamal, Sangeetha
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES, ICICT 2014, 2015, 46 : 223 - 230
  • [6] Fast Plagiarism Detection Based on Simple Document Similarity
    Baba, Kensuke
    2017 TWELFTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM), 2017, : 54 - 58
  • [7] Fast Plagiarism Detection in Large-Scale Data
    Szmit, Radoslaw
    BEYOND DATABASES, ARCHITECTURES AND STRUCTURES: TOWARDS EFFICIENT SOLUTIONS FOR DATA ANALYSIS AND KNOWLEDGE REPRESENTATION, 2017, 716 : 329 - 343
  • [8] Fast Duplicate Detection Using Locality Sensitive Hashing
    Rong, C. T.
    Feng, L. J.
    INTERNATIONAL CONFERENCE ON ADVANCED EDUCATIONAL TECHNOLOGY AND INFORMATION ENGINEERING (AETIE 2015), 2015, : 580 - 588
  • [9] Sentence-based Plagiarism Detection focusing on Nouns and Part-of-Speech Structure
    Yokoi, Takeru
    Oikawa, Gouki
    Iwata, Mitsuru
    Sato, Takashi
    Kobayakawa, Michihiro
    NEW TRENDS IN SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES, 2014, 265 : 1006 - 1015
  • [10] Automatic Plagiarism Detection Using Word-Sentence Based S-gram
    Aimmanee, Pakinee
    CHIANG MAI JOURNAL OF SCIENCE, 2011, 38 : 1 - 7