Fast Plagiarism Detection by Sentence Hashing

Cited by: 0
Authors
Ceglarek, Dariusz [1]
Haniewicz, Konstanty [2]
Affiliations
[1] Poznan Sch Banking, Poznan, Poland
[2] Poznan Univ Econ, Poznan, Poland
Keywords
plagiarism; plagiarism detection; longest common subsequence; semantic compression; SEIPro2S
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This work presents SHAPD, a Sentence Hashing Algorithm for Plagiarism Detection. To give the user the best results, the algorithm exploits a special trait of written texts, their natural fragmentation into sentences, and then applies a set of special techniques for text representation. The results obtained demonstrate that the algorithm delivers a solution faster than the alternatives. Its algorithmic complexity is logarithmic, so it performs better than most dynamic-programming algorithms used to find the longest common subsequence.
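The abstract does not describe how SHAPD works internally, so the following is only a generic sketch of sentence-level hashing, not the authors' algorithm. It assumes a naive regex sentence splitter, simple lowercase normalization, and SHA-1 digests compared as Python sets; the function names (`split_sentences`, `normalize`, `sentence_hashes`, `shared_sentence_ratio`) are hypothetical and chosen for illustration.

```python
import hashlib
import re


def split_sentences(text):
    """Naive splitter (assumption): break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def normalize(sentence):
    """Lowercase and keep only word tokens so punctuation and spacing do not affect the hash."""
    return " ".join(re.findall(r"\w+", sentence.lower()))


def sentence_hashes(text):
    """Return the set of SHA-1 digests of the normalized sentences in text."""
    hashes = set()
    for sentence in split_sentences(text):
        norm = normalize(sentence)
        if norm:
            hashes.add(hashlib.sha1(norm.encode("utf-8")).hexdigest())
    return hashes


def shared_sentence_ratio(suspect, source):
    """Fraction of distinct sentences in suspect whose hash also occurs in source."""
    suspect_hashes = sentence_hashes(suspect)
    if not suspect_hashes:
        return 0.0
    return len(suspect_hashes & sentence_hashes(source)) / len(suspect_hashes)


if __name__ == "__main__":
    source = "The cat sat on the mat. Dogs bark loudly. Rain falls."
    suspect = "Dogs bark loudly! The cat sat on the mat. Something new here."
    print(shared_sentence_ratio(suspect, source))
```

Comparing sets of hashes avoids the quadratic table that dynamic-programming longest-common-subsequence methods build, but it only catches sentences that match exactly after normalization. This sketch makes no claim about the logarithmic complexity reported in the abstract.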
Pages: 30-37
Number of pages: 8