Experiments on the Indonesian Plagiarism Detection using Latent Semantic Analysis

被引:0
|
作者
Soleman, Sidik
Purwarianti, Ayu
机构
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Plagiarism is an important task since its number is increasing and the plagiarism technique is getting difficult. It means that there is not only literal plagiarism but also intelligence plagiarism. In order to handle the intelligence plagiarism, we employed latent semantic analysis (LSA) as the term-document representation. The LSA was used in the Heuristic Retrieval (HR) component and Detailed Analysis (DA) component. We conducted several experiments to compare the token type, the text segmentation and the threshold value. The test data were prepared manually from the available Indonesian paper corpus. Experimental results showed that the LSA outperformed the VSM (Vector Space Model), especially in test cases with intelligence plagiarism.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] A Review of Plagiarism Detection Based On Lexical and Semantic Approach
    Yousuf, Shameem
    Ahmad, Muzamil
    Nasrullah, Sheikh
    [J]. 2013 INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN COMMUNICATION, CONTROL, SIGNAL PROCESSING AND COMPUTING APPLICATIONS (IEEE-C2SPCA-2013), 2013,
  • [32] COMPARISON OF LATENT SEMANTIC ANALYSIS AND PROBABILISTIC LATENT SEMANTIC ANALYSIS FOR DOCUMENTS CLUSTERING
    Kuta, Marcin
    Kitowski, Jacek
    [J]. COMPUTING AND INFORMATICS, 2014, 33 (03) : 652 - 666
  • [33] Semantic Similarity/Relatedness for Cross language plagiarism detection
    Ezzikouri, Hanane
    Oukessou, Mohamed
    Erritali, Mohammed
    [J]. 2016 13TH INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS, IMAGING AND VISUALIZATION (CGIV), 2016, : 372 - 374
  • [34] Finding aliases on the web using latent semantic analysis
    Bhat, V
    Oates, T
    Shanbhag, V
    Nicholas, C
    [J]. DATA & KNOWLEDGE ENGINEERING, 2004, 49 (02) : 129 - 143
  • [35] Classification of signature curves using latent semantic analysis
    Shakiban, C
    Lloyd, R
    [J]. COMPUTER ALGEBRA AND GEOMETRIC ALGEBRA WITH APPLICATIONS, 2005, 3519 : 152 - 162
  • [36] Process Model Search Using Latent Semantic Analysis
    Schoknecht, Andreas
    Fischer, Nicolai
    Oberweis, Andreas
    [J]. BUSINESS PROCESS MANAGEMENT WORKSHOPS, BPM 2016, 2017, 281 : 283 - 295
  • [37] Automatic text summarization using latent semantic analysis
    I. V. Mashechkin
    M. I. Petrovskiy
    D. S. Popov
    D. V. Tsarev
    [J]. Programming and Computer Software, 2011, 37 : 299 - 305
  • [38] Using latent semantic analysis to assess reader strategies
    Magliano, JP
    Wiemer-Hastings, K
    Millis, KK
    Muñoz, BD
    McNamara, D
    [J]. BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 2002, 34 (02): : 181 - 188
  • [39] Automatic Text Summarization Using Latent Semantic Analysis
    Mashechkin, I. V.
    Petrovskiy, M. I.
    Popov, D. S.
    Tsarev, D. V.
    [J]. PROGRAMMING AND COMPUTER SOFTWARE, 2011, 37 (06) : 299 - 305
  • [40] Latent semantic analysis of game models using LSTM
    Ghica, Dan R.
    Alyahya, Khulood
    [J]. JOURNAL OF LOGICAL AND ALGEBRAIC METHODS IN PROGRAMMING, 2019, 106 : 39 - 54