Experiments on the Indonesian Plagiarism Detection using Latent Semantic Analysis

被引:0
|
作者
Soleman, Sidik
Purwarianti, Ayu
机构
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Plagiarism is an important task since its number is increasing and the plagiarism technique is getting difficult. It means that there is not only literal plagiarism but also intelligence plagiarism. In order to handle the intelligence plagiarism, we employed latent semantic analysis (LSA) as the term-document representation. The LSA was used in the Heuristic Retrieval (HR) component and Detailed Analysis (DA) component. We conducted several experiments to compare the token type, the text segmentation and the threshold value. The test data were prepared manually from the available Indonesian paper corpus. Experimental results showed that the LSA outperformed the VSM (Vector Space Model), especially in test cases with intelligence plagiarism.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Intrinsic Plagiarism Detection Using Latent Semantic Indexing and Stylometry
    Alsallal, Muna
    Iqbal, Rahat
    Amin, Saad
    James, Anne
    [J]. 2013 SIXTH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE), 2014, : 145 - 150
  • [2] An Approach to Source-Code Plagiarism Detection and Investigation Using Latent Semantic Analysis
    Cosma, Georgina
    Joy, Mike
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2012, 61 (03) : 379 - 394
  • [3] Plagiarism detection based on semantic analysis
    Mukherjee, Indrajit
    Kumar, Bipul
    Singh, Samarth
    Sharma, Kishan
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE AND LEARNING, 2018, 12 (03) : 242 - 254
  • [4] Fuzzy Semantic-Based String Similarity Experiments to Detect Plagiarism in Indonesian Documents
    Umareta, Chonan Firda Odayakana
    Mariyah, Siti
    [J]. 2019 3RD INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS 2019), 2019,
  • [5] Plagiarism Detection Using Semantic Knowledge Graphs
    Khadilkar, Kunal
    Kulkarni, Siddhivinayak
    Bone, Poojarani
    [J]. 2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [6] Plagiarism Detection for Indonesian Language using Winnowing with Parallel Processing
    Arifin, Y.
    Isa, S. M.
    Wulandhari, L. A.
    Abdurachman, E.
    [J]. 2ND INTERNATIONAL CONFERENCE ON COMPUTING AND APPLIED INFORMATICS 2017, 2018, 978
  • [7] Signature Based Intrusion Detection using Latent Semantic Analysis
    Lassez, Jean-Louis
    Rossi, Ryan
    Sheel, Stephen
    Mukkamala, Srinivas
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1068 - 1074
  • [8] Fuzzy Semantic Plagiarism Detection
    Osman, Ahmed Hamza
    Salim, Naomie
    Kumar, Yogan Jaya
    Abuobieda, Albaraa
    [J]. ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS, 2012, 322 : 543 - 553
  • [9] An Improved Online Plagiarism Detection Approach for Semantic Analysis using Custom Search Engine
    Sharma, Kamalpreet
    Jindal, Balkrishan
    [J]. PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 764 - 768
  • [10] Using word semantic concepts for plagiarism detection in text documents
    Chang, Chia-Yang
    Lee, Shie-Jue
    Wu, Chih-Hung
    Liu, Chih-Feng
    Liu, Ching-Kuan
    [J]. INFORMATION RETRIEVAL JOURNAL, 2021, 24 (4-5): : 298 - 321