Plagiarism Detection System for Indonesia Text Based Document by Fingerprint Method and Natural Language Processing Approach

Cited by: 0
Authors
Winarti, Titin [1]
Kerami, Djati [2]
Etp, Lussiana [3]
Sekarwati, Kemal Ade [4]
Affiliations
[1] Semarang Univ, Fac Informat Technol & Commun, Semarang 50196, Indonesia
[2] Indonesia Univ, Fac Math & Nat Sci, Depok 16424, Indonesia
[3] Sch Informat Management & Comp Jakarta, Comp Syst, Jakarta 12140, Indonesia
[4] Gunadarma Univ, Fac Comp Sci & Informat Technol, Jakarta 16424, Indonesia
Keywords
Plagiarism; Fingerprint; Natural Language Processing;
DOI
10.1166/asl.2016.7993
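Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Subject classification code
07 ; 0710 ; 09 ;
Abstract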
Plagiarism is common in many communities, academia among them. It is a major concern in the academic environment in particular, where it can affect both the credibility of an institution and its ability to ensure the quality of its students. In other words, plagiarism may reduce creativity in the community. This research combines the fingerprint method with a natural language processing (NLP) approach. Plagiarism detection can be carried out by various methods, such as Manber's algorithm with similarity computed by the Jaccard coefficient, and the k-gram method as an alternative for detecting document similarity. The combination is expected to let a user run the application without choosing the gram value and its window, and still obtain an accurate similarity value. Although NLP techniques have been shown to improve the accuracy of detection tasks, other challenges remain. Current plagiarism detection tools are mostly limited to string-level comparisons between suspected plagiarised texts and potential original texts. With stemming applied, the measured document similarity increased by 31% for the documents tested.
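The fingerprint-and-Jaccard idea named in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract gives no parameters, so the function names, the k-gram length `k`, the modulus `p`, and the use of MD5 as the hash are all hypothetical choices. The selection rule (keep hashes equal to 0 mod `p`) follows the usual description of Manber-style fingerprinting. Stemming is left out because the abstract does not name a stemmer; it would be applied to the text before k-grams are extracted.

```python
import hashlib


def kgrams(text, k):
    """Return overlapping character k-grams of the normalised text.

    Normalisation here (lowercase, keep only letters and digits) is an
    assumption; the abstract does not describe the preprocessing steps.
    """
    cleaned = "".join(ch.lower() for ch in text if ch.isalnum())
    return [cleaned[i:i + k] for i in range(len(cleaned) - k + 1)]


def fingerprint(text, k=5, p=4):
    """Manber-style fingerprint: hash every k-gram, keep hashes h with h % p == 0.

    k and p are illustrative defaults. With p=1 every hash is kept.
    """
    hashes = {
        int(hashlib.md5(gram.encode("utf-8")).hexdigest(), 16)
        for gram in kgrams(text, k)
    }
    return {h for h in hashes if h % p == 0}


def jaccard(a, b):
    """Jaccard coefficient |A ∩ B| / |A ∪ B|; defined as 0.0 when both sets are empty."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


def similarity(doc_a, doc_b, k=5, p=4):
    """Similarity of two documents as the Jaccard coefficient of their fingerprints."""
    return jaccard(fingerprint(doc_a, k, p), fingerprint(doc_b, k, p))
```

A call such as `similarity(suspect_text, source_text)` returns a value between 0 and 1. Larger `p` keeps fewer hashes, which shrinks the fingerprint at the cost of a coarser estimate.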
Pages: 3128-3131
Number of pages: 4
Related papers
  • [31] AST-based Multi-language Plagiarism Detection Method
    Zhang, Li Ping
    Liu, Dong Sheng
    PROCEEDINGS OF 2013 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2012, : 738 - 742
  • [32] Anomaly Detection of System Logs Based on Natural Language Processing and Deep Learning
    Wang, Mengying
    Xu, Lele
    Guo, Lili
    2018 4TH INTERNATIONAL CONFERENCE ON FRONTIERS OF SIGNAL PROCESSING (ICFSP 2018), 2018, : 140 - 144
  • [33] Detecting Weak Signals of the Future: A System Implementation Based on Text Mining and Natural Language Processing
    Griol-Barres, Israel
    Milla, Sergio
    Cebrian, Antonio
    Fan, Huaan
    Millet, Jose
    SUSTAINABILITY, 2020, 12 (19)
  • [34] AN EXPERT SYSTEM APPROACH TO NATURAL-LANGUAGE PROCESSING
    METZLER, DP
    NOREAULT, T
    HAAS, DF
    COSIC, C
    PROCEEDINGS OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1985, 22 : 301 - 307
  • [35] Text detection method in document images based on multiresolution analysis
    Lee, Geum-Boon
    Shin, Dong-Guk
    Cho, Beom-Joon
    WMSCI 2007 : 11TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, POST CONFERENCE ISSUE, PROCEEDINGS, 2007, : 200 - +
  • [36] A plagiarism Detection System for Malayalam Text based documents with Full and Partial Copy
    Sindhu, L.
    Idicula, Sumam Mary
    1ST GLOBAL COLLOQUIUM ON RECENT ADVANCEMENTS AND EFFECTUAL RESEARCHES IN ENGINEERING, SCIENCE AND TECHNOLOGY - RAEREST 2016, 2016, 25 : 372 - 377
  • [37] An effective text plagiarism detection system based on feature selection and SVM techniques
    Mohamed A. El-Rashidy
    Ramy G. Mohamed
    Nawal A. El-Fishawy
    Marwa A. Shouman
    Multimedia Tools and Applications, 2024, 83 : 2609 - 2646
  • [38] An effective text plagiarism detection system based on feature selection and SVM techniques
    El-Rashidy, Mohamed A. A.
    Mohamed, Ramy G. G.
    El-Fishawy, Nawal A. A.
    Shouman, Marwa A. A.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (1) : 2609 - 2646
  • [39] Integrated natural language processing method for text mining and visualization of underground engineering text reports
    Shao, Ruiqi
    Lin, Peng
    Xu, Zhenhao
    AUTOMATION IN CONSTRUCTION, 2024, 166
  • [40] Source Code Plagiarism Detection and Performance Analysis Using Fingerprint Based Distance Measure Method
    Narayanan, Sandhya
    Simi, S.
    PROCEEDINGS OF 2012 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, VOLS I-VI, 2012, : 1065 - 1068