Cross-Language Plagiarism Detection Model Based On Multiple Features

Cited by: 0
Authors
Liu, Gang [1, 2]
Dong, Yichao [1 ]
Li, Guangxi [1 ]
Affiliations
[1] Harbin Engineering University, College of Computer Science and Technology, Harbin, People's Republic of China
[2] Nanjing University, State Key Laboratory for Novel Software Technology, Nanjing, People's Republic of China
Keywords
Feature Selection; Candidate Retrieval; Translation Features; Cross-Language; Dictionary
DOI
10.1109/ISCC53001.2021.9631406
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
As information sharing becomes ever more convenient, plagiarism is becoming increasingly common. Cross-language plagiarism detection is an important problem that the academic community is working collectively to solve. In this paper, a cross-language plagiarism detection model based on multiple features is proposed. It comprises two stages: cross-language plagiarism candidate retrieval based on multiple features, and cross-language plagiarism detection based on dynamic text alignment. Candidate retrieval relies mainly on translation features. For detection, a similarity analysis based on text alignment is used to filter the final results between the identified paragraphs. In this step, our approach does not use a machine translation system to convert longer text; instead, it uses a dictionary to obtain the translation of each single word. Experimental results show that our method outperforms previous methods and achieves the best results on four datasets.
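The idea of translating word by word with a dictionary, rather than running a machine translation system, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the toy Spanish-to-English dictionary, the function names (`overlap_score`, `retrieve_candidates`), and the simple overlap score are all assumptions made for illustration, and the abstract does not specify the actual translation features or the alignment procedure.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy Spanish-to-English dictionary (an assumption for this sketch).
# A real system would load a full bilingual lexicon.
TOY_DICT: Dict[str, Set[str]] = {
    "gato": {"cat"},
    "negro": {"black", "dark"},
    "duerme": {"sleeps"},
    "casa": {"house", "home"},
}


def overlap_score(
    suspicious_tokens: List[str],
    candidate_tokens: List[str],
    dictionary: Dict[str, Set[str]],
) -> float:
    """Fraction of suspicious tokens with at least one dictionary
    translation that appears in the candidate document."""
    if not suspicious_tokens:
        return 0.0
    candidate_vocab = set(candidate_tokens)
    hits = sum(
        1
        for token in suspicious_tokens
        if dictionary.get(token, set()) & candidate_vocab
    )
    return hits / len(suspicious_tokens)


def retrieve_candidates(
    suspicious_text: str,
    corpus: Dict[str, str],
    dictionary: Dict[str, Set[str]],
    top_k: int = 2,
) -> List[Tuple[str, float]]:
    """Rank corpus documents by dictionary-based word overlap."""
    suspicious_tokens = suspicious_text.lower().split()
    scored = [
        (doc_id, overlap_score(suspicious_tokens, text.lower().split(), dictionary))
        for doc_id, text in corpus.items()
    ]
    # Highest score first; Python's sort is stable, so ties keep corpus order.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


corpus = {
    "d1": "the black cat sleeps at home",
    "d2": "stock prices rose sharply today",
}
ranking = retrieve_candidates("el gato negro duerme en casa", corpus, TOY_DICT)
```

In this sketch, each source word maps to a set of possible translations, so no sentence-level translation is ever produced. A later alignment stage, not shown here, would then compare the retrieved candidates passage by passage.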
Pages: 7