A Diversified Feature Extraction Approach for Program Similarity Analysis

Cited by: 1
Authors
Wang, Ying [1]
Jin, Dahai [2]
Gong, Yunzhan [3]
Affiliations
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 18311026809, Peoples R China
[2] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 13020034471, Peoples R China
[3] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 61198028, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Similarity detection; code plagiarism; feature extraction
DOI
10.1145/3305160.3305189
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
As code plagiarism becomes increasingly prevalent, the need for code similarity detection technology is growing. A program's features are the basic units that represent its procedure and structure, so the quality of the features directly affects the accuracy of similarity detection results. In this paper, we propose a diversified feature extraction approach that extracts feature information from four sources: attribute counting, statement structure, program structure, and program function. During feature extraction, we comprehensively consider multiple aspects of a program, such as its structure, semantics, and data flow. Evaluation results show that this approach can eliminate the interference caused by multiple plagiarism methods and that it also yields some improvement in accuracy and detection efficiency.
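Illustrative sketch (not taken from the paper): the abstract names attribute counting as one of the feature sources but gives no details, so the following is only a minimal example of the general idea. It assumes Python source as input, uses AST node-type counts as the "attributes", and compares two programs with cosine similarity. The function names `attribute_counts` and `cosine_similarity` and the sample programs are hypothetical; the authors' actual features and similarity measure may differ.

```python
import ast
import math
from collections import Counter


def attribute_counts(source: str) -> Counter:
    """Count AST node types in a Python program (assumed feature set).

    Identifier names are not part of the node type, so renaming
    variables or functions leaves the counts unchanged.
    """
    tree = ast.parse(source)
    return Counter(type(node).__name__ for node in ast.walk(tree))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical sample programs: `renamed` is `original` with every
# identifier changed, a common plagiarism disguise.
original = (
    "def total(xs):\n"
    "    s = 0\n"
    "    for x in xs:\n"
    "        s += x\n"
    "    return s\n"
)
renamed = (
    "def acc(items):\n"
    "    r = 0\n"
    "    for i in items:\n"
    "        r += i\n"
    "    return r\n"
)
different = "print('hello')\n"

sim_renamed = cosine_similarity(attribute_counts(original), attribute_counts(renamed))
sim_different = cosine_similarity(attribute_counts(original), attribute_counts(different))
```

Counting alone ignores statement order and data flow, which is presumably why the approach combines it with statement-structure, program-structure, and program-function features.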
Pages: 96-101
Page count: 6
Related papers
50 in total
  • [1] Similarity Feature Extraction of EEG
    Mu, Zhendong
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON MANAGEMENT, EDUCATION, INFORMATION AND CONTROL, 2015, 125 : 1250 - 1253
  • [2] Discriminant Analysis with Local Gaussian Similarity Preserving for Feature Extraction
    Xi Liu
    Zhengming Ma
    Neural Processing Letters, 2018, 47 : 39 - 55
  • [3] Discriminant Analysis with Local Gaussian Similarity Preserving for Feature Extraction
    Liu, Xi
    Ma, Zhengming
    NEURAL PROCESSING LETTERS, 2018, 47 (01) : 39 - 55
  • [4] Feature Extraction Using Semantic Similarity
    Aboelela, Eman M.
    Gad, Walaa
    Ismail, Rasha
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2019, 2020, 1058 : 82 - 91
  • [5] Automatic Extraction of Behavioral Features for Test Program Similarity Analysis
    De Angelis, Emanuele
    Pellegrini, Alessandro
    Proietti, Maurizio
    2021 IEEE INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW 2021), 2021, : 129 - 136
  • [6] Fusion of gradient and feature similarity for Keyframe extraction
    Mounika Bommisetty, Reddy
    Khare, Ashish
    Siddiqui, Tanveer J.
    Palanisamy, P.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (10) : 15429 - 15467
  • [7] Fusion of gradient and feature similarity for Keyframe extraction
    Reddy Mounika Bommisetty
    Ashish Khare
    Tanveer J. Siddiqui
    P. Palanisamy
    Multimedia Tools and Applications, 2021, 80 : 15429 - 15467
  • [8] SIMILARITY PRESERVING ANALYSIS BASED ON SPARSE REPRESENTATION FOR IMAGE FEATURE EXTRACTION AND CLASSIFICATION
    Liu, Qian
    Jing, Xiao-yuan
    Hu, Rui-min
    Yao, Yong-fang
    Yang, Jing-yu
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 3013 - 3016
  • [9] FEATURE-EXTRACTION IN COMPUTERIZED APPROACH TO THE ECG ANALYSIS
    PIETKA, E
    PATTERN RECOGNITION, 1991, 24 (02) : 139 - 146
  • [10] Modeling aircraft similarity with musical auditory feature extraction
    Mobley, Frank S.
    Bowers, Gregory
    Ugolini, Margaret
    Fox, Elizabeth
    Gillespie, Nathan
    APPLIED ACOUSTICS, 2023, 214