A Diversified Feature Extraction Approach for Program Similarity Analysis

被引:1
|
作者
Wang, Ying [1 ]
Jin, Dahai [2 ]
Gong, Yunzhan [3 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 18311026809, Peoples R China
[2] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 13020034471, Peoples R China
[3] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 61198028, Peoples R China
基金
中国国家自然科学基金;
关键词
Similarity detection; code plagiarism; feature extraction;
D O I
10.1145/3305160.3305189
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As code plagiarism becomes more and more prevalent, the need for code similarity detection technology is growing greatly. The feature of program is the basic unit that can represent the procedure and structure. Therefore, the quality of the feature will directly impact the accuracy of the similarity detection results. In this paper, we propose a diversified feature extraction approach, which extracts feature information from attribute counting, statement structure, program structure and program function. In the process of feature extraction, we comprehensively consider multiple factors of program, such as program structure, semantics and data flow. Evaluation results shows that this approach can eliminate the interference caused by multiple plagiarism methods, and it also has certain improvement in accuracy and detection efficiency.
引用
收藏
页码:96 / 101
页数:6
相关论文
共 50 条
  • [21] Channel pruning method driven by similarity of feature extraction capability
    Sun, Chuanmeng
    Chen, Jiaxin
    Li, Yong
    Wang, Yu
    Ma, Tiehua
    Soft Computing, 2025, 29 (02) : 1207 - 1226
  • [22] Benchmark of Multiple Approaches for Feature Extraction and Image Similarity Characterization
    Yang, Chunlei
    Ga, Yuli
    Fan, Jianping
    IMAGING AND PRINTING IN A WEB 2.0 WORLD; AND MULTIMEDIA CONTENT ACCESS: ALGORITHMS AND SYSTEMS IV, 2010, 7540
  • [23] Improved Feature Extraction and Similarity Algorithm for Video Object Detection
    You, Haotian
    Lu, Yufang
    Tang, Haihua
    INFORMATION, 2023, 14 (02)
  • [24] A divide-and-conquer approach to contour extraction and invariant feature analysis
    Gavrilova, Marina
    Apu, Russel
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2010, 31 (22) : 5813 - 5835
  • [25] An adaptive window size selection approach for feature extraction in texture analysis
    Sheng, W
    Xu, CX
    Liu, JA
    IMAGE EXTRACTION, SEGMENTATION, AND RECOGNITION, 2001, 4550 : 228 - 233
  • [26] Neurosphere fate prediction: an analysis-synthesis approach for feature extraction
    Rigaud, Stephane U.
    Lomenie, Nicolas
    Sankaran, Shvetha
    Ahmed, Sohail
    Lim, Joo-Hwee
    Racoceanu, Daniel
    2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [27] An automatic approach of audio feature engineering for the extraction, analysis and selection of descriptors
    Marvin Jiménez
    Jose Aguilar
    Julin Monsalve-Pulido
    Edwin Montoya
    International Journal of Multimedia Information Retrieval, 2021, 10 : 33 - 42
  • [28] An approach based on wavelet analysis for feature extraction in the a-wave of the electroretinogram
    Barraco, R.
    Adorno, D. Persano
    Brai, M.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2011, 104 (03) : 316 - 324
  • [29] AN OPINION FEATURE EXTRACTION APPROACH BASED ON A MULTIDIMENSIONAL SENTENCE ANALYSIS MODEL
    Guo, Jiunn-Liang
    Peng, Jhih-En
    Wang, Hei-Chia
    CYBERNETICS AND SYSTEMS, 2013, 44 (05) : 379 - 401
  • [30] An automatic approach of audio feature engineering for the extraction, analysis and selection of descriptors
    Jimenez, Marvin
    Aguilar, Jose
    Monsalve-Pulido, Julin
    Montoya, Edwin
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2021, 10 (01) : 33 - 42