A Diversified Feature Extraction Approach for Program Similarity Analysis

被引:1
|
作者
Wang, Ying [1 ]
Jin, Dahai [2 ]
Gong, Yunzhan [3 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 18311026809, Peoples R China
[2] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 13020034471, Peoples R China
[3] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 61198028, Peoples R China
基金
中国国家自然科学基金;
关键词
Similarity detection; code plagiarism; feature extraction;
D O I
10.1145/3305160.3305189
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As code plagiarism becomes more and more prevalent, the need for code similarity detection technology is growing greatly. The feature of program is the basic unit that can represent the procedure and structure. Therefore, the quality of the feature will directly impact the accuracy of the similarity detection results. In this paper, we propose a diversified feature extraction approach, which extracts feature information from attribute counting, statement structure, program structure and program function. In the process of feature extraction, we comprehensively consider multiple factors of program, such as program structure, semantics and data flow. Evaluation results shows that this approach can eliminate the interference caused by multiple plagiarism methods, and it also has certain improvement in accuracy and detection efficiency.
引用
收藏
页码:96 / 101
页数:6
相关论文
共 50 条
  • [31] Concrete Crack Skeleton Analysis: A Machine Vision Approach to Feature Extraction
    Wang, Rui
    Guo, Xinxin
    Liu, Jiaxuan
    Wan, Yu
    ADVANCES IN CIVIL ENGINEERING, 2024, 2024
  • [32] DSTF: A Diversified Spatio-Temporal Feature Extraction Model for traffic flow prediction
    Wang, Xing
    Wang, Xiaojun
    Huang, Faliang
    Zou, Fumin
    Liao, Lyuchao
    Zeng, Ruihao
    NEUROCOMPUTING, 2025, 621
  • [33] LDA and Deep Learning: A Combined Approach for Feature Extraction and Sentiment Analysis
    Syamala, Maganti
    Nalini, N. J.
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [34] Diversified Occupations Program
    Smith, J. Warren
    TEACHERS COLLEGE RECORD, 1940, 42 (01): : 23 - 27
  • [35] Soft computing approach to feature extraction
    Li, CS
    Huang, JY
    Chen, CM
    FUZZY SETS AND SYSTEMS, 2004, 147 (01) : 119 - 140
  • [36] COMPLEMENTARY FEATURE EXTRACTION APPROACH IN CBIR
    Kumar, Kamlesh
    Li, Jian-Ping
    Zain-Ul-Abidin
    2015 12TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2015, : 192 - 197
  • [37] Product Feature Extraction with a Combined Approach
    Li, Zhixing
    2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 686 - 690
  • [38] A SEMANTIC APPROACH TO THE EXTRACTION OF FEATURE TERMS
    Angioni, Manuela
    Tuveri, Franco
    ICSOFT 2011: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SOFTWARE AND DATABASE TECHNOLOGIES, VOL 2, 2011, : 402 - 407
  • [39] A MapReduce Approach for SIFT Feature Extraction
    Han, Wei
    Kang, Yiding
    Chen, Yang
    Zhang, Xueqing
    2013 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CLOUDCOM-ASIA), 2013, : 465 - 469
  • [40] A hierarchical approach to feature extraction and grouping
    Foresti, GL
    Regazzoni, C
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2000, 9 (06) : 1056 - 1074