Supervised Deep Features for Software Functional Clone Detection by Exploiting Lexical and Syntactical Information in Source Code

被引:0
|
作者
Wei, Hui-Hui [1 ]
Li, Ming [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Collaborat Innovat Ctr Novel Software Technol & I, Nanjing 210023, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software clone detection, aiming at identifying out code fragments with similar functionalities, has played an important role in software maintenance and evolution. Many clone detection approaches have been proposed. However, most of them represent source codes with hand-crafted features using lexical or syntactical information, or unsupervised deep features, which makes it difficult to detect the functional clone pairs, i.e., pieces of codes with similar functionality but differing in both syntactical and lexical level. In this paper, we address the software functional clone detection problem by learning supervised deep features. We formulate the clone detection as a supervised learning to hash problem and propose an end-to-end deep feature learning framework called CDLH for functional clone detection. Such framework learns hash codes by exploiting the lexical and syntactical information for fast computation of functional similarity between code fragments. Experiments on software clone detection benchmarks indicate that the CDLH approach is effective and outperforms the state-of-the-art approaches in software functional clone detection.
引用
收藏
页码:3034 / 3040
页数:7
相关论文
共 10 条
  • [1] Deep code comment generation with hybrid lexical and syntactical information
    Xing Hu
    Ge Li
    Xin Xia
    David Lo
    Zhi Jin
    [J]. Empirical Software Engineering, 2020, 25 : 2179 - 2217
  • [2] Deep code comment generation with hybrid lexical and syntactical information
    Hu, Xing
    Li, Ge
    Xia, Xin
    Lo, David
    Jin, Zhi
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2020, 25 (03) : 2179 - 2217
  • [3] DeepCommenter: A Deep Code Comment Generation Tool with Hybrid Lexical and Syntactical Information
    Li, Boao
    Yan, Meng
    Xia, Xin
    Hu, Xing
    Li, Ge
    Lo, David
    [J]. PROCEEDINGS OF THE 28TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '20), 2020, : 1571 - 1575
  • [4] CLCDSA: Cross Language Code Clone Detection using Syntactical Features and API Documentation
    Nafi, Kawser Wazed
    Kar, Tonny Shekha
    Roy, Banani
    Roy, Chanchal K.
    Schneider, Kevin A.
    [J]. 34TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE 2019), 2019, : 1026 - 1037
  • [5] Review Sharing via Deep Semi-Supervised Code Clone Detection
    Guo, Chenkai
    Yang, Hui
    Huang, Dengrong
    Zhang, Jianwen
    Dong, Naipeng
    Xu, Jing
    Zhu, Jingwen
    [J]. IEEE ACCESS, 2020, 8 (24948-24965) : 24948 - 24965
  • [6] Recovering Variability Information from Source Code of Clone-and-Own Software Systems
    Schlie, Alexander
    Schulze, Sandro
    Schaefer, Ina
    [J]. PROCEEDINGS OF THE 14TH INTERNATIONAL WORKING CONFERENCE ON VARIABILITY MODELLING OF SOFTWARE-INTENSIVE SYSTEMS (VAMOS '20), 2020,
  • [7] Java']Java Code Clone Detection by Exploiting Semantic and Syntax Information From Intermediate Code-Based Graph
    Yuan, Dawei
    Fang, Sen
    Zhang, Tao
    Xu, Zhou
    Luo, Xiapu
    [J]. IEEE TRANSACTIONS ON RELIABILITY, 2023, 72 (02) : 511 - 526
  • [8] Find Me if You Can: Deep Software Clone Detection by Exploiting the Contest between the Plagiarist and the Detector
    Zhang, Yan-Ya
    Li, Ming
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5813 - 5820
  • [9] An Empirical Study on Vulnerability Detection for Source Code Software based on Deep Learning
    Lin, Wei
    Cai, Saihua
    [J]. 2021 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C 2021), 2021, : 1159 - 1160
  • [10] Deep Learning-Based Software Defect Prediction via Semantic Key Features of Source Code-Systematic Survey
    Abdu, Ahmed
    Zhai, Zhengjun
    Algabri, Redhwan
    Abdo, Hakim A.
    Hamad, Kotiba
    Al-antari, Mugahed A.
    [J]. MATHEMATICS, 2022, 10 (17)