Complexity-Based Code Embeddings

Cited by: 1
Authors
Folea, Rares [1]
Iacob, Radu [1]
Slusanschi, Emil [1]
Rebedea, Traian [1]
Affiliations
[1] Univ Politehn Bucuresti, Bucharest, Romania
Keywords
algorithm classification; code embeddings; complexity; tree-based classification;
DOI
10.1007/978-3-031-41456-5_20
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a generic method for transforming the source code of various algorithms into numerical embeddings, by dynamically analysing the behaviour of computer programs on different inputs and by fitting multiple generic complexity functions to the analysed metrics. The algorithm embeddings used are based on r-Complexity [7]. Using the proposed code embeddings, we present an implementation of the XGBoost algorithm that achieves an average F1-score of 90% on a multi-label dataset with 11 classes, built from real-world code snippets submitted to programming competitions on the Codeforces platform.
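
The abstract describes the pipeline only at a high level: measure a program's behaviour at several input sizes, fit generic complexity functions to each measured metric, and use the fitted parameters as an embedding. The following is a minimal sketch of that general idea, not the authors' r-Complexity implementation. The candidate function family, the operation-count metric, and the names `CANDIDATES`, `fit_scale`, `embed`, `best_fit`, and `count_pair_comparisons` are all assumptions made for illustration.

```python
import numpy as np

# Hypothetical family of generic growth functions g(n); the paper's actual
# family is not given in the abstract.
CANDIDATES = {
    "log n": lambda n: np.log2(n),
    "n": lambda n: n,
    "n log n": lambda n: n * np.log2(n),
    "n^2": lambda n: n ** 2,
}


def fit_scale(g, y):
    """Least-squares fit of y ~ c * g; returns (c, relative residual)."""
    c = float(g @ y / (g @ g))
    residual = float(np.linalg.norm(y - c * g) / np.linalg.norm(y))
    return c, residual


def embed(sizes, metric):
    """Concatenate (scale, residual) for every candidate into one vector."""
    sizes = np.asarray(sizes, dtype=float)
    metric = np.asarray(metric, dtype=float)
    features = []
    for growth in CANDIDATES.values():
        features.extend(fit_scale(growth(sizes), metric))
    return np.array(features)


def best_fit(sizes, metric):
    """Name of the candidate with the smallest relative residual."""
    sizes = np.asarray(sizes, dtype=float)
    metric = np.asarray(metric, dtype=float)
    residuals = {
        name: fit_scale(growth(sizes), metric)[1]
        for name, growth in CANDIDATES.items()
    }
    return min(residuals, key=residuals.get)


def count_pair_comparisons(n):
    """Toy 'dynamic analysis': count the inner-loop steps of an all-pairs scan."""
    steps = 0
    for i in range(n):
        for _ in range(i + 1, n):
            steps += 1
    return steps


sizes = [16, 32, 64, 128, 256]
metric = [count_pair_comparisons(n) for n in sizes]
embedding = embed(sizes, metric)
print(best_fit(sizes, metric), embedding.shape)
```

In a full pipeline, one such vector per measured metric would be concatenated and passed to a tree-based classifier such as XGBoost; that step is omitted here because the abstract gives no details about the features or hyperparameters used.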
Pages: 256-269
Number of pages: 14