Towards a Block-Level ML-Based Python']Python Vulnerability Detection Tool∗

被引:0
|
作者
Bagheri, Amirreza [1 ]
Hegedus, Peter [1 ]
机构
[1] Univ Szeged, Inst Informat, Szeged, Hungary
来源
ACTA CYBERNETICA | 2024年 / 26卷 / 03期
关键词
deep learning; vulnerability detection; source code embedding; data mining;
D O I
10.14232/actacyb.299667
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Computer software is driving our everyday life, therefore their security is pivotal. Unfortunately, security flaws are common in software systems, which can result in a variety of serious repercussions, including data loss, secret information disclosure, manipulation, or system failure. Although techniques for detecting vulnerable code exist, the improvement of their accuracy and effectiveness to a practically applicable level remains a challenge. Many existing methods require a substantial amount of human expert labor to develop attributes that indicate vulnerabilities. In previous work, we have shown that machine learning is suitable for solving the issue automatically by learning features from a vast collection of real-world code and predicting vulnerable code locations. Applying a BERT-based code embedding, LSTM models with the best hyperparameters were able to identify seven different security flaws in Python source code with high precision (average of 91%) and recall (average of 83%). Upon the encouraging first empirical results, we go beyond this paper and discuss the challenges of applying these models in practice and outlining a method that solves these issues. Our goal is to develop a hands-on tool for developers that they can use to pinpoint potentially vulnerable spots in their code.
引用
收藏
页码:323 / 371
页数:49
相关论文
共 50 条
  • [1] Web Vulnerability Detection Analyzer Based on Python']Python
    Xu, Dawei
    Chen, Tianxin
    Tan, Zhonghua
    Wu, Fudong
    Gao, Jiaqi
    Yang, Yunfan
    INTERNATIONAL JOURNAL OF DIGITAL CRIME AND FORENSICS, 2022, 14 (02)
  • [2] DetectVul: A statement-level code vulnerability detection for Python']Python
    Tran, Hoai-Chau
    Tran, Anh-Duy
    Le, Kim-Hung
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 163
  • [3] A block based editor for Python']Python
    Strong, Glenn
    O'Carroll, Sean
    Bresnihan, Nina
    WIPSCE'18: PROCEEDINGS OF THE 13TH WORKSHOP IN PRIMARY AND SECONDARY COMPUTING EDUCATION, 2018, : 151 - 152
  • [4] SecureQwen: Leveraging LLMs for vulnerability detection in python']python codebases
    Mechri, Abdechakour
    Ferrag, Mohamed Amine
    Debbah, Merouane
    COMPUTERS & SECURITY, 2025, 148
  • [5] PyTOPS: A Python']Python based tool for TOPSIS
    Yadav, Vinay
    Karmakar, Subhankar
    Kalbar, Pradip P.
    Dikshit, A. K.
    SOFTWAREX, 2019, 9 : 217 - 222
  • [6] VUDENC: Vulnerability Detection with Deep Learning on a Natural Codebase for Python']Python
    Wartschinski, Laura
    Noller, Yannic
    Vogel, Thomas
    Kehrer, Timo
    Grunske, Lars
    INFORMATION AND SOFTWARE TECHNOLOGY, 2022, 144
  • [7] Python']Python source code vulnerability detection with named entity recognition
    Ehrenberg, Melanie
    Sarkani, Shahram
    Mazzuchi, Thomas A.
    COMPUTERS & SECURITY, 2024, 140
  • [8] Machine Learning Techniques For Python']Python Source Code Vulnerability Detection
    Farasat, Talaya
    Posegga, Joachim
    PROCEEDINGS OF THE FOURTEENTH ACM CONFERENCE ON DATA AND APPLICATION SECURITY AND PRIVACY, CODASPY 2024, 2024, : 151 - 153
  • [9] A Python']Python Based EMT Model Quality Testing Tool
    Wang, Xiaoyu
    Huang, Shun Hsien
    Schmall, John
    Conto, Jose
    2021 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2021,
  • [10] APIScanner - Towards Automated Detection of Deprecated APIs in Python']Python Libraries
    Vadlamani, Aparna
    Kalicheti, Rishitha
    Chimalakonda, Sridhar
    2021 IEEE/ACM 43RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS (ICSE-COMPANION 2021), 2021, : 5 - 8