Toward Large-Scale Vulnerability Discovery using Machine Learning

被引:146
|
作者
Grieco, Gustavo [1 ]
Grinblat, Guillermo Luis [1 ]
Uzal, Lucas [1 ]
Rawat, Sanjay [2 ,4 ]
Feist, Josselin [3 ]
Mounier, Laurent [3 ]
机构
[1] CIFASIS CONICET, Rosario, Santa Fe, Argentina
[2] Vrije Univ Amsterdam, Syst Secur Grp, Amsterdam, Netherlands
[3] Univ Grenoble Alps, VERIMAG, Grenoble, France
[4] IIIT Hyderabad, Hyderabad, Telangana, India
关键词
D O I
10.1145/2857705.2857720
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With sustained growth of software complexity, finding security vulnerabilities in operating systems has become an important necessity. Nowadays, OS are shipped with thousands of binary executables. Unfortunately, methodologies and tools for an OS scale program testing within a limited time budget are still missing. In this paper we present an approach that uses lightweight static and dynamic features to predict if a test case is likely to contain a software vulnerability using machine learning techniques. To show the effectiveness of our approach, we set up a large experiment to detect easily exploitable memory corruptions using 1039 Debian programs obtained from its bug tracker, collected 138,308 unique execution traces and statically explored 76,083 different subsequences of function calls. We managed to predict with reasonable accuracy which programs contained dangerous memory corruptions. We also developed and implemented VDiscovER, a tool that uses state-of-the-art Machine Learning techniques to predict vulnerabilities in test cases. Such tool will be released as open-source to encourage the research of vulnerability discovery at a large scale, together with VDISCOVERY, a public dataset that collects raw analyzed data.
引用
收藏
页码:85 / 96
页数:12
相关论文
共 50 条
  • [1] Toward Smarter Vulnerability Discovery Using Machine Learning
    Grieco, Gustavo
    Dinaburg, Artem
    AISEC'18: PROCEEDINGS OF THE 11TH ACM WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, 2018, : 48 - 56
  • [2] Toward Large-Scale Riverine Phosphorus Estimation Using Remote Sensing and Machine Learning
    Ramtel, Pradeep
    Feng, Dongmei
    Gardner, John
    JOURNAL OF GEOPHYSICAL RESEARCH-BIOGEOSCIENCES, 2024, 129 (08)
  • [3] Toward Large-Scale Learning Design
    Davis, Dan
    Seaton, Daniel
    Hauff, Claudia
    Houben, Geert-Jan
    PROCEEDINGS OF THE FIFTH ANNUAL ACM CONFERENCE ON LEARNING AT SCALE (L@S'18), 2018,
  • [4] A machine learning workflow for large-scale discovery of direct bandgap double perovskites
    Chen, Yuzhi
    Liu, Hongyu
    Fang, Xu
    Li, Yuanhua
    Chen, Jing
    Peng, Lin
    Liu, Xiaolin
    Lin, Jia
    SOLAR ENERGY MATERIALS AND SOLAR CELLS, 2025, 282
  • [5] A Survey on Large-Scale Machine Learning
    Wang, Meng
    Fu, Weijie
    He, Xiangnan
    Hao, Shijie
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (06) : 2574 - 2594
  • [6] Toward Robust Anxiety Biomarkers: A Machine Learning Approach in a Large-Scale Sample
    Boeke, Emily A.
    Holmes, Avram J.
    Phelps, Elizabeth A.
    BIOLOGICAL PSYCHIATRY-COGNITIVE NEUROSCIENCE AND NEUROIMAGING, 2020, 5 (08) : 799 - 807
  • [7] Configuring large-scale storage using a middleware with machine learning
    Eyers, David M.
    Routray, Ramani
    Zhang, Rui
    Willcocks, Douglas
    Pietzuch, Peter
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2011, 23 (17): : 2063 - 2077
  • [8] Toward automated large-scale information integration and discovery
    Brown, P
    Haas, P
    Myllymaki, J
    Pirahesh, H
    Reinwald, B
    Sismanis, Y
    DATA MANAGEMENT IN A CONNECTED WORLD: ESSAYS DEDICATED TO HARTMUT WEDEKIND ON THE OCCASION OF HIS 70TH BIRTHDAY, 2005, 3551 : 161 - 180
  • [9] Machine learning enabled pattern discovery in large-scale spatial gene expression datasets
    Abbasi-Asl, Reza
    JOURNAL OF COMPUTATIONAL NEUROSCIENCE, 2024, 52 : S18 - S18
  • [10] Machine learning enabled pattern discovery in large-scale spatial gene expression datasets
    Abbasi-Asl, Reza
    JOURNAL OF COMPUTATIONAL NEUROSCIENCE, 2024, 52 : S18 - S18