A tree-based machine learning methodology to automatically classify software vulnerabilities

被引:4
|
作者
Aivatoglou, Georgios [1 ]
Anastasiadis, Mike [1 ]
Spanos, Georgios [1 ]
Voulgaridis, Antonis [1 ]
Votis, Konstantinos [1 ]
Tzovaras, Dimitrios [1 ]
机构
[1] Informat Technol Inst, Ctr Res & Technol Hellas, Thessaloniki, Greece
基金
欧盟地平线“2020”;
关键词
Software Vulnerability categorization; Cyber-security; Machine Learning; Decision Trees; Random Forests; Gradient Boosting;
D O I
10.1109/CSR51186.2021.9527965
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Software vulnerabilities have become a major problem for the security analysts, since the number of new vulnerabilities is constantly growing. Thus, there was a need for a categorization system, in order to group and handle these vulnerabilities in a more efficient way. Hence, the MITRE corporation introduced the Common Weakness Enumeration that is a list of the most common software and hardware vulnerabilities. However, the manual task of understanding and analyzing new vulnerabilities by security experts, is a very slow and exhausting process. For this reason, a new automated classification methodology is introduced in this paper, based on the vulnerability textual descriptions from National Vulnerability Database. The proposed methodology, combines textual analysis and tree-based machine learning techniques in order to classify vulnerabilities automatically. The results of the experiments showed that the proposed methodology performed pretty well achieving an overall accuracy close to 80%.
引用
收藏
页码:312 / 317
页数:6
相关论文
共 50 条
  • [41] Fundamental error in tree-based machine learning model selection for reservoir characterisation
    Daniel Asante Otchere
    Energy Geoscience, 2024, 5 (02) : 218 - 228
  • [42] Detection of financial fraud: comparisons of some tree-based machine learning approaches
    Kausik Sengupta
    Pradyot Kumar Das
    Journal of Data, Information and Management, 2023, 5 (1-2): : 23 - 37
  • [43] Boosting Insights in Insurance Tariff Plans with Tree-Based Machine Learning Methods
    Henckaerts, Roel
    Cote, Marie-Pier
    Antonio, Katrien
    Verbelen, Roel
    NORTH AMERICAN ACTUARIAL JOURNAL, 2021, 25 (02) : 255 - 285
  • [44] Tree-Based Machine Learning Techniques for Automated Human Sleep Stage Classification
    Arslan, Recep Sinan
    Ulutas, Hasan
    Koksal, Ahmet Sertol
    Bakir, Mehmet
    Ciftci, Bulent
    TRAITEMENT DU SIGNAL, 2023, 40 (04) : 1385 - 1400
  • [45] A Comparative Analysis of Tree-based Machine Learning Algorithms for Breast Cancer Detection
    A'la, Fiddin Yusfida
    Permanasari, Adhistya Erna
    Setiawan, Noor Akhmad
    PROCEEDINGS OF 2019 12TH INTERNATIONAL CONFERENCE ON INFORMATION & COMMUNICATION TECHNOLOGY AND SYSTEM (ICTS), 2019, : 55 - 59
  • [46] Tree-Based Transforms for Privileged Learning
    Moradi, Mehdi
    Syeda-Mahmood, Tanveer
    Hor, Soheil
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2016, 2016, 10019 : 188 - 195
  • [47] Regression tree-based active learning
    Ashna Jose
    João Paulo Almeida de Mendonça
    Emilie Devijver
    Noël Jakse
    Valérie Monbet
    Roberta Poloni
    Data Mining and Knowledge Discovery, 2024, 38 : 420 - 460
  • [48] On the Netlist Gate-level Pruning for Tree-based Machine Learning Accelerators
    de Abreu, Brunno A.
    Paim, Guilherme
    Castro-Godinez, Jorge
    Grellert, Mateus
    Bampi, Sergio
    2022 IEEE 13TH LATIN AMERICAN SYMPOSIUM ON CIRCUITS AND SYSTEMS (LASCAS), 2022, : 21 - 24
  • [49] Decision Tree-based Machine Learning Algorithm for In-node Vehicle Classification
    Ying, Kyle
    Ameri, Alireza
    Trivedi, Ankit
    Ravindra, Dilip
    Patel, Darshan
    Mozumdar, Mohammad
    2015 IEEE GREEN ENERGY AND SYSTEMS CONFERENCE (IGESC), 2015, : 71 - 76
  • [50] Fundamental error in tree-based machine learning model selection for reservoir characterisation
    Otchere, Daniel Asante
    ENERGY GEOSCIENCE, 2024, 5 (02):