Advancing Software Vulnerability Scoring: A Statistical Approach with Machine Learning Techniques and GridSearchCV Parameter Tuning

被引:0
|
作者
Birendra Kumar Verma
Ajay Kumar Yadav
机构
[1] Banasthali Vidyapith,
[2] JSS Academy of Technical Education,undefined
关键词
Statistical technique; GridSearchCV; Software vulnerability scoring;
D O I
10.1007/s42979-024-02942-x
中图分类号
学科分类号
摘要
The growing complexity, diversity, and importance of software pose a significant threat to computer system security due to exploitable software vulnerabilities. Important infrastructure systems, including banking, electricity, healthcare, and the military, are at risk of loss due to these vulnerabilities that permit unwanted access. This study investigates statistical features that contribute to improved outcomes, even though existing approaches primarily utilize natural language processing for vulnerability descriptions. We present an innovative scoring method that incorporates six well-known machine learning techniques: Linear Regressor, Decision Tree Regressor, Random Forest Regressor, K Nearest Neighbors Regressor, AdaBoost Regressor, and Support Vector Regressor into its framework. An assessment is conducted on 159,979 vulnerabilities obtained from the National Vulnerability Database using six metrics: explained variance, mean absolute error, mean squared log error, R-squared, root mean squared error, and mean squared error. GridSearchCV and tenfold cross-validation have validated the Random Forest Regressor as superior, yielding an accuracy of 0.9486. This approach demonstrates promise in proactive risk management across multiple sectors, including healthcare, energy, defense, and finance, by integrating machine learning techniques and statistical features.
引用
收藏
相关论文
共 50 条
  • [21] Deep Learning and Machine Learning Techniques for Credit Scoring: A Review
    Wube, Hana Demma
    Esubalew, Sintayehu Zekarias
    Weldesellasie, Firesew Fayiso
    Debelee, Taye Girma
    [J]. PAN-AFRICAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PT II, PANAFRICON AI 2023, 2024, 2069 : 30 - 61
  • [22] Parameter Identifiability in Statistical Machine Learning: A Review
    Ran, Zhi-Yong
    Hu, Bao-Gang
    [J]. NEURAL COMPUTATION, 2017, 29 (05) : 1151 - 1203
  • [23] A Comparative Analysis of Machine Learning Techniques for Credit Scoring
    Nwulu, Nnamdi I.
    Oroja, Shola
    Ilkan, Mustafa
    [J]. INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (10): : 4129 - 4145
  • [24] Machine Learning: A Practical Approach on the Statistical Learning
    Liu, Shin Ta
    [J]. TECHNOMETRICS, 2020, 62 (04) : 560 - 561
  • [25] Software Vulnerability Analysis and Discovery Using Machine-Learning and Data-Mining Techniques: A Survey
    Ghaffarian, Seyed Mohammad
    Shahriari, Hamid Reza
    [J]. ACM COMPUTING SURVEYS, 2017, 50 (04)
  • [26] Incorporating statistical and machine learning techniques into the optimization of correction factors for software development effort estimation
    Ho Le Thi Kim Nhung
    Vo Van Hai
    Silhavy, Petr
    Prokopova, Zdenka
    Silhavy, Radek
    [J]. JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2024, 36 (05)
  • [27] Incorporating statistical and machine learning techniques into the optimization of correction factors for software development effort estimation
    Nhung, Ho Le Thi Kim
    Van Hai, Vo
    Silhavy, Petr
    Prokopova, Zdenka
    Silhavy, Radek
    [J]. JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2023,
  • [28] Software Modernization Using Machine Learning Techniques
    Somogyi, Norbert
    Kovesdan, Gabor
    [J]. 2021 IEEE 19TH WORLD SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI 2021), 2021, : 361 - 365
  • [29] Machine Learning and Statistical Analysis Techniques on Terrorism
    Rajesh, P.
    Babitha, D.
    Alam, Mansoor
    Tahernezhadi, Mansour
    Monika, A.
    [J]. FUZZY SYSTEMS AND DATA MINING VI, 2020, 331 : 210 - 222
  • [30] The rise of software vulnerability: Taxonomy of software vulnerabilities detection and machine learning approaches
    Hanif, Hazim
    Nasir, Mohd Hairul Nizam Md
    Ab Razak, Mohd Faizal
    Firdaus, Ahmad
    Anuar, Nor Badrul
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2021, 179