Predicting Vulnerable Components via Text Mining or Software Metrics? An Effort-aware Perspective

被引:23
|
作者
Tang, Yaming [1 ]
Zhao, Fei [1 ]
Yang, Yibiao [1 ]
Lu, Hongmin [1 ]
Zhou, Yuming [1 ]
Xu, Baowen [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China
关键词
software metrics; text mining; vulnerability; prediction; effort-aware;
D O I
10.1109/QRS.2015.15
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In order to identify vulnerable software components, developers can take software metrics as predictors or use text mining techniques to build vulnerability prediction models. A recent study reported that text mining based models have higher recall than software metrics based models. However, this conclusion was drawn without considering the sizes of individual components which affects the code inspection effort to determine whether a component is vulnerable. In this paper, we investigate the predictive power of these two kinds of prediction models in the context of effort-aware vulnerability prediction. To this end, we use the same data sets, containing 223 vulnerabilities found in three web applications, to build vulnerability prediction models. The experimental results show that: (1) in the context of effort-aware ranking scenario, text mining based models only slightly outperform software metrics based models; (2) in the context of effort-aware classification scenario, text mining based models perform similarly to software metrics based models in most cases; and (3) most of the effect sizes (i.e. the magnitude of the differences) between these two kinds of models are trivial. These results suggest that, from the viewpoint of practical application, software metrics based models are comparable to text mining based models. Therefore, for developers, software metrics based models are practical choices for vulnerability prediction, as the cost to build and apply these models is much lower.
引用
收藏
页码:27 / 36
页数:10
相关论文
共 39 条
  • [1] Predicting Vulnerable Components: Software Metrics vs Text Mining
    Walden, James
    Stuckman, Jeff
    Scandariato, Riccardo
    [J]. 2014 IEEE 25TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE), 2014, : 23 - 33
  • [2] Predicting Vulnerable Software Components via Text Mining
    Scandariato, Riccardo
    Walden, James
    Hovsepyan, Aram
    Joosen, Wouter
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2014, 40 (10) : 993 - 1006
  • [3] On effort-aware metrics for defect prediction
    Jonida Çarka
    Matteo Esposito
    Davide Falessi
    [J]. Empirical Software Engineering, 2022, 27
  • [4] On effort-aware metrics for defect prediction
    Carka, Jonida
    Esposito, Matteo
    Falessi, Davide
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2022, 27 (06)
  • [5] Learning to rank software modules for effort-aware defect prediction
    Rao, Jiqing
    Yu, Xiao
    Zhang, Chen
    Zhou, Junwei
    Xiang, Jianwen
    [J]. 2021 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C 2021), 2021, : 372 - 380
  • [6] Predicting Vulnerable Software Components
    Neuhaus, Stephan
    Zimmermann, Thomas
    Holler, Christian
    Zeller, Andreas
    [J]. CCS'07: PROCEEDINGS OF THE 14TH ACM CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2007, : 529 - +
  • [7] A Novel Effort Measure Method for Effort-Aware Just-in-Time Software Defect Prediction
    Chen, Liqiong
    Song, Shilong
    Wang, Can
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2021, 31 (08) : 1145 - 1169
  • [8] Improving effort-aware defect prediction by directly learning to rank software modules
    Yu, Xiao
    Rao, Jiqing
    Liu, Lei
    Lin, Guancheng
    Hu, Wenhua
    Keung, Jacky Wai
    Zhou, Junwei
    Xiang, Jianwen
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2024, 165
  • [9] Extending Developer Experience Metrics for Better Effort-Aware Just-In-Time Defect Prediction
    Cho, Yeongjun
    Kwon, Jung-Hyun
    Yi, Jooyong
    Ko, In-Young
    [J]. IEEE ACCESS, 2022, 10 : 128218 - 128231
  • [10] MULTI: Multi-objective effort-aware just-in-time software defect prediction
    Chen, Xiang
    Zhao, Yingquan
    Wang, Qiuping
    Yuan, Zhidan
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2018, 93 : 1 - 13