Automatically Identifying Security Bug Reports via Multitype Features Analysis

被引:8
|
作者
Zou, Deqing [1 ,2 ]
Deng, Zhijun [1 ]
Li, Zhen [1 ,3 ]
Jin, Hai [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Serv Comp Technol & Syst Lab, Big Data Technol & Syst Lab,Cluster & Grid Comp L, Wuhan 430074, Peoples R China
[2] Shenzhen Huazhong Univ Sci & Technol, Res Inst, Shenzhen, Peoples R China
[3] Hebei Univ, Sch Cyber Secur & Comp, Baoding, Peoples R China
来源
基金
美国国家科学基金会;
关键词
Security bug identification; Bug report; Natural language processing; Machine learning;
D O I
10.1007/978-3-319-93638-3_35
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bug-tracking systems are widely used by software developers to manage bug reports. Since it is time-consuming and costly to fix all the bugs, developers usually pay more attention to the bugs with higher impact, such as security bugs (i.e., vulnerabilities) which can be exploited by malicious users to launch attacks and cause great damages. However, manually identifying security bug reports from millions of reports in bug-tracking systems is difficult and error-prone. Furthermore, existing automated identification approaches to security bug reports often incur many false negatives, causing a hidden danger to the computer system. To address this important problem, we present an automatic security bug reports identification model via multitype features analysis, dubbed Security Bug Report Identifier (SBRer). Specifically, we make use of multiple kinds of information contained in a bug report, including meta features and textual features, to automatically identify the security bug reports via natural language processing and machine learning techniques. The experimental results show that SBRer with imbalanced data processing can successfully identify the security bug reports with a much higher precision of 99.4% and recall of 79.9% compared to existing work.
引用
收藏
页码:619 / 633
页数:15
相关论文
共 50 条
  • [21] Identification of Security related Bug Reports via Text Mining using Supervised and Unsupervised Classification
    Goseva-Popstojanova, Katerina
    Tyo, Jacob
    2018 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2018), 2018, : 344 - 355
  • [22] Automated Extraction of Security Concerns from Bug Reports
    Alqahtani, Sultan S.
    2019 17TH INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY AND TRUST (PST), 2019, : 327 - 329
  • [23] Automatically Identifying Performance Issue Reports with Heuristic Linguistic Patterns
    Zhao, Yutong
    Xiao, Lu
    Babvey, Pouria
    Sun, Lei
    Wong, Sunny
    Martinez, Angel A.
    Wang, Xiao
    PROCEEDINGS OF THE 28TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '20), 2020, : 964 - 975
  • [24] Experience Report: Security Vulnerability Profiles of Mission Critical Software: Empirical Analysis of Security Related Bug Reports
    Goseva-Popstojanova, Katerina
    Tyo, Jacob
    2017 IEEE 28TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE), 2017, : 152 - 163
  • [25] Identifying non-natural language artifacts in bug reports
    Hirsch, Thomas
    Hofer, Birgit
    2021 36TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING WORKSHOPS (ASEW 2021), 2021, : 191 - 197
  • [26] Identifying and predicting key features to support bug reporting
    Karim, Md Rejaul
    Ihara, Akinori
    Choi, Eunjong
    Iida, Hajimu
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2019, 31 (12)
  • [27] Bug Reports Prioritization: Which Features and Classifier to Use?
    Alenezi, Mamdouh
    Banitaan, Shadi
    2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 2, 2013, : 112 - 116
  • [28] Automatically Identifying Security Checks for Detecting Kernel Semantic Bugs
    Lu, Kangjie
    Pakki, Aditya
    Wu, Qiushi
    COMPUTER SECURITY - ESORICS 2019, PT II, 2019, 11736 : 3 - 25
  • [29] Guiding Bug Triage through Developer Analysis in Bug Reports
    Zhang, Tao
    Yang, Geunseok
    Lee, Byungjeong
    Chan, Alvin T. S.
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2016, 26 (03) : 405 - 431
  • [30] Sais: Self-Adaptive Identification of Security Bug Reports
    Mostafa, Shaikh
    Findley, Bridgette
    Meng, Na
    Wang, Xiaoyin
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2021, 18 (04) : 1779 - 1792