Automatically Identifying Security Bug Reports via Multitype Features Analysis

被引:8
|
作者
Zou, Deqing [1 ,2 ]
Deng, Zhijun [1 ]
Li, Zhen [1 ,3 ]
Jin, Hai [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Serv Comp Technol & Syst Lab, Big Data Technol & Syst Lab,Cluster & Grid Comp L, Wuhan 430074, Peoples R China
[2] Shenzhen Huazhong Univ Sci & Technol, Res Inst, Shenzhen, Peoples R China
[3] Hebei Univ, Sch Cyber Secur & Comp, Baoding, Peoples R China
来源
基金
美国国家科学基金会;
关键词
Security bug identification; Bug report; Natural language processing; Machine learning;
D O I
10.1007/978-3-319-93638-3_35
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bug-tracking systems are widely used by software developers to manage bug reports. Since it is time-consuming and costly to fix all the bugs, developers usually pay more attention to the bugs with higher impact, such as security bugs (i.e., vulnerabilities) which can be exploited by malicious users to launch attacks and cause great damages. However, manually identifying security bug reports from millions of reports in bug-tracking systems is difficult and error-prone. Furthermore, existing automated identification approaches to security bug reports often incur many false negatives, causing a hidden danger to the computer system. To address this important problem, we present an automatic security bug reports identification model via multitype features analysis, dubbed Security Bug Report Identifier (SBRer). Specifically, we make use of multiple kinds of information contained in a bug report, including meta features and textual features, to automatically identify the security bug reports via natural language processing and machine learning techniques. The experimental results show that SBRer with imbalanced data processing can successfully identify the security bug reports with a much higher precision of 99.4% and recall of 79.9% compared to existing work.
引用
收藏
页码:619 / 633
页数:15
相关论文
共 50 条
  • [31] Domain knowledge-based security bug reports prediction
    Zheng, Wei
    Cheng, JingYuan
    Wu, Xiaoxue
    Sun, Ruiyang
    Wang, Xiaolong
    Sun, Xiaobing
    KNOWLEDGE-BASED SYSTEMS, 2022, 241
  • [32] Automatically Reproducing Android Bug Reports using Natural Language Processing and Reinforcement Learning
    Zhang, Zhaoxu
    Winn, Robert
    Zhao, Yu
    Yu, Tingting
    Halfond, William G. J.
    PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023, : 411 - 422
  • [33] BugListener: Identifying and Synthesizing Bug Reports from Collaborative Live Chats
    Shi, Lin
    Mu, Fangwen
    Zhang, Yumin
    Yang, Ye
    Chen, Junjie
    Chen, Xiao
    Jiang, Hanzhi
    Jiang, Ziyou
    Wang, Qing
    2022 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2022), 2022, : 299 - 311
  • [34] Understanding Key Features of High-impact Bug Reports
    Karim, Md. Rejaul
    Ihara, Akinori
    Yang, Xin
    Iida, Hajimu
    Matsumoto, Kenichi
    2017 8TH IEEE INTERNATIONAL WORKSHOP ON EMPIRICAL SOFTWARE ENGINEERING IN PRACTICE (IWESEP), 2017, : 53 - 58
  • [35] Enhancements for duplication detection in bug reports with manifold correlation features
    Lin, Meng-Jie
    Yang, Cheng-Zen
    Lee, Chao-Yuan
    Chen, Chun-Chang
    JOURNAL OF SYSTEMS AND SOFTWARE, 2016, 121 : 223 - 233
  • [36] Automated Identification of Security Issues from Commit Messages and Bug Reports
    Zhou, Yaqin
    Sharma, Asankhaya
    ESEC/FSE 2017: PROCEEDINGS OF THE 2017 11TH JOINT MEETING ON FOUNDATIONS OF SOFTWARE ENGINEERING, 2017, : 914 - 919
  • [37] Automatically Identifying the Human Sense of Familiarity Using Eye Gaze Features
    Castillon, Iliana
    Chartier, Trevor
    Venkatesha, Videep
    Okada, Noah S.
    Davis, Asa
    Cleary, Anne M.
    Blanchard, Nathaniel
    HUMAN-COMPUTER INTERACTION, PT I, HCI 2024, 2024, 14684 : 291 - 310
  • [38] An Empirical Analysis of Bug Reports and Bug Fixing in Open Source Android Apps
    Bhattacharya, Pamela
    Ulanova, Liudmila
    Neamtiu, Iulian
    Koduru, Sai Charan
    PROCEEDINGS OF THE 17TH EUROPEAN CONFERENCE ON SOFTWARE MAINTENANCE AND REENGINEERING (CSMR 2013), 2013, : 133 - 143
  • [39] New Methodology for Contextual Features Usage in Duplicate Bug Reports Detection
    Neysiani, Behzad Soleimani
    Babamir, Seyed Morteza
    2019 5TH INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), 2019, : 178 - 183
  • [40] How to Better Distinguish Security Bug Reports (Using Dual Hyperparameter Optimization)
    Rui Shu
    Tianpei Xia
    Jianfeng Chen
    Laurie Williams
    Tim Menzies
    Empirical Software Engineering, 2021, 26