Comparative analysis of impact of classification algorithms on security and performance bug reports

Authors
Said, Maryyam [2]
Bin Faiz, Rizwan [2]
Aljaidi, Mohammad [1]
Alshammari, Muteb [3]
Affiliations
[1] Zarqa Univ, Fac Informat Technol, Dept Comp Sci, Zarqa 13116, Jordan
[2] Riphah Int Univ, Fac Comp, Islamabad 46000, Pakistan
[3] Northern Border Univ, Fac Comp & Informat Technol, Dept Informat Technol, Rafha 91431, Saudi Arabia
Keywords
bug classification; security bug; performance bug; text mining; bug prediction
DOI
10.1515/jisys-2024-0045
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Identifying and classifying bugs, e.g., security and performance bugs, is a preemptive and fundamental practice that contributes to the development of secure and efficient software. Software Quality Assurance (SQA) needs to classify bugs into relevant categories, e.g., security and performance bugs, since one type of bug may take priority over another; doing so facilitates software evolution and maintenance. In addition to classification, it would be ideal for the SQA manager to prioritize security and performance bugs by their level of perseverance, severity, or impact, and to assign them to developers whose expertise matches such bugs, thus facilitating triaging. The aim of this research is to compare and analyze the prediction accuracy of machine learning algorithms, i.e., artificial neural network (ANN), support vector machine (SVM), Naïve Bayes (NB), decision tree (DT), logistic regression (LR), and K-nearest neighbor (KNN), in identifying security and performance bugs from a bug repository. We first label an existing dataset from the Bugzilla repository with the help of a software security expert in order to train the algorithms. Our research type is explanatory, and our research method is controlled experimentation, in which the independent variables are the algorithms (ANN, SVM, NB, DT, LR, and KNN) and the dependent variable is prediction accuracy. We first applied preprocessing and Term Frequency-Inverse Document Frequency (TF-IDF) feature extraction, and then applied the classification algorithms. The results were measured through accuracy, precision, recall, and F-measure, and were then compared and validated through ten-fold cross-validation.
Comparative analysis reveals that two algorithms (SVM and LR) perform better in terms of precision (0.99) for performance bugs, and three algorithms (SVM, ANN, and LR) perform better in terms of F1 score for security bugs, as compared with the other classification algorithms; this is essentially due to the linear nature of the dataset and the large number of features in it.
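The evaluation protocol named in the abstract (ten-fold cross-validation scored by accuracy, precision, recall, and F-measure) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names `binary_metrics` and `k_fold_indices` are hypothetical, labels are assumed to be binary (1 marks the bug class of interest, e.g., security), and the folds are contiguous with no shuffling or stratification.

```python
from typing import Dict, Iterator, List, Sequence, Tuple


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, precision, recall and F1 for the positive class (label 1)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives

    # Guard every ratio against an empty denominator.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


def k_fold_indices(n_samples: int, k: int = 10) -> Iterator[Tuple[List[int], List[int]]]:
    """Yield (train, test) index lists for k contiguous, near-equal folds."""
    # The first (n_samples % k) folds receive one extra sample.
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in sizes:
        stop = start + size
        test = list(range(start, stop))
        train = list(range(0, start)) + list(range(stop, n_samples))
        yield train, test
        start = stop
```

In a full experiment, each classifier would be trained on the `train` indices of every fold and scored with `binary_metrics` on the matching `test` indices, and the ten per-fold scores would then be averaged for each algorithm.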
Pages: 23