Mining Software Repository for Cleaning Bugs Using Data Mining Technique

Cited by: 5
Authors
Mahmood, Nasir [1 ]
Hafeez, Yaser [1 ]
Iqbal, Khalid [2 ]
Hussain, Shariq [3 ]
Aqib, Muhammad [1 ]
Jamal, Muhammad [4 ]
Song, Oh-Young [5 ]
Affiliations
[1] Pir Mehr Ali Shah Arid Agr Univ, Univ Inst Informat Technol, Rawalpindi 46000, Pakistan
[2] COMSATS Univ Islamabad, Dept Comp Sci, Attock Campus, Attock 43600, Pakistan
[3] Fdn Univ Islamabad, Dept Software Engn, Islamabad 44000, Pakistan
[4] Pir Mehr Ali Shah Arid Agr Univ, Dept Math & Stat, Rawalpindi 46000, Pakistan
[5] Sejong Univ, Dept Software, Seoul 05006, South Korea
Source
CMC-COMPUTERS MATERIALS & CONTINUA | 2021 / Vol. 69 / No. 01
Keywords
Fault prediction; association rule; data mining; frequent pattern mining; rules
DOI
10.32604/cmc.2021.016614
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Despite technological advances, maintaining software repositories still depends on reusing stored data to reduce effort and complexity. However, extracting similar data during software development introduces growing ambiguity, irrelevance, and bugs, and it adds to the large volume of data already held in repositories. Thus, a repository mining technique is needed to predict relevant, bug-free data. This paper proposes a fault prediction approach that uses a data-mining technique to find good predictors of high-quality software. To predict errors in the mined data, the Apriori algorithm was used to discover association rules, with confidence fixed at more than 40% and support at a minimum of 30%. A pruning strategy based on evaluation measures was adopted. Next, rules were extracted from three projects in different domains and then combined to obtain the most popular rules according to their evaluation measure values. To evaluate the proposed approach, we conducted an experimental study comparing the proposed rules with existing ones on four different industrial projects. The evaluation showed that the results of our proposal are promising. Practitioners and developers can use these rules for defect prediction early in software development.
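The abstract names the Apriori algorithm with support of at least 30% and confidence above 40%. The following is a minimal sketch of that general technique, not the authors' implementation: the function name `apriori_rules`, the toy transactions, and the item labels (`high_complexity`, `many_changes`, `defective`, `low_coverage`) are all hypothetical and chosen only for illustration.

```python
from itertools import combinations


def apriori_rules(transactions, min_support=0.30, min_confidence=0.40):
    """Return (antecedent, consequent, support, confidence) tuples.

    Keeps itemsets with support >= min_support and rules with
    confidence > min_confidence, mirroring the thresholds in the abstract.
    """
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)

    def support(itemset):
        # Fraction of transactions that contain every item of `itemset`.
        return sum(itemset <= t for t in transactions) / n

    # Level 1: frequent single items.
    items = {i for t in transactions for i in t}
    current = {frozenset([i]) for i in items
               if support(frozenset([i])) >= min_support}

    frequent = {}
    k = 1
    while current:
        for s in current:
            frequent[s] = support(s)
        k += 1
        # Join step: merge frequent (k-1)-itemsets into size-k candidates.
        candidates = {a | b for a in current for b in current
                      if len(a | b) == k}
        # Prune step: all (k-1)-subsets must be frequent (Apriori property),
        # and the candidate itself must meet the support threshold.
        current = {c for c in candidates
                   if all(frozenset(s) in frequent
                          for s in combinations(c, k - 1))
                   and support(c) >= min_support}

    # Rule generation: split each frequent itemset into antecedent/consequent.
    rules = []
    for itemset, sup in frequent.items():
        for r in range(1, len(itemset)):
            for ante in combinations(sorted(itemset), r):
                ante = frozenset(ante)
                conf = sup / frequent[ante]
                if conf > min_confidence:
                    rules.append((ante, itemset - ante, sup, conf))
    return rules


# Hypothetical data: each set lists the flags observed for one module.
transactions = [
    {"high_complexity", "many_changes", "defective"},
    {"high_complexity", "defective"},
    {"many_changes"},
    {"high_complexity", "many_changes", "defective"},
    {"low_coverage"},
]

rules = apriori_rules(transactions)
for ante, cons, sup, conf in rules:
    print(sorted(ante), "->", sorted(cons), f"support={sup:.2f}", f"confidence={conf:.2f}")
```

In this toy data, `low_coverage` appears in only one of five transactions, so it falls below the 30% support threshold and is dropped before any rule is formed. The pruning described in the abstract ("based on evaluation measures") may involve further measures that this sketch does not attempt to reproduce.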
Pages: 873-893
Page count: 21
Related papers
50 items in total
  • [31] DATA MINING FOR SOFTWARE ENGINEERING
    Xie, Tao
    Thummalapenta, Suresh
    Lo, David
    Liu, Chao
    COMPUTER, 2009, 42 (08) : 55 - 62
  • [32] Mining software engineering data
    Xie, Tao
    Pei, Jian
    Hassan, Ahmed E.
    29TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: ICSE 2007 COMPANION VOLUME, PROCEEDINGS, 2007, : 172 - +
  • [33] Software quality prediction using data mining techniques
    Merzah, Bayadaa M.
    2019 International Conference on Information and Communications Technology, ICOIACT 2019, 2019, : 394 - 397
  • [34] Software Effort Estimation Using Data Mining Techniques
    Benala, Tirimula Rao
    Mall, Rajib
    Srikavya, P.
    HariPriya, M. Vani
    ICT AND CRITICAL INFRASTRUCTURE: PROCEEDINGS OF THE 48TH ANNUAL CONVENTION OF COMPUTER SOCIETY OF INDIA - VOL I, 2014, 248 : 85 - 92
  • [35] Real estate valuation using data mining software
    Hromada, Eduard
    5TH CREATIVE CONSTRUCTION CONFERENCE (CCC 2016), 2016, 164 : 284 - 291
  • [36] Improvised Software Code Comprehension Using Data Mining
    Gupta, Ram Gopal
    Dumka, Ankur
    Mazumdar, Bireshwar Dass
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2024, 21 (03) : 531 - 547
  • [37] Diggit: Automated Code Review via Software Repository Mining
    Chatley, Robert
    Jones, Lawrence
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING (SANER 2018), 2018, : 567 - 571
  • [38] Software Fault Prediction Using Data Mining Techniques on Software Metrics
    Kumar, Rakesh
    Chaturvedi, Amrita
    MACHINE LEARNING AND BIG DATA ANALYTICS (PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND BIG DATA ANALYTICS (ICMLBDA) 2021), 2022, 256 : 304 - 313
  • [39] Open Source Software Success Measurement Method Based on Mining Software Repository
    Ning D.-J.
    Ye P.-G.
    Liu Q.
    Li M.
    2018, Chinese Institute of Electronics (46): : 2930 - 2935
  • [40] An Open Data Repository for Engineering Design: Using Text Mining with Open Government Data
    Giordano, Vito
    Coli, Elena
    Martini, Antonella
    COMPUTERS IN INDUSTRY, 2022, 142