Mining Software Repository for Cleaning Bugs Using Data Mining Technique

被引:5
|
作者
Mahmood, Nasir [1 ]
Hafeez, Yaser [1 ]
Iqbal, Khalid [2 ]
Hussain, Shariq [3 ]
Aqib, Muhammad [1 ]
Jamal, Muhammad [4 ]
Song, Oh-Young [5 ]
机构
[1] Pir Mehr Ali Shah Arid Agr Univ, Univ Inst Informat Technol, Rawalpindi 46000, Pakistan
[2] COMSATS Univ Islamabad, Dept Comp Sci, Attock Campus, Attock 43600, Pakistan
[3] Fdn Univ Islamabad, Dept Software Engn, Islamabad 44000, Pakistan
[4] Pir Mehr Ali Shah Arid Agr Univ, Dept Math & Stat, Rawalpindi 46000, Pakistan
[5] Sejong Univ, Dept Software, Seoul 05006, South Korea
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2021年 / 69卷 / 01期
关键词
Fault prediction; association rule; data mining; frequent pattern mining; RULES;
D O I
10.32604/cmc.2021.016614
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite advances in technological complexity and efforts, software repository maintenance requires reusing the data to reduce the effort and complexity. However, increasing ambiguity, irrelevance, and bugs while extracting similar data during software development generate a large amount of data from those data that reside in repositories. Thus, there is a need for a repository mining technique for relevant and bug-free data prediction. This paper proposes a fault prediction approach using a data-mining technique to find good predictors for high-quality software. To predict errors in mining data, the Apriori algorithm was used to discover association rules by fixing confidence at more than 40% and support at least 30%. The pruning strategy was adopted based on evaluation measures. Next, the rules were extracted from three projects of different domains; the extracted rules were then combined to obtain the most popular rules based on the evaluation measure values. To evaluate the proposed approach, we conducted an experimental study to compare the proposed rules with existing ones using four different industrial projects. The evaluation showed that the results of our proposal are promising. Practitioners and developers can utilize these rules for defect prediction during early software development.
引用
收藏
页码:873 / 893
页数:21
相关论文
共 50 条
  • [21] A Mining Technique Using n-Grams and Motion Transcripts for Body Sensor Network Data Repository
    Loseu, Vitali
    Ghasemzadeh, Hassan
    Jafari, Roozbeh
    PROCEEDINGS OF THE IEEE, 2012, 100 (01) : 107 - 121
  • [22] Data Mining Technique Effectiveness for Improving Software Productivity and Quality
    Kobayashi, Toru
    2012 IEEE 36TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2012, : 256 - 256
  • [23] Text Mining Technique for Data Mining Application
    Govindarajan, M.
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 26, PARTS 1 AND 2, DECEMBER 2007, 2007, 26 : 544 - 549
  • [24] Data cleaning of medical data for knowledge mining
    Hongxing, P. (phxlee@zzu.edu.cn), 1600, Academy Publisher, P.O.Box 40,, OULU, 90571, Finland (08):
  • [25] Predictive Analytics Using Data Mining Technique
    Gulati, Hina
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 713 - 716
  • [26] Mining ordinal patterns for data cleaning
    Liu, YB
    Liu, DY
    PROCEEDINGS OF THE 2004 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI-2004), 2004, : 438 - 443
  • [27] Mining ordinal patterns for data cleaning
    Liu, Y. B.
    Liu, D. Y.
    COMPUTATIONAL METHODS, PTS 1 AND 2, 2006, : 1267 - +
  • [28] Data mining for software trustworthiness
    Kou, Gang
    Shi, Yong
    Dong, Guozhu
    INFORMATION SCIENCES, 2012, 191 : 1 - 2
  • [29] Data mining in software engineering
    Halkidi, M.
    Spinellis, D.
    Tsatsaronis, G.
    Vazirgiannis, M.
    INTELLIGENT DATA ANALYSIS, 2011, 15 (03) : 413 - 441
  • [30] Image Cleaning and Enhancement Technique for Underwater Mining
    Rajesh, Shravan Dev
    Almeida, Jose Miguel
    Martins, Alfredo
    OCEANS 2019 - MARSEILLE, 2019,