Extended Association Rule Mining and Its Application to Software Engineering Data Sets

被引:0
|
作者
Saito, Hidekazu [1 ]
Nishiura, Kinari [2 ]
Monden, Akito [1 ]
Morisaki, Shuji [3 ]
机构
[1] Kyoto Inst Technol, Fac Informat & Human Sci, Kyoto, Japan
[2] Okayama Univ, Grad Sch Nat Sci & Technol, Okayama, Japan
[3] Nagoya Univ, Grad Sch Informat, Nagoya, Japan
关键词
Association rule mining; software metrics; software effort estimation; data mining;
D O I
10.1142/S0218194024500347
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Association rule mining is a highly effective approach to data analysis for datasets of varying sizes, accommodating diverse feature values. Nevertheless, deriving practical rules from datasets with numerical variables presents a challenge, as these variables must be discretized beforehand. Quantitative association rule mining addresses this issue, allowing the extraction of valuable rules. This paper introduces an extension to quantitative association rules, incorporating a two-variable function in their consequent part. The use of correlation functions, statistical test functions, and error functions is also introduced. We illustrate the utility of this extension through three case studies employing software engineering datasets. In case study 1, we successfully pinpointed the conditions that result in either a high or low correlation between effort and software size, offering valuable insights for software project managers. In case study 2, we effectively identified the conditions that lead to a high or low correlation between the number of bugs and source lines of code, aiding in the formulation of software test planning strategies. In case study 3, we applied our approach to the two-step software effort estimation process, uncovering the conditions most likely to yield low effort estimation errors.
引用
收藏
页码:1735 / 1756
页数:22
相关论文
共 50 条
  • [41] Strategies for partitioning data in association rule mining
    Ahmed, S
    Coenen, R
    Leng, P
    [J]. RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XX, 2004, : 127 - 139
  • [42] Association Rule Data Mining in Agriculture - A Review
    Vignesh, N.
    Vinutha, D. C.
    [J]. COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING, 2020, 1108 : 233 - 239
  • [43] Parallel implementation of association rule in Data Mining
    Einakian, Sussan
    Ghanbari, M.
    [J]. Proceedings of the Thirty-Eighth Southeastern Symposium on System Theory, 2004, : 21 - 26
  • [44] Data squashing as preprocessing in association rule mining
    Fister, Iztok
    Fister, Iztok, Jr.
    Novak, Damijan
    Verber, Domen
    [J]. 2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 1720 - 1725
  • [45] Association rule selection in a data mining environment
    Klemettinen, M
    Mannila, H
    Verkamo, AI
    [J]. PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1999, 1704 : 372 - 377
  • [46] Improvement of Association Algorithm and Its Application in Audit Data Mining
    Lu, Haitao
    Sivaparthipan, C. B.
    Antonidoss, A.
    [J]. JOURNAL OF INTERCONNECTION NETWORKS, 2022, 22 (SUPP03)
  • [47] Data mining for validation in software engineering: An example
    Kajko-Mattsson, M
    Chapin, N
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2004, 14 (04) : 407 - 427
  • [48] Data mining for software engineering and humans in the loop
    Minku L.L.
    Mendes E.
    Turhan B.
    [J]. Progress in Artificial Intelligence, 2016, 5 (04) : 307 - 314
  • [49] Mining Software Engineering Data from GitHub
    Gousios, Georgios
    Spinellis, Diomidis
    [J]. PROCEEDINGS OF THE 2017 IEEE/ACM 39TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING COMPANION (ICSE-C 2017), 2017, : 501 - 502
  • [50] Software defect prediction using relational association rule mining
    Czibula, Gabriela
    Marian, Zsuzsanna
    Czibula, Istvan Gergely
    [J]. INFORMATION SCIENCES, 2014, 264 : 260 - 278