Extended Association Rule Mining and Its Application to Software Engineering Data Sets

被引：0

作者：

Saito, Hidekazu ^{[1
]}

Nishiura, Kinari ^{[2
]}

Monden, Akito ^{[1
]}

Morisaki, Shuji ^{[3
]}

机构：

[1] Kyoto Inst Technol, Fac Informat & Human Sci, Kyoto, Japan

[2] Okayama Univ, Grad Sch Nat Sci & Technol, Okayama, Japan

[3] Nagoya Univ, Grad Sch Informat, Nagoya, Japan

来源：

INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING | 2024年

关键词：

Association rule mining; software metrics; software effort estimation; data mining;

D O I：

10.1142/S0218194024500347

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Association rule mining is a highly effective approach to data analysis for datasets of varying sizes, accommodating diverse feature values. Nevertheless, deriving practical rules from datasets with numerical variables presents a challenge, as these variables must be discretized beforehand. Quantitative association rule mining addresses this issue, allowing the extraction of valuable rules. This paper introduces an extension to quantitative association rules, incorporating a two-variable function in their consequent part. The use of correlation functions, statistical test functions, and error functions is also introduced. We illustrate the utility of this extension through three case studies employing software engineering datasets. In case study 1, we successfully pinpointed the conditions that result in either a high or low correlation between effort and software size, offering valuable insights for software project managers. In case study 2, we effectively identified the conditions that lead to a high or low correlation between the number of bugs and source lines of code, aiding in the formulation of software test planning strategies. In case study 3, we applied our approach to the two-step software effort estimation process, uncovering the conditions most likely to yield low effort estimation errors.

引用

页码：1735 / 1756

页数：22

共 50 条

[41] Strategies for partitioning data in association rule mining
Ahmed, S
Coenen, R
Leng, P
[J]. RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XX, 2004, : 127 - 139
[42] Association Rule Data Mining in Agriculture - A Review
Vignesh, N.
Vinutha, D. C.
[J]. COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING, 2020, 1108 : 233 - 239
[43] Parallel implementation of association rule in Data Mining
Einakian, Sussan
Ghanbari, M.
[J]. Proceedings of the Thirty-Eighth Southeastern Symposium on System Theory, 2004, : 21 - 26
[44] Data squashing as preprocessing in association rule mining
Fister, Iztok
Fister, Iztok, Jr.
Novak, Damijan
Verber, Domen
[J]. 2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 1720 - 1725
[45] Association rule selection in a data mining environment
Klemettinen, M
Mannila, H
Verkamo, AI
[J]. PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1999, 1704 : 372 - 377
[46] Improvement of Association Algorithm and Its Application in Audit Data Mining
Lu, Haitao
Sivaparthipan, C. B.
Antonidoss, A.
[J]. JOURNAL OF INTERCONNECTION NETWORKS, 2022, 22 (SUPP03)
[47] Data mining for validation in software engineering: An example
Kajko-Mattsson, M
Chapin, N
[J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2004, 14 (04) : 407 - 427
[48] Data mining for software engineering and humans in the loop
Minku L.L.
Mendes E.
Turhan B.
[J]. Progress in Artificial Intelligence, 2016, 5 (04) : 307 - 314
[49] Mining Software Engineering Data from GitHub
Gousios, Georgios
Spinellis, Diomidis
[J]. PROCEEDINGS OF THE 2017 IEEE/ACM 39TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING COMPANION (ICSE-C 2017), 2017, : 501 - 502
[50] Software defect prediction using relational association rule mining
Czibula, Gabriela
Marian, Zsuzsanna
Czibula, Istvan Gergely
[J]. INFORMATION SCIENCES, 2014, 264 : 260 - 278

← 1 2 3 4 5 →