Extended Association Rule Mining and Its Application to Software Engineering Data Sets

Cited by: 0
Authors
Saito, Hidekazu [1]
Nishiura, Kinari [2]
Monden, Akito [1]
Morisaki, Shuji [3]
Affiliations
[1] Kyoto Inst Technol, Fac Informat & Human Sci, Kyoto, Japan
[2] Okayama Univ, Grad Sch Nat Sci & Technol, Okayama, Japan
[3] Nagoya Univ, Grad Sch Informat, Nagoya, Japan
Keywords
Association rule mining; software metrics; software effort estimation; data mining
DOI
10.1142/S0218194024500347
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Association rule mining is a highly effective approach to data analysis for datasets of varying sizes, accommodating diverse feature values. Nevertheless, deriving practical rules from datasets with numerical variables presents a challenge, as these variables must be discretized beforehand. Quantitative association rule mining addresses this issue, allowing the extraction of valuable rules. This paper introduces an extension to quantitative association rules, incorporating a two-variable function in their consequent part. The use of correlation functions, statistical test functions, and error functions is also introduced. We illustrate the utility of this extension through three case studies employing software engineering datasets. In case study 1, we successfully pinpointed the conditions that result in either a high or low correlation between effort and software size, offering valuable insights for software project managers. In case study 2, we effectively identified the conditions that lead to a high or low correlation between the number of bugs and source lines of code, aiding in the formulation of software test planning strategies. In case study 3, we applied our approach to the two-step software effort estimation process, uncovering the conditions most likely to yield low effort estimation errors.
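The extension described in the abstract can be illustrated with a minimal sketch. It assumes that a rule pairs a categorical antecedent with a two-variable function (here, Pearson correlation) evaluated over the records the antecedent matches. The names `pearson`, `evaluate_rule`, and `projects`, and the toy data itself, are hypothetical and do not come from the paper, whose actual rule definition and datasets may differ.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(vx * vy)


def evaluate_rule(records, antecedent, x_name, y_name, func=pearson):
    """Evaluate a rule of the assumed form: antecedent => func(x, y).

    Returns (support, value), where support is the fraction of records
    matching the antecedent and value is func applied to the two
    variables over the matching records only.
    """
    matched = [
        r for r in records
        if all(r.get(key) == val for key, val in antecedent.items())
    ]
    support = len(matched) / len(records)
    value = func([r[x_name] for r in matched], [r[y_name] for r in matched])
    return support, value


# Made-up toy data for illustration only (not from the paper).
projects = [
    {"lang": "Java", "size": 10, "effort": 21},
    {"lang": "Java", "size": 20, "effort": 39},
    {"lang": "Java", "size": 30, "effort": 62},
    {"lang": "COBOL", "size": 10, "effort": 50},
    {"lang": "COBOL", "size": 20, "effort": 20},
    {"lang": "COBOL", "size": 30, "effort": 45},
]

java_support, java_corr = evaluate_rule(projects, {"lang": "Java"}, "size", "effort")
cobol_support, cobol_corr = evaluate_rule(projects, {"lang": "COBOL"}, "size", "effort")
print(f"lang=Java  => corr(size, effort) = {java_corr:.3f} (support {java_support:.2f})")
print(f"lang=COBOL => corr(size, effort) = {cobol_corr:.3f} (support {cobol_support:.2f})")
```

In this toy data the Java subset shows a strong size-effort correlation and the COBOL subset a weak one, which is the kind of contrast case studies 1 and 2 look for. Passing a different `func` (for example, an error function or a test statistic) would correspond to the other function types the abstract mentions.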
Pages: 1735-1756
Page count: 22