Symbolic Regression with augmented dataset using RuleFit

Cited by: 1
Author
de Franca, Fabricio Olivetti [1]
Affiliation
[1] Univ Fed ABC, Ctr Math Comp & Cognit CMCC, Heurist & Anal Lab HAL, Santo Andre, SP, Brazil
Funding
São Paulo Research Foundation, Brazil;
Keywords
symbolic regression; regression analysis; data augmentation;
DOI
10.1109/SYNASC57785.2022.00058
Chinese Library Classification
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
Symbolic Regression models are often associated with transparency and interpretability. The main motivation is their ability to describe nonlinear models while balancing accuracy and conciseness. In practice, however, they may generate models that are as hard to understand as opaque models. From another perspective, linear models are guaranteed to be transparent but fail to model nonlinearities and interactions. The RuleFit algorithm uses a tree-based nonlinear model to create meta-features that augment the dataset, increasing the accuracy of linear models while maintaining their transparency. In this paper we test whether this augmented dataset can help Symbolic Regression find more transparent models without reducing overall accuracy. The results indicate that the augmented models have slightly better accuracy on a class of benchmarks while keeping the expression size small and closer to a linear model. As a caveat, the models also tend to become closer to a step function, which limits the interpretability of the studied phenomena.
Pages: 323-326
Page count: 4
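
The augmentation step described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes scikit-learn and NumPy, uses a small random forest as the tree-based model, treats every non-root tree node as one binary rule feature, and fits a cross-validated Lasso on the augmented data as a stand-in for the sparse linear step. The synthetic data and the helper name `rule_features` are hypothetical, and the Symbolic Regression step studied in the paper is not shown.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LassoCV


def rule_features(forest, X):
    """Return a 0/1 matrix with one column per non-root tree node.

    A column is 1 when the sample passes through that node, i.e. when it
    satisfies the conjunction of split conditions leading to the node.
    """
    blocks = []
    for tree in forest.estimators_:
        path = tree.decision_path(X).toarray()  # shape: (n_samples, n_nodes)
        blocks.append(path[:, 1:])  # drop the root, which is always 1
    return np.hstack(blocks)


# Hypothetical synthetic data: a step in x0 plus a linear effect of x1.
rng = np.random.default_rng(0)
X = rng.uniform(-1.0, 1.0, size=(200, 2))
y = np.where(X[:, 0] > 0, 2.0, -1.0) + 0.5 * X[:, 1]

# Shallow trees keep each rule short and readable.
forest = RandomForestRegressor(n_estimators=5, max_depth=2, random_state=0)
forest.fit(X, y)

R = rule_features(forest, X)   # meta-features derived from the trees
X_aug = np.hstack([X, R])      # augmented dataset: original features + rules

# Sparse linear model on the augmented data (stand-in for the linear step).
model = LassoCV(cv=5).fit(X_aug, y)
print("original features:", X.shape[1], "rule features:", R.shape[1])
```

In the setting the abstract describes, a matrix like `X_aug` would be handed to a Symbolic Regression method instead of the Lasso. Because the rule columns are binary indicators, models built on them can drift toward step functions, which is the caveat the abstract notes.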