LbR: A New Regression Architecture for Automated Feature Engineering

被引:1
|
作者
Wang, Meng [1 ]
Ding, Zhijun [1 ]
Pan, Meiqin [2 ]
机构
[1] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
[2] Shanghai Int Studies Univ, Sch Business & Management, Shanghai 200083, Peoples R China
关键词
automatic feature engineering; label; regression; feature pairs; correlations;
D O I
10.1109/ICDMW51313.2020.00066
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, machine learning has developed rapidly and has been widely applied in many fields, such as finance and medical treatment. Many studies have shown that feature engineering is the most important part of machine learning and the most creative part of data science. However, in the traditional feature engineering step, it often requires the participation of experienced domain experts and is very time-consuming. Therefore, automatic feature engineering technology arises, aiming at improving the performance of the model by automatically generating high informative features without expert domain knowledge. However, in these methods, new features are generated by pre-defining a set of identical operators on datasets, ignoring the diversity of data sets. So there is room for improvement in performance. In this paper, we proposed a method named LbR (Label based Regression), which can fully mine correlations between feature pairs and then select feature pairs with high discrimination to generate informative features. We conducted many experiments to show that LbR has better performance and efficiency than other methods in different data sets and machine learning models.
引用
收藏
页码:432 / 439
页数:8
相关论文
共 50 条
  • [31] A New Algorithm for Feature Matching in Reverse Engineering
    朱根松
    周天瑞
    周捷
    TsinghuaScienceandTechnology, 2009, 14(S1) (S1) : 43 - 46
  • [32] A new textural feature for automated cell proliferation analysis
    Gabriel, C
    Leticia, V
    Jorge, M
    MEDICAL PHYSICS, 1998, 440 : 55 - 61
  • [33] AttentionPoolMobileNeXt: An automated construction damage detection model based on a new convolutional neural network and deep feature engineering models
    Aydin M.
    Barua P.D.
    Chadalavada S.
    Dogan S.
    Tuncer T.
    Chakraborty S.
    Acharya R.U.
    Multimedia Tools and Applications, 2025, 84 (4) : 1821 - 1843
  • [34] Enhancing Regression Models for Complex Systems Using Evolutionary Techniques for Feature Engineering
    Arroba, Patricia
    Risco-Martin, Jos L.
    Zapater, Marina
    Moya, Jose M.
    Ayala, Jose L.
    JOURNAL OF GRID COMPUTING, 2015, 13 (03) : 409 - 423
  • [35] NEW ENGINEERING MATERIALS AND DEVELOPING COUNTRIES ARCHITECTURE
    Doroodgar, Amene
    Pourmand, Hassan Ali
    FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2012), 2012, : 485 - 491
  • [36] SVM Regression Modeling Based on Properties of Engineering Materials with PLS Feature Extraction
    Wu, Ya-Rong
    Li, Hua-Ping
    Gan, Xu-Sheng
    ADVANCED RESEARCH ON ENERGY, CHEMISTRY AND MATERIALS APPLICATION, 2014, 848 : 122 - +
  • [37] Enhancing Regression Models for Complex Systems Using Evolutionary Techniques for Feature Engineering
    Patricia Arroba
    José L. Risco-Martín
    Marina Zapater
    José M. Moya
    José L. Ayala
    Journal of Grid Computing, 2015, 13 : 409 - 423
  • [38] NEW DISTRIBUTED CONTROL PRODUCTS FEATURE OPEN ARCHITECTURE
    HICKEY, J
    I&CS-INSTRUMENTATION & CONTROL SYSTEMS, 1992, 65 (03): : 79 - 79
  • [39] Automated segmentation of point data in a feature-based reverse engineering system
    Park, S
    Jun, Y
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART B-JOURNAL OF ENGINEERING MANUFACTURE, 2002, 216 (03) : 445 - 451
  • [40] Solving the False Positives Problem in Fraud Prediction Using Automated Feature Engineering
    Wedge, Roy
    Max Kanter, James
    Veeramachaneni, Kalyan
    Moral Rubio, Santiago
    Iglesias Perez, Sergio
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT III, 2019, 11053 : 372 - 388