A Hybrid Regression Model for Mixed Numerical and Categorical Data

被引:0
|
作者
Alghanmi, Nouf [1 ]
Zeng, Xiao-Jun [1 ]
机构
[1] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England
关键词
Decision tree; Regression; Mixed data; Hybrid model; SELECTION;
D O I
10.1007/978-3-030-29933-0_31
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is noticeable in different heterogeneity types that complexity is inherent in heterogeneous data, and regression analysis methods are well defined and exhibit high-accuracy performance with numeric data. However, real-world problems contain non-numerical variables. There are two main approaches to handling mixed-type data sets in regression analyses. The first approach is unifying data types for all the variables (such as continuous numerical data) and then applying the regression analysis. However, this approach degrades the data quality, as some original data types are converted to other types in the learning stage. The second approach is to apply some similarity measurements, which can be highly complex in some situations. To overcome these limitations, we propose a tree-based regression model to effectively handle the mixed-type data sets without using a dummy code or a similarity measurement.
引用
收藏
页码:369 / 376
页数:8
相关论文
共 50 条
  • [21] Asymptotic Inferences in a Multinomial Logit Mixed Model for Spatial Categorical Data
    Sutradhar, Brajendra C.
    Rao, R. Prabhakar
    SANKHYA-SERIES A-MATHEMATICAL STATISTICS AND PROBABILITY, 2023, 85 (01): : 885 - 930
  • [22] Bayesian analysis of ordinal categorical data under a mixed inheritance model
    Skotarczak, Ewa
    Molinski, Krzysztof
    Szwaczkowski, Tomasz
    Dobek, Anita
    ARCHIV FUR TIERZUCHT-ARCHIVES OF ANIMAL BREEDING, 2011, 54 (01): : 93 - 103
  • [23] Nonparametric regression and classification with functional, categorical, and mixed covariates
    Selk, Leonie
    Gertheiss, Jan
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2023, 17 (02) : 519 - 543
  • [24] Nonparametric regression and classification with functional, categorical, and mixed covariates
    Leonie Selk
    Jan Gertheiss
    Advances in Data Analysis and Classification, 2023, 17 : 519 - 543
  • [25] Statistical micro matching using a multinomial logistic regression model for categorical data
    Kim, Kangmin
    Park, Mingue
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2019, 26 (05) : 507 - 517
  • [26] ON EXTENDING BOCKS MODEL OF LOGISTIC-REGRESSION IN THE ANALYSIS OF CATEGORICAL-DATA
    LEVIN, B
    SHROUT, PE
    COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1981, 10 (02): : 125 - 147
  • [27] A BIVARIATE CUMULATIVE PROBIT REGRESSION-MODEL FOR ORDERED CATEGORICAL-DATA
    KIM, K
    STATISTICS IN MEDICINE, 1995, 14 (12) : 1341 - 1352
  • [28] Finding mixed memberships in categorical data
    Qing, Huan
    INFORMATION SCIENCES, 2024, 676
  • [29] Ellipsoidally symmetric extensions of the general location model for mixed categorical and continuous data
    Liu, CH
    Rubin, DB
    BIOMETRIKA, 1998, 85 (03) : 673 - 688
  • [30] A Hybrid Model for Nonlinear Regression with Missing Data UsingQuasilinearKernel
    Zhu, Huilin
    Tian, Yanling
    Ren, Yanni
    Hu, Jinglu
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2020, 15 (12) : 1791 - 1800