A Hybrid Regression Model for Mixed Numerical and Categorical Data

被引:0
|
作者
Alghanmi, Nouf [1 ]
Zeng, Xiao-Jun [1 ]
机构
[1] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England
关键词
Decision tree; Regression; Mixed data; Hybrid model; SELECTION;
D O I
10.1007/978-3-030-29933-0_31
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is noticeable in different heterogeneity types that complexity is inherent in heterogeneous data, and regression analysis methods are well defined and exhibit high-accuracy performance with numeric data. However, real-world problems contain non-numerical variables. There are two main approaches to handling mixed-type data sets in regression analyses. The first approach is unifying data types for all the variables (such as continuous numerical data) and then applying the regression analysis. However, this approach degrades the data quality, as some original data types are converted to other types in the learning stage. The second approach is to apply some similarity measurements, which can be highly complex in some situations. To overcome these limitations, we propose a tree-based regression model to effectively handle the mixed-type data sets without using a dummy code or a similarity measurement.
引用
收藏
页码:369 / 376
页数:8
相关论文
共 50 条