Oblique random forests with binary and ternary decision structures and non-parallel hyperplanes classifiers
Cited by: 0
Authors:
Saigal, Pooja [1]
David, Anubhav [2]
Rastogi, Reshma [2]
Affiliations:
[1] Vivekananda Inst Profess Studies, Sch Informat Technol, Delhi, India
[2] South Asian Univ, Dept Comp Sci, Delhi 110021, India
Keywords:
Ensemble methods;
Oblique random forests;
Twin support vector machines and Multi-surface proximal support vector machines;
Classification;
Decision trees;
SUPPORT VECTOR MACHINE;
MULTI-CATEGORY CLASSIFICATION;
TREE ENSEMBLE;
REGRESSION;
SYSTEM;
SOLVE;
DOI:
10.1007/s41060-023-00472-y
Chinese Library Classification (CLC) number:
TP18 [Artificial intelligence theory];
Subject classification codes:
081104 ;
0812 ;
0835 ;
1405 ;
Abstract:
Owing to their robustness, ensemble methods have become widely popular. In this work, we propose several variations of oblique decision tree ensembles, called oblique random forests, which are implemented with binary and ternary decision structures. Oblique random forests are trained using a linear classifier, so the decision hyperplane at each internal node of the base model is not necessarily orthogonal to a feature axis. For multiclass classification problems, the training samples are partitioned at non-leaf nodes into groups of classes according to their underlying geometric characteristics, with respect to a randomly chosen feature subspace. Each of the proposed models employs a different binary base classifier. The binary classifiers used in this work are twin support vector machines (TWSVM), Improvements on nu-TWSVM (I nu-TWSVM), multi-surface proximal support vector machine (MPSVM), and Regularized MPSVM (RegMPSVM). We also propose a novel approach for choosing the final hyperplane that splits the data at a non-leaf node while optimizing an impurity criterion in the decision tree. This work presents a comparative analysis of different base classifiers for implementing oblique random forests using binary and ternary decision structures. In addition, multiple regularization strategies, such as Tikhonov regularization, axis-parallel split regularization, and null space regularization, are used to address limited sample size issues in the oblique random forest decision trees implemented with MPSVM and RegMPSVM, whereas the implementations with TWSVM and I nu-TWSVM use Tikhonov regularization only. All these models are compared for their generalization ability on 38 benchmark UCI classification datasets. The efficacy of these methods is also established through statistical analysis.
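The idea of selecting a split hyperplane by an impurity criterion can be illustrated with a minimal sketch. This is not the authors' procedure: the abstract does not name the impurity measure, so Gini impurity is assumed here, the two candidate hyperplanes are hand-picked rather than produced by TWSVM or MPSVM, and the function names `gini`, `split_impurity`, and `best_hyperplane` are hypothetical.

```python
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels (0.0 for an empty or pure list)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def split_impurity(X, y, w, b):
    """Weighted Gini impurity after splitting on the oblique test w.x + b >= 0."""
    left, right = [], []
    for xi, yi in zip(X, y):
        side = sum(wj * xj for wj, xj in zip(w, xi)) + b
        (right if side >= 0 else left).append(yi)
    n = len(y)
    return (len(left) * gini(left) + len(right) * gini(right)) / n


def best_hyperplane(X, y, candidates):
    """Return the (w, b) candidate with the lowest weighted Gini impurity."""
    return min(candidates, key=lambda wb: split_impurity(X, y, wb[0], wb[1]))


# Toy data (assumed for illustration): two classes separated by x1 + x2 = 1.
X = [(0.0, 0.0), (0.2, 0.5), (0.5, 0.2), (1.0, 1.0), (0.8, 0.6), (0.4, 0.9)]
y = [0, 0, 0, 1, 1, 1]

axis_parallel = ((1.0, 0.0), -0.45)  # test: x1 >= 0.45 (orthogonal to a feature axis)
oblique = ((1.0, 1.0), -1.0)         # test: x1 + x2 >= 1 (oblique hyperplane)

chosen = best_hyperplane(X, y, [axis_parallel, oblique])
```

On this toy data the oblique candidate separates the two classes completely, while the axis-parallel candidate leaves both child nodes mixed, so the impurity-based selection picks the oblique hyperplane.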
Pages: 19