Modeling Governance KB with CATPCA to Overcome Multicollinearity in the Logistic Regression

被引:4
|
作者
Khikmah, L. [1 ]
Wijayanto, H. [1 ]
Syafitri, U. D. [1 ]
机构
[1] Bogor Agr Univ, Fac Math & Nat Sci, Dept Stat, Bogor, Indonesia
关键词
PRINCIPAL-COMPONENTS;
D O I
10.1088/1742-6596/824/1/012027
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
The problem often encounters in logistic regression modeling are multicollinearity problems. Data that have multicollinearity between explanatory variables with the result in the estimation of parameters to be bias. Besides, the multicollinearity will result in error in the classification. In general, to overcome multicollinearity in regression used stepwise regression. They are also another method to overcome multicollinearity which involves all variable for prediction. That is Principal Component Analysis (PCA). However, classical PCA in only for numeric data. Its data are categorical, one method to solve the problems is Categorical Principal Component Analysis (CATPCA). Data were used in this research were a part of data Demographic and Population Survey Indonesia (IDHS) 2012. This research focuses on the characteristic of women of using the contraceptive methods. Classification results evaluated using Area Under Curve (AUC) values. The higher the AUC value, the better. Based on AUC values, the classification of the contraceptive method using stepwise method (58.66%) is better than the logistic regression model (57.39%) and CATPCA (57.39%). Evaluation of the results of logistic regression using sensitivity, shows the opposite where CATPCA method (99.79%) is better than logistic regression method (92.43%) and stepwise (92.05%). Therefore in this study focuses on major class classification (using a contraceptive method), then the selected model is CATPCA because it can raise the level of the major class model accuracy.
引用
收藏
页数:6
相关论文
共 50 条