Regularized Feature Selection in Categorical PLS for Multicollinear Data

被引:1
|
作者
Mehmood, Tahir [1 ]
机构
[1] NUST, SNS, Islamabad, Pakistan
关键词
PARTIAL LEAST-SQUARES; BALOCHISTAN;
D O I
10.1155/2021/5561752
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Article presents the algorithm which models the categorical multicollinear data by providing the balance in model accuracy on test data and number of selected features in the model. In all scientific fields, multicollinear data is being generated, where obviously some variables are noise and some are influential reference to response variable. Features and response appeared to be categorical in mathematical and statistical modeling of public health data. These datasets usually appeared to collinear, where partial least squares (PLS) is the potential method, which is not feature selection at its default level and deals with quantitative features. Recently, categorical PLS (Cat-PLS) is introduced. We have implemented the regularized feature selection in Cat-PLS where filter-based feature selection and categorical mean through Cramer's V, Phi coefficient, Tschuprow's T coefficient, Contingency Coefficient, and Yule's Q and Yule's Y are used. Monte carlo simulation with 100 runs indicates Cramer V*VIP is the better choice in terms of better model performance, number of feature selection, and interpretations for modeling the stillbirths, which is taken as the case study. The framework can be used in related areas to explore and model the related data structures.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] A Regularized Framework for Feature Selection in Face Detection and Authentication
    Destrero, Augusto
    De Mol, Christine
    Odone, Francesca
    Verri, Alessandro
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2009, 83 (02) : 164 - 177
  • [42] Regularized Feature Selection Landscapes: An Empirical Study of Multimodality
    Sanchez-Diaz, Xavier F. C.
    Masson, Corentin
    Mengshoel, Ole Jakob
    PARALLEL PROBLEM SOLVING FROM NATURE-PPSN XVIII, PPSN 2024, PT I, 2024, 15148 : 409 - 426
  • [43] Parallel Feature Selection for Regularized Least-Squares
    Okser, Sebastian
    Airola, Antti
    Aittokallio, Tero
    Salakoski, Tapio
    Pahikkala, Tapio
    APPLIED PARALLEL AND SCIENTIFIC COMPUTING (PARA 2012), 2013, 7782 : 280 - 294
  • [44] Feature Selection by Transfer Learning with Linear Regularized Models
    Helleputte, Thibault
    Dupont, Pierre
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I, 2009, 5781 : 533 - 547
  • [45] Robust inner product regularized unsupervised feature selection
    Youcheng Qian
    Xueyan Yin
    Wei Gao
    Multimedia Tools and Applications, 2019, 78 : 33593 - 33615
  • [46] Regularized least square discriminant projection and feature selection
    Shi, Jun
    Jiang, Zhiguo
    Zhao, Danpei
    Feng, Hao
    Gao, Chao
    JOURNAL OF ELECTRONIC IMAGING, 2014, 23 (01)
  • [47] A Regularized Framework for Feature Selection in Face Detection and Authentication
    Augusto Destrero
    Christine De Mol
    Francesca Odone
    Alessandro Verri
    International Journal of Computer Vision, 2009, 83 : 164 - 177
  • [48] Robust feature selection based on regularized brownboost loss
    Wei, Pan
    Hu, Qinghua
    Ma, Peijun
    Su, Xiaohong
    KNOWLEDGE-BASED SYSTEMS, 2013, 54 : 180 - 198
  • [49] Manifold regularized stacked denoising autoencoders with feature selection
    Yu, Jianbo
    NEUROCOMPUTING, 2019, 358 : 235 - 245
  • [50] Subspace Clustering with Feature Grouping for Categorical Data
    Jia, Hong
    Dong, Menghan
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, KSEM 2023, 2023, 14117 : 247 - 254