Adaptive lasso with weights based on normalized filtering scores in molecular big data

被引:3
|
作者
Patil, Abhijeet R. [1 ]
Park, Byung-Kwon [2 ]
Kim, Sangjin [2 ]
机构
[1] Univ Texas El Paso, Computat Sci, El Paso, TX 79968 USA
[2] Dong A Univ, Dept Management Informat Syst, Busan 49236, South Korea
来源
关键词
Adaptive lasso; feature ranking; sure independence screening; accuracy; geometric mean; LOGISTIC-REGRESSION; VARIABLE SELECTION; RIDGE REGRESSION; SHRINKAGE;
D O I
10.1142/S0219633620400106
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The molecular big data are highly correlated, and numerous genes are not related. The various classification methods performance mainly rely on the selection of significant genes. Sparse regularized regression (SRR) models using the least absolute shrinkage and selection operator (lasso) and adaptive lasso (alasso) are popularly used for gene selection and classification. Nevertheless, it becomes challenging when the genes are highly correlated. Here, we propose a modified adaptive lasso with weights using the ranking-based feature selection (RFS) methods capable of dealing with the highly correlated gene expression data. Firstly, an RFS methods such as Fisher's score (FS), Chi-square (CS), and information gain (IG) are employed to ignore the unimportant genes and the top significant genes are chosen through sure independence screening (SIS) criteria. The scores of the ranked genes are normalized and assigned as proposed weights to the alasso method to obtain the most significant genes that were proven to be biologically related to the cancer type and helped in attaining higher classification performance. With the synthetic data and real application of microarray data, we demonstrated that the proposed alasso method with RFS methods is a better approach than the other known methods such as alasso with filtering such as ridge and marginal maximum likelihood estimation (MMLE), lasso and alasso without filtering. The metrics of accuracy, area under the receiver operating characteristics curve (AUROC), and geometric mean (GM-mean) are used for evaluating the performance of the models.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] A Normalized Adaptive Filtering Algorithm Based on Geometric Algebra
    Wang, Rui
    Liang, Meixiang
    He, Yinmei
    Wang, Xiangyang
    Cao, Wenming
    IEEE ACCESS, 2020, 8 (08): : 92861 - 92874
  • [2] Distributed adaptive lasso penalized generalized linear models for big data
    Fan, Ye
    Fan, Suning
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (04) : 1679 - 1698
  • [3] Big Data Filtering Through Adaptive Resonance Theory
    Barton, Adam
    Volna, Eva
    Kotyrba, Martin
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2017), PT II, 2017, 10192 : 382 - 391
  • [4] Simulation Realization of Adaptive Filtering Based on a Normalized LMS Algorithm
    Li, Yafeng
    Zheng, Ziwei
    MECHATRONICS, ROBOTICS AND AUTOMATION, PTS 1-3, 2013, 373-375 : 1159 - +
  • [5] Novel normalized subband adaptive filtering algorithms with weights-dependent variable step-size
    Li, Ke
    Yu, Yi
    He, Hongsen
    Yu, Tao
    de Lamare, Rodrigo
    Digital Signal Processing: A Review Journal, 2025, 158
  • [6] Multikernel adaptive filtering over graphs based on normalized LMS algorithm
    Xiao, Yilin
    Yan, Wenxu
    Dogancay, Kutluyil
    Ni, Hongyu
    Wang, Wenyuan
    SIGNAL PROCESSING, 2024, 214
  • [7] Fractional Normalized Subband Adaptive Filtering Algorithm Based on Mixture Correntropy
    Huo Y.
    Ding R.
    Qi Y.
    Tuo L.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2023, 46 (05): : 28 - 34
  • [8] Combined Forecasting Model based on Self-adaptive Filtering Weights
    Yu, Junfu
    Yu, Zhen
    Gu, Xinping
    Wei, Lianxing
    2019 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2019), VOL 2, 2019, : 221 - 224
  • [9] Cluster-based data filtering for manufacturing big data systems
    Li, Yifu
    Deng, Xinwei
    Ba, Shan
    Myers, William R.
    Brenneman, William A.
    Lange, Steve J.
    Zink, Ron
    Jin, Ran
    JOURNAL OF QUALITY TECHNOLOGY, 2022, 54 (03) : 290 - 302
  • [10] Data Adjustment of Power System Based on Kalman Filtering and Adaptive Filtering
    Lin, Zhi
    Hu, Lijuan
    Liu, Ke-yan
    Yin, Zhongdong
    2018 2ND IEEE CONFERENCE ON ENERGY INTERNET AND ENERGY SYSTEM INTEGRATION (EI2), 2018, : 309 - 314