Adaptive lasso with weights based on normalized filtering scores in molecular big data

被引:3
|
作者
Patil, Abhijeet R. [1 ]
Park, Byung-Kwon [2 ]
Kim, Sangjin [2 ]
机构
[1] Univ Texas El Paso, Computat Sci, El Paso, TX 79968 USA
[2] Dong A Univ, Dept Management Informat Syst, Busan 49236, South Korea
来源
关键词
Adaptive lasso; feature ranking; sure independence screening; accuracy; geometric mean; LOGISTIC-REGRESSION; VARIABLE SELECTION; RIDGE REGRESSION; SHRINKAGE;
D O I
10.1142/S0219633620400106
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The molecular big data are highly correlated, and numerous genes are not related. The various classification methods performance mainly rely on the selection of significant genes. Sparse regularized regression (SRR) models using the least absolute shrinkage and selection operator (lasso) and adaptive lasso (alasso) are popularly used for gene selection and classification. Nevertheless, it becomes challenging when the genes are highly correlated. Here, we propose a modified adaptive lasso with weights using the ranking-based feature selection (RFS) methods capable of dealing with the highly correlated gene expression data. Firstly, an RFS methods such as Fisher's score (FS), Chi-square (CS), and information gain (IG) are employed to ignore the unimportant genes and the top significant genes are chosen through sure independence screening (SIS) criteria. The scores of the ranked genes are normalized and assigned as proposed weights to the alasso method to obtain the most significant genes that were proven to be biologically related to the cancer type and helped in attaining higher classification performance. With the synthetic data and real application of microarray data, we demonstrated that the proposed alasso method with RFS methods is a better approach than the other known methods such as alasso with filtering such as ridge and marginal maximum likelihood estimation (MMLE), lasso and alasso without filtering. The metrics of accuracy, area under the receiver operating characteristics curve (AUROC), and geometric mean (GM-mean) are used for evaluating the performance of the models.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] RLS-Laguerre lattice adaptive filtering: Error-feedback, normalized, and array-based algorithms
    Merched, R
    Sayed, AH
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2001, 49 (11) : 2565 - 2576
  • [42] Efficient sign based normalized adaptive filtering techniques for cancelation of artifacts in ECG signals: Application to wireless biotelemetry
    Rahman, Muhammad Zia Ur
    Shaik, Rafi Ahamed
    Reddy, D. V. Rama Koti
    SIGNAL PROCESSING, 2011, 91 (02) : 225 - 239
  • [43] Data-based design of robust fault detection and isolation residuals via LASSO optimization and Bayesian filtering
    Cascianelli, Silvia
    Costante, Gabriele
    Crocetti, Francesco
    Ricci, Elisa
    Valigi, Paolo
    Luca Fravolini, Mario
    ASIAN JOURNAL OF CONTROL, 2021, 23 (01) : 57 - 71
  • [44] An adaptive algorithm for voice quality based on big data voiceprint identification
    Wang J.
    Kang R.
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [45] Improvement of Adaptive Learning Service Recommendation Algorithm Based on Big Data
    Yang, Ya-zhi
    Zhong, Yong
    Wozniak, Marcin
    MOBILE NETWORKS & APPLICATIONS, 2021, 26 (05): : 2176 - 2187
  • [46] Attribute-Based Adaptive Homomorphic Encryption for Big Data Security
    Thenmozhi, R.
    Shridevi, S.
    Mohanty, Sachi Nandan
    Garcia Diaz, Vicente
    Gupta, Deepak
    Tiwari, Prayag
    Shorfuzzaman, Mohammad
    BIG DATA, 2021,
  • [47] Hybrid security analysis based on intelligent adaptive learning in Big Data
    Sasubilli, Satya Murthy
    Dubey, Ashutosh Kumar
    Kumar, Abhishek
    PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATION ENGINEERING (ICACCE-2020), 2020,
  • [48] An adaptive rule-based classifier for mining big biological data
    Farid, Dewan Md
    Al-Mamun, Mohammad Abdullah
    Manderick, Bernard
    Nowe, Ann
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 64 : 305 - 316
  • [49] Random Partition Based Adaptive Distributed Kernelized SVM for Big Data
    Pal, Amrit
    Chowdhury, Abishi
    Satakshi
    Narman, Husnu S.
    Chowdhury, Arkabandhu
    Kumar, Manish
    IEEE ACCESS, 2022, 10 : 95623 - 95637
  • [50] Improvement of Adaptive Learning Service Recommendation Algorithm Based on Big Data
    Ya-zhi Yang
    Yong Zhong
    Marcin Woźniak
    Mobile Networks and Applications, 2021, 26 : 2176 - 2187