Adaptive lasso with weights based on normalized filtering scores in molecular big data

被引:3
|
作者
Patil, Abhijeet R. [1 ]
Park, Byung-Kwon [2 ]
Kim, Sangjin [2 ]
机构
[1] Univ Texas El Paso, Computat Sci, El Paso, TX 79968 USA
[2] Dong A Univ, Dept Management Informat Syst, Busan 49236, South Korea
来源
关键词
Adaptive lasso; feature ranking; sure independence screening; accuracy; geometric mean; LOGISTIC-REGRESSION; VARIABLE SELECTION; RIDGE REGRESSION; SHRINKAGE;
D O I
10.1142/S0219633620400106
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The molecular big data are highly correlated, and numerous genes are not related. The various classification methods performance mainly rely on the selection of significant genes. Sparse regularized regression (SRR) models using the least absolute shrinkage and selection operator (lasso) and adaptive lasso (alasso) are popularly used for gene selection and classification. Nevertheless, it becomes challenging when the genes are highly correlated. Here, we propose a modified adaptive lasso with weights using the ranking-based feature selection (RFS) methods capable of dealing with the highly correlated gene expression data. Firstly, an RFS methods such as Fisher's score (FS), Chi-square (CS), and information gain (IG) are employed to ignore the unimportant genes and the top significant genes are chosen through sure independence screening (SIS) criteria. The scores of the ranked genes are normalized and assigned as proposed weights to the alasso method to obtain the most significant genes that were proven to be biologically related to the cancer type and helped in attaining higher classification performance. With the synthetic data and real application of microarray data, we demonstrated that the proposed alasso method with RFS methods is a better approach than the other known methods such as alasso with filtering such as ridge and marginal maximum likelihood estimation (MMLE), lasso and alasso without filtering. The metrics of accuracy, area under the receiver operating characteristics curve (AUROC), and geometric mean (GM-mean) are used for evaluating the performance of the models.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] CORRENTROPY-BASED ADAPTIVE FILTERING OF NONCIRCULAR COMPLEX DATA
    Dees, Bruno Scalzo
    Xia, Yili
    Douglas, Scott C.
    Mandic, Danilo P.
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4339 - 4343
  • [22] A New Adaptive Notch Filtering Algorithm Based on Normalized Lattice Structure with Improved Mean Update Term
    Nakamura, Shinichiro
    Koshita, Shunsuke
    Abe, Masahide
    Kawamata, Masayuki
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2015, E98A (07) : 1482 - 1493
  • [23] Adaptive Caching Strategy Based on Big Data Learning in ICN
    Cai, Ling
    Wang, Xingwei
    Li, Keqin
    Cheng, Hui
    Cao, Jiannong
    JOURNAL OF INTERNET TECHNOLOGY, 2018, 19 (06): : 1677 - 1689
  • [24] Data envelopment analysis method based on a common set of normalized weights using bargaining game thought
    Wang, Qing
    Wei, Keke
    Zhang, Yang
    Wang, Xuan
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 154
  • [25] A Scalable Adaptive Sampling Based Approach for Big Data Classification
    Djouzi, Kheyreddine
    Beghdad-Bey, Kadda
    Amamra, Abdenour
    ADVANCES IN COMPUTING SYSTEMS AND APPLICATIONS, 2022, 513 : 73 - 83
  • [26] Towards ontology-based multilingual URL filtering: a big data problem
    Hussain, Mubashar
    Ahmed, Mansoor
    Khattak, Hasan Ali
    Imran, Muhammad
    Khan, Abid
    Din, Sadia
    Ahmad, Awais
    Jeon, Gwanggil
    Reddy, Alavalapati Goutham
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (10): : 5003 - 5021
  • [27] A Big Data Analysis Method Based on Modified Collaborative Filtering Recommendation Algorithms
    Yin, Nan
    OPEN PHYSICS, 2019, 17 (01): : 966 - 974
  • [28] Research on the filtering recommendation technology of network information based on big data environment
    Cui, Lei
    INTERNATIONAL JOURNAL OF INTERNET PROTOCOL TECHNOLOGY, 2020, 13 (04) : 211 - 218
  • [29] Research on big data mining based on improved parallel collaborative filtering algorithm
    Li Zhu
    Heng Li
    Yuxuan Feng
    Cluster Computing, 2019, 22 : 3595 - 3604
  • [30] ClubCF: A Clustering-Based Collaborative Filtering Approach for Big Data Application
    Hu, Rong
    Dou, Wanchun
    Liu, Jianxun
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2014, 2 (03) : 302 - 313