Automated Feature Selection in Microarray Data Analysis using Deep Learning

被引:0
|
作者
Tekade, Pallavi [1 ]
Joshi, Ram [1 ]
Salunke, Dipmala [1 ]
Gore, Shubham [1 ]
Shinde, Shaunak [1 ]
Bahirat, Divya [1 ]
机构
[1] JSPMs Rajarshri Shahu Coll Engn, Informat Technol, Pune, Maharashtra, India
关键词
Deep Learning; Feature Selection; Microarray Data; Bioinformatics; Genetic Markers;
D O I
10.1109/ICSCSS60660.2024.10625652
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cancer is the major reason of death around the world. However, timely identification and accurate prediction of cancer types play a pivotal role in safeguarding patient health. Microarray technology revolutionizes cancer detection by enabling the simultaneous examination of thousands of genes, simplifying the acquisition of extensive gene expression data. However, conventional cancer detection algorithms struggle with the vast amount of generated data. This study addresses these challenges by employing Principal Component Analysis (PCA) and Deep Learning, specifically Stacked Autoencoders, to reduce the dimensionality of large microarray datasets while preserving essential features. Utilizing these techniques, feature selection was conducted on datasets containing over 7000 features. The selected features were evaluated using standard regression and classification methods, with Logistic Regression emerging as the most effective, achieving an impressive 99.37% accuracy, followed by the Decision Tree classifier at 98.34%. Additionally, a Flask-based web application was developed to facilitate seamless CSV file upload for analysis, enhancing user accessibility and streamlining the data processing and analysis workflow. This user-friendly interface empowers researchers and practitioners to navigate through data complexities efficiently, fostering a more productive research environment.
引用
收藏
页码:1060 / 1066
页数:7
相关论文
共 50 条
  • [31] GEOlimma: differential expression analysis and feature selection using pre-existing microarray data
    Liangqun Lu
    Kevin A. Townsend
    Bernie J. Daigle
    BMC Bioinformatics, 22
  • [32] GEOlimma: differential expression analysis and feature selection using pre-existing microarray data
    Lu, Liangqun
    Townsend, Kevin A.
    Daigle, Bernie J., Jr.
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [33] Hybrid GA-IBPSO for feature selection using microarray data
    Yang, Cheng-San
    Chuang, Li-Yeh
    Ho, Chang-Hsuan
    Yang, Cheng-Hong
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 284 - +
  • [34] Feature Selection of Microarray Data Using Simulated Kalman Filter with Mutation
    Zamri, Nurhawani Ahmad
    Aziz, Nor Azlina Ab
    Bhuvaneswari, Thangavel
    Aziz, Nor Hidayati Abdul
    Ghazali, Anith Khairunnisa
    PROCESSES, 2023, 11 (08)
  • [35] Global feature selection from microarray data using Lagrange multipliers
    Sun, Shiquan
    Peng, Qinke
    Zhang, Xiaokang
    KNOWLEDGE-BASED SYSTEMS, 2016, 110 : 267 - 274
  • [36] Ensemble Feature Selection for Breast Cancer Classification using Microarray Data
    Hengpraprohm, Supoj
    Jungjit, Suwimol
    INTELIGENCIA ARTIFICIAL-IBEROAMERICAL JOURNAL OF ARTIFICIAL INTELLIGENCE, 2020, 23 (65): : 100 - 114
  • [37] A method for feature selection on microarray data using support vector machine
    Huang, Xiao Bing
    Tang, Jian
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4081 : 513 - 523
  • [38] Analysis of Feature Selection Approaches in Large Scale Cyber Intelligence Data with Deep Learning
    Ahmetoglu, Huseyin
    Das, Resul
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [39] Automated Backend Selection for ProB Using Deep Learning
    Dunkelau, Jannik
    Krings, Sebastian
    Schmidt, Joshua
    NASA FORMAL METHODS (NFM 2019), 2019, 11460 : 130 - 147
  • [40] Memetic algorithms for feature selection on microarray data
    Zhu, Zexuan
    Ong, Yew-Soon
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 1327 - +