Fractal feature selection model for enhancing high-dimensional biological problems

Cited by: 1
Authors
Alsaeedi, Ali Hakem [1 ,2 ]
Al-Mahmood, Haider Hameed R. [3 ]
Alnaseri, Zainab Fahad [1 ]
Aziz, Mohammad R. [1 ]
Al-Shammary, Dhiah [1 ]
Ibaida, Ayman [4 ]
Ahmed, Khandakar [4 ]
Affiliations
[1] Univ Al Qadisiyah, Coll Comp Sci & Informat Technol, Diwaniyah 58009, Iraq
[2] Imam Kadhum Coll, Dept Comp Tech, Diwaniyah 58009, Iraq
[3] Univ Mustansiriyah, Coll Sci, Dept Comp Sci, Baghdad 10052, Iraq
[4] Victoria Univ, Intelligent Technol Innovat Lab, Melbourne, Vic, Australia
Keywords
Bioinformatics; Feature selection; High-dimensional datasets; Fractal; Machine learning
DOI
10.1186/s12859-023-05619-z
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
The integration of biology, computer science, and statistics has given rise to the interdisciplinary field of bioinformatics, which aims to decode biological intricacies. The field produces extensive and diverse features, which makes classifying bioinformatics problems an enormous challenge. Therefore, an intelligent bioinformatics classification system must select the most relevant features to enhance machine learning performance. This paper proposes a feature selection model based on the fractal concept to improve the performance of intelligent systems in classifying high-dimensional biological problems. The proposed fractal feature selection (FFS) model divides features into blocks, measures the similarity between blocks using root mean square error (RMSE), and determines the importance of features based on low RMSE. The proposed FFS is tested and evaluated on ten high-dimensional bioinformatics datasets. The experimental results showed that the model significantly improved machine learning accuracy. The average accuracy rate was 79% with full features in machine learning algorithms, while FFS delivered promising results with an accuracy rate of 94%.
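The block-and-RMSE idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the block size, how blocks are compared, or how many are kept, so the fixed-size contiguous blocks, the mean-RMSE score, and the names `split_into_blocks`, `block_rmse`, `rank_blocks`, and `select_features` are all assumptions made for illustration.

```python
import math


def split_into_blocks(n_features, block_size):
    """Assumed blocking scheme: contiguous, equally sized blocks of feature
    indices. A trailing partial block is dropped."""
    return [list(range(start, start + block_size))
            for start in range(0, n_features - block_size + 1, block_size)]


def block_rmse(X, block_a, block_b):
    """RMSE between two equally sized feature blocks, taken over all samples."""
    total = 0.0
    count = 0
    for row in X:
        for ia, ib in zip(block_a, block_b):
            total += (row[ia] - row[ib]) ** 2
            count += 1
    return math.sqrt(total / count)


def rank_blocks(X, block_size):
    """Score each block by its mean RMSE to every other block (an assumed
    scoring rule) and return (block, score) pairs, lowest RMSE first."""
    blocks = split_into_blocks(len(X[0]), block_size)
    if len(blocks) < 2:
        raise ValueError("need at least two blocks to compare")
    scores = []
    for i, a in enumerate(blocks):
        others = [block_rmse(X, a, b) for j, b in enumerate(blocks) if j != i]
        scores.append(sum(others) / len(others))
    order = sorted(range(len(blocks)), key=lambda i: scores[i])
    return [(blocks[i], scores[i]) for i in order]


def select_features(X, block_size, n_blocks):
    """Keep the feature indices of the n_blocks lowest-RMSE blocks."""
    ranked = rank_blocks(X, block_size)
    return sorted(idx for block, _ in ranked[:n_blocks] for idx in block)


# Toy data (made up): 3 samples, 6 features. Columns 0-1 and 2-3 are
# identical blocks; columns 4-5 differ strongly from both.
X = [
    [1.0, 2.0, 1.0, 2.0, 9.0, 9.0],
    [2.0, 3.0, 2.0, 3.0, 0.0, 8.0],
    [3.0, 1.0, 3.0, 1.0, 7.0, 0.0],
]
print(select_features(X, block_size=2, n_blocks=2))
```

On this toy input the two mutually similar blocks receive the lowest scores and are retained, while the dissimilar block is dropped. Whether the paper ranks blocks in exactly this way cannot be determined from the abstract alone.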
Pages: 23
Related papers
50 in total
  • [1] Fractal feature selection model for enhancing high-dimensional biological problems
    Ali Hakem Alsaeedi
    Haider Hameed R. Al-Mahmood
    Zainab Fahad Alnaseri
    Mohammad R. Aziz
    Dhiah Al-Shammary
    Ayman Ibaida
    Khandakar Ahmed
    [J]. BMC Bioinformatics, 25
  • [2] Preconditioning for feature selection and regression in high-dimensional problems
    Paul, Debashis
    Bair, Eric
    Hastie, Trevor
    Tibshirani, Robert
    [J]. ANNALS OF STATISTICS, 2008, 36 (04) : 1595 - 1618
  • [3] FsNet: Feature Selection Network on High-dimensional Biological Data
    Singh, Dinesh
    Climente-Gonzalez, Hector
    Petrovich, Mathis
    Kawakami, Eiryo
    Yamada, Makoto
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023
  • [4] Simultaneous Feature and Model Selection for High-Dimensional Data
    Perolini, Alessandro
    Guerif, Sebastien
    [J]. 2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011 : 47 - 50
  • [5] Projective inference in high-dimensional problems: Prediction and feature selection
    Piironen, Juho
    Paasiniemi, Markus
    Vehtari, Aki
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2020, 14 (01) : 2155 - 2197
  • [6] Ultra High-Dimensional Nonlinear Feature Selection for Big Biological Data
    Yamada, Makoto
    Tang, Jiliang
    Lugo-Martinez, Jose
    Hodzic, Ermin
    Shrestha, Raunak
    Saha, Avishek
    Ouyang, Hua
    Yin, Dawei
    Mamitsuka, Hiroshi
    Sahinalp, Cenk
    Radivojac, Predrag
    Menczer, Filippo
    Chang, Yi
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (07) : 1352 - 1365
  • [7] Feature selection for high-dimensional data
    Bolón-Canedo V.
    Sánchez-Maroño N.
    Alonso-Betanzos A.
    [J]. Progress in Artificial Intelligence, 2016, 5 (2) : 65 - 75
  • [8] Feature selection for high-dimensional data
    Destrero A.
    Mosci S.
    De Mol C.
    Verri A.
    Odone F.
    [J]. Computational Management Science, 2009, 6 (1) : 25 - 40
  • [9] Enhancing protection in high-dimensional data: Distributed differential privacy with feature selection
    Putrama, I. Made
    Martinek, Peter
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (06)
  • [10] Interaction-based feature selection and classification for high-dimensional biological data
    Wang, Haitian
    Lo, Shaw-Hwa
    Zheng, Tian
    Hu, Inchi
    [J]. BIOINFORMATICS, 2012, 28 (21) : 2834 - 2842