Dimensionality Reduction on Metagenomic Data with Recursive Feature Elimination

被引:0
|
作者
Huong Hoang Luong [1 ]
Nghia Trong Le Phan [1 ]
Tin Tri Duong [1 ]
Thuan Minh Dang [1 ]
Tong Duc Nguyen [1 ]
Hai Thanh Nguyen [2 ]
机构
[1] FPT Univ, Can Tho 900000, Vietnam
[2] Can Tho Univ, Coll Informat & Commun Technol, Can Tho, Vietnam
关键词
Inflammatory bowel disease; Metagenomic data; Recursive Feature Elimination; Dimensionality reduction; K-FOLD;
D O I
10.1007/978-3-030-79725-6_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Fourth Industrial Revolution has a significant impact on many aspects, which help improve and develop significantly. These beneficial works give a better life for all society. When we mention the medical or healthcare field, there has been much creative and vital research that promotes everyone's life. Inflammatory Bowel Disease (IBD) is one of the most dangerous diseases that can cause millions of deaths every year. In this research, we would like to raise a topic about IBD diagnosis using metagenomic data to advance prediction for initial detection. The problem is not well-studied adequately due to the lack of data and information in the past. However, with the rapid development of technology, we obtain massive data where a metagenomic sample can contain thousands of bacterial species. To evaluate which species are essential to the considered disease, this work investigates a dimension reduction approach based on Recursive Feature Elimination combining with Random Forest to provide practical prediction tasks on metagenomic data. The relationship between bacteria causing IBD is what we have to figure out. Our goal is to evaluate whether we can make a more reliable prediction using a precise quantity of features decided by Recursive Feature Elimination (RFE). The proposed method gives positively promising results, which can reach 0.927 in accuracy using thirty selected features and achieve a significant improvement compared to the random feature selection.
引用
收藏
页码:68 / 79
页数:12
相关论文
共 50 条
  • [1] The Data Dimensionality Reduction in the Classification Process Through Greedy Backward Feature Elimination
    Kostrzewa, Daniel
    Brzeski, Robert
    [J]. MAN-MACHINE INTERACTIONS 5, ICMMI 2017, 2018, 659 : 397 - 407
  • [2] Feature selection and analysis on correlated gas sensor data with recursive feature elimination
    Yan, Ke
    Zhang, David
    [J]. SENSORS AND ACTUATORS B-CHEMICAL, 2015, 212 : 353 - 363
  • [3] Enhanced recursive feature elimination
    Chen, Xue-Wen
    Jeong, Jong Cheol
    [J]. ICMLA 2007: SIXTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2007, : 429 - 435
  • [4] Feature extraction and dimensionality reduction for mass spectrometry data
    Liu, Yihui
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2009, 39 (09) : 818 - 823
  • [5] Feature subset selection and ranking for data dimensionality reduction
    Wei, Hua-Liang
    Billings, Stephen A.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (01) : 162 - 166
  • [6] Stepwise optimal feature selection for data dimensionality reduction
    Qin, Lifeng
    He, Dongjian
    Long, Yan
    [J]. Journal of Computational Information Systems, 2015, 11 (05): : 1647 - 1656
  • [7] The reduction of hyperspectral data dimensionality and classification based on recursive subspace fusion
    Wang, Q
    Zhang, Y
    Li, S
    Shen, Y
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2002, 11 (01) : 12 - 15
  • [8] Recursive Feature Elimination by Sensitivity Testing
    Escanilla, Nicholas Sean
    Hellerstein, Lisa
    Kleiman, Ross
    Kuang, Zhaobin
    Shull, James D.
    Page, David
    [J]. 2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 40 - 47
  • [9] Feature Dimensionality Reduction for Visualization and Clustering on Learning Process Data
    Supianto, Ahmad Afif
    Christyawan, Tomi Yahya
    Hafis, Muhammad
    Hayashi, Yusuke
    Hirashima, Tsukasa
    Hasanah, Nur
    [J]. PROCEEDINGS OF 2019 4TH INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY (SIET 2019), 2019, : 84 - 89
  • [10] Dimensionality reduction of hyperspectral data using spectral fractal feature
    Mukherjee, Kriti
    Ghosh, Jayanta Kumar
    Mittal, Ramesh Chand
    [J]. GEOCARTO INTERNATIONAL, 2012, 27 (06) : 515 - 531