A Machine Learning Approach for the Classification of Kidney Cancer Subtypes Using miRNA Genome Data

被引:34
|
作者
Ali, Ali Muhamed [1 ,2 ]
Zhuang, Hanqi [1 ,2 ]
Ibrahim, Ali [1 ,2 ]
Rehman, Oneeb [1 ,2 ]
Huang, Michelle [1 ,2 ]
Wu, Andrew [1 ,2 ]
机构
[1] Florida Atlantic Univ, CEECS Dept, Boca Raton, FL 33431 USA
[2] 777 Glades Rd, Boca Raton, FL 33431 USA
来源
APPLIED SCIENCES-BASEL | 2018年 / 8卷 / 12期
基金
美国国家科学基金会;
关键词
kidney cancer; subtype classification; miRNA as biomarker; machine learning; TCGA; RENAL-CELL CARCINOMA; MICRORNA; IDENTIFICATION; CONSEQUENCES;
D O I
10.3390/app8122422
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Kidney cancer is one of the deadliest diseases and its diagnosis and subtype classification are crucial for patients' survival. Thus, developing automated tools that can accurately determine kidney cancer subtypes is an urgent challenge. It has been confirmed by researchers in the biomedical field that miRNA dysregulation can cause cancer. In this paper, we propose a machine learning approach for the classification of kidney cancer subtypes using miRNA genome data. Through empirical studies we found 35 miRNAs that possess distinct key features that aid in kidney cancer subtype diagnosis. In the proposed method, Neighbourhood Component Analysis (NCA) is employed to extract discriminative features from miRNAs and Long Short Term Memory (LSTM), a type of Recurrent Neural Network, is adopted to classify a given miRNA sample into kidney cancer subtypes. In the literature, only a couple of kidney subtypes have been considered for classification. In the experimental study, we used the miRNA quantitative read counts data, which was provided by The Cancer Genome Atlas data repository (TCGA). The NCA procedure selected 35 of the most discriminative miRNAs. With this subset of miRNAs, the LSTM algorithm was able to group kidney cancer miRNAs into five subtypes with average accuracy around 95% and Matthews Correlation Coefficient value around 0.92 under 10 runs of randomly grouped 5-fold cross-validation, which were very close to the average performance of using all miRNAs for classification.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] A classification-based machine learning approach for the analysis of genome-wide expression data
    Lyons-Weiler, J
    Patel, S
    Bhattacharya, S
    [J]. GENOME RESEARCH, 2003, 13 (03) : 503 - 512
  • [22] Emotional state classification from EEG data using machine learning approach
    Wang, Xiao-Wei
    Nie, Dan
    Lu, Bao-Liang
    [J]. NEUROCOMPUTING, 2014, 129 : 94 - 106
  • [23] A Secure Data Classification Model in Cloud Computing Using Machine Learning Approach
    Kaur, Kulwinder
    Zandu, Vikas
    [J]. INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2016, 9 (08): : 13 - 21
  • [24] An Approach for the Classification of Rock Types Using Machine Learning of Core and Log Data
    Xing, Yihan
    Yang, Huiting
    Yu, Wei
    [J]. SUSTAINABILITY, 2023, 15 (11)
  • [25] Poem Classification Using Machine Learning Approach
    Kumar, Vipin
    Minz, Sonajharia
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2012), 2014, 236 : 675 - 682
  • [26] Seismic Data Classification using Machine Learning
    Li, Wenrui
    Nakshatra
    Narvekar, Nishita
    Raut, Nitisha
    Sirkeci, Birsen
    Gao, Jerry
    [J]. 2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2018), 2018, : 56 - 63
  • [27] Multiclass Classification of Cancer Based on Microarray Data Using Extreme Learning Machine
    Khadijah
    Rismiyati
    Mantau, Aprinaldi Jasa
    [J]. 2017 1ST INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS), 2017, : 159 - 164
  • [28] Machine Learning Methods for Cancer Classification Using Gene Expression Data: A Review
    Alharbi, Fadi
    Vakanski, Aleksandar
    [J]. BIOENGINEERING-BASEL, 2023, 10 (02):
  • [29] Accurate Molecular Classification of Kidney Cancer Subtypes Using MicroRNA Signature
    Youssef, Youssef M.
    White, Nicole M. A.
    Grigull, Joerg
    Krizova, Adriana
    Samy, Christina
    Mejia-Guerrero, Salvador
    Evans, Andrew
    Yousef, George M.
    [J]. EUROPEAN UROLOGY, 2011, 59 (05) : 721 - 730
  • [30] Review of Acute Kidney Injury Classification Using Machine Learning
    Shah, Norliyana Nor Hisham
    Razak, Normy
    Abu-Samah, Asma
    Razak, Athirah Abdul
    [J]. 2020 IEEE-EMBS CONFERENCE ON BIOMEDICAL ENGINEERING AND SCIENCES (IECBES 2020): LEADING MODERN HEALTHCARE TECHNOLOGY ENHANCING WELLNESS, 2021, : 324 - 328