Classification of miRNA Expression Data Using Random Forests for Cancer Diagnosis

Cited by: 7
Authors
Razak, Eliza [1]
Yusorf, Faridah [1]
Raus, Raha Ahmad [1]
Affiliations
[1] Int Islamic Univ Malaysia, Kuala Lumpur, Malaysia
Keywords
miRNA; cancer; random forest; classification; MICRORNA; BIOMARKERS
DOI
10.1109/ICCCE.2016.49
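Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract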
Cancer is a leading cause of death, responsible for around 13% of all deaths worldwide. Cancer incidence is growing at an alarming rate in Malaysia and around the world. It is estimated that one in four Malaysians will develop cancer by the age of 75. Conventional methods of diagnosing cancer rely solely on skilled physicians who, with the help of medical imaging, detect symptoms that usually appear only in the late stages of cancer. Furthermore, biopsy examinations are highly invasive, since tissue samples must be extracted from patients. Minimally invasive cancer biomarkers exist in the form of serum proteins. Nevertheless, existing protein-based diagnosis techniques require labor-intensive analysis and offer low diagnostic sensitivity. A number of studies have sought to identify novel miRNA-based cancer biomarkers. However, existing miRNA-based diagnosis techniques suffer from low accuracy, sensitivity, and specificity. The low accuracy and sensitivity stem from the extremely low miRNA count in body fluids. Cross-contamination between cells and exosomes during sample preparation is also unavoidable. This paper proposes to circumvent these problems at the data analysis stage with a machine learning technique called Random Forest. The proposed system achieved 93.48% accuracy for gastric cancer and 100% accuracy for ovarian cancer. The results are promising: despite the noise introduced during sample preparation and the low miRNA count in body fluids, the proposed system was able to identify miRNA markers responsible for the classification of cancer.
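The abstract compares diagnosis techniques by accuracy, sensitivity, and specificity. As a minimal sketch of how these three figures are conventionally computed from a binary confusion matrix, the Python snippet below uses hypothetical labels; the function names and the sample data are illustrative assumptions and are not taken from the paper.

```python
import math


def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary problem.

    `positive` marks the label treated as "cancer"; any other label is
    treated as "healthy". This labelling is an assumption of the sketch.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive and pred == positive:
            tp += 1
        elif truth == positive:
            fn += 1
        elif pred == positive:
            fp += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def diagnostic_metrics(tp, fp, tn, fn):
    """Return (accuracy, sensitivity, specificity) from the four counts."""
    total = tp + fp + tn + fn
    if total == 0:
        raise ValueError("at least one sample is required")
    accuracy = (tp + tn) / total
    # Sensitivity: fraction of actual positives that were detected.
    sensitivity = tp / (tp + fn) if (tp + fn) else math.nan
    # Specificity: fraction of actual negatives that were correctly cleared.
    specificity = tn / (tn + fp) if (tn + fp) else math.nan
    return accuracy, sensitivity, specificity


# Hypothetical labels for eight samples (1 = cancer, 0 = healthy).
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]

counts = confusion_counts(y_true, y_pred)
accuracy, sensitivity, specificity = diagnostic_metrics(*counts)
print(f"accuracy={accuracy:.2f} sensitivity={sensitivity:.2f} "
      f"specificity={specificity:.2f}")
```

Reporting all three figures together matters for diagnosis: accuracy alone can look high on an imbalanced sample set even when most cancer cases are missed.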
Pages: 187-190
Number of pages: 4
Related papers
50 in total
  • [31] Web Document Classification by Keywords Using Random Forests
    Klassen, Myungsook
    Paturi, Nikhila
    NETWORKED DIGITAL TECHNOLOGIES, PT 2, 2010, 88 : 256 - 261
  • [32] Baker's Cyst Classification Using Random Forests
    Ciszkiewicz, Adam
    Milewski, Grzegorz
    Lorkowski, Jacek
    PROCEEDINGS OF THE 2018 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2018, : 97 - 100
  • [33] Pathway analysis using random forests classification and regression
    Pang, Herbert
    Lin, Aiping
    Holford, Matthew
    Enerson, Bradley E.
    Lu, Bin
    Lawton, Michael P.
    Floyd, Eugenia
    Zhao, Hongyu
    BIOINFORMATICS, 2006, 22 (16) : 2028 - 2036
  • [35] An efficient random forests algorithm for high dimensional data classification
    Wang, Qiang
    Thanh-Tung Nguyen
    Huang, Joshua Z.
    Thuy Thi Nguyen
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2018, 12 (04) : 953 - 972
  • [36] Correction to: Adaptive random forests for evolving data stream classification
    Heitor M. Gomes
    Albert Bifet
    Jesse Read
    Jean Paul Barddal
    Fabrício Enembreck
    Bernhard Pfahringer
    Geoff Holmes
    Talel Abdessalem
    Machine Learning, 2019, 108 : 1877 - 1878
  • [37] Data mining with Random Forests as a methodology for biomedical signal classification
    Proniewska, Klaudia
    BIO-ALGORITHMS AND MED-SYSTEMS, 2016, 12 (02) : 89 - 92
  • [38] Segmentation of PMSE Data Using Random Forests
    Jozwicki, Dorota
    Sharma, Puneet
    Mann, Ingrid
    Hoppe, Ulf-Peter
    REMOTE SENSING, 2022, 14 (13)
  • [39] Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data
    Mohammed, Mohanad
    Mboya, Innocent B.
    Mwambi, Henry
    Elbashir, Murtada K.
    Omolo, Bernard
    PLOS ONE, 2021, 16 (12)
  • [40] Prediction of Poor Mental Health Following Breast Cancer Diagnosis Using Random Forests
    Mylona, Eugenia
    Kourou, Konstantina
    Manikis, Georgios
    Kondylakis, Haridimos
    Marias, Kostas
    Karademas, Evangelos
    Poikonen-Saksela, Paula
    Mazzocco, Ketti
    Marzorati, Chiara
    Pat-Horenczyk, Ruth
    Roziner, Ilan
    Sousa, Berta
    Oliveira-Maia, Albino
    Simos, Panagiotis
    Fotiadis, Dimitrios I.
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 1753 - 1756