Classification of miRNA Expression Data Using Random Forests for Cancer Diagnosis

被引:7
|
作者
Razak, Eliza [1 ]
Yusorf, Faridah [1 ]
Raus, Raha Ahmad [1 ]
机构
[1] Int Islamic Univ Malaysia, Kuala Lumpur, Malaysia
关键词
miRNA; cancer; random forest; classification; MICRORNA; BIOMARKERS;
D O I
10.1109/ICCCE.2016.49
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Cancer is a major leading cause of death and responsible for around 13% of all deaths world-wide. Cancer incidence rate is growing at an alarming rate in Malaysia and the world as we know it. It is estimated that statistically one out of every four Malaysians will develop cancer by the age of 75. Conventional methods of diagnosing cancer rely solely on skilled physicians, with the help of medical imaging, to detect certain symptoms which usually appear in the late stage of cancer. Furthermore, biopsy examinations are highly invasive since tissue samples are required to be extracted from patients. There exist minimally invasive cancer biomarkers in forms of proteins from serum. Nevertheless, existing protein-based diagnosis techniques require labor-intensive analysis compounded by low diagnosis sensitivity. There have indeed been a number of studies to identify novel miRNA-based cancer biomarkers. However, the existing diagnosis techniques using miRNA suffer from low diagnosis accuracy, sensitivity, and specificity. The low diagnosis accuracy and sensitivity of the existing techniques stems from the fact that there is extremely low miRNA count in body fluids. There is also an inevitable problem of cross contamination between cells and exosomes in sample preparation steps. This paper proposes to circumvent these problems in data analysis stage with a machine learning technique called Random Forest. The proposed system achieved 93.48 % accuracy for gastric cancer and 100 % accuracy for ovarian cancer. The results are promising and encouraging. Despite much noise contaminated the sample preparation process and low miRNA count in body fluids, the proposed system able to identify miRNA markers responsible for classification of cancer.
引用
收藏
页码:187 / 190
页数:4
相关论文
共 50 条
  • [41] Rectal Cancer Outcome Prediction Based On Institutional Data with Random Forests and Random Survival Forests
    Huang, M.
    Zhong, H.
    Liu, D.
    Gabriel, P.
    Ben-Josef, E.
    Yin, L.
    Geng, H.
    Cheng, C.
    Bilker, W.
    Xiao, Y.
    MEDICAL PHYSICS, 2017, 44 (06)
  • [42] Increasing prediction performance of colorectal cancer disease status using random forests classification based on metagenomic shotgun sequencing data
    Gao, Yilin
    Zhu, Zifan
    Sun, Fengzhu
    SYNTHETIC AND SYSTEMS BIOTECHNOLOGY, 2022, 7 (01) : 574 - 585
  • [43] Relevance of airborne lidar and multispectral image data for urban scene classification using Random Forests
    Guo, Li
    Chehata, Nesrine
    Mallet, Clement
    Boukir, Samia
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2011, 66 (01) : 56 - 66
  • [44] Urban landcover classification from multispectral image data using optimized AdaBoosted random forests
    Isaac, Ebenezer
    Easwarakumar, K. S.
    Isaac, Joseph
    REMOTE SENSING LETTERS, 2017, 8 (04) : 350 - 359
  • [45] Accelerated Real-Time Classification of Evolving Data Streams using Adaptive Random Forests
    Ridder, Frank
    Chen, Kuan-Hsun
    Alachiotis, Nikolaos
    2023 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY, ICFPT, 2023, : 232 - 237
  • [46] Gas Turbine Fault Diagnosis using Random Forests
    Maragoudakis, Manolis
    Loukis, Euripides
    Pantelides, Panayotis-Prodromos
    ECAI 2008, PROCEEDINGS, 2008, 178 : 769 - +
  • [47] Oxides Classification with Random Forests
    Xiao, Kai
    Chen, Baitong
    Bao, Wenzheng
    Cheng, Honglin
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2022, PT II, 2022, 13394 : 680 - 686
  • [48] Random forests for classification in ecology
    Cutler, D. Richard
    Edwards, Thomas C., Jr.
    Beard, Karen H.
    Cutler, Adele
    Hess, Kyle T.
    ECOLOGY, 2007, 88 (11) : 2783 - 2792
  • [49] Classification and interaction in random forests
    Denisko, Danielle
    Hoffman, Michael M.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2018, 115 (08) : 1690 - 1692
  • [50] Diagnosis of Prostate Cancer Using Gene Expression Data
    Kaplan, Kaplan
    Ertunc, Huseyin Metin
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,