Classification of miRNA Expression Data Using Random Forests for Cancer Diagnosis

被引:7
|
作者
Razak, Eliza [1 ]
Yusorf, Faridah [1 ]
Raus, Raha Ahmad [1 ]
机构
[1] Int Islamic Univ Malaysia, Kuala Lumpur, Malaysia
关键词
miRNA; cancer; random forest; classification; MICRORNA; BIOMARKERS;
D O I
10.1109/ICCCE.2016.49
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Cancer is a major leading cause of death and responsible for around 13% of all deaths world-wide. Cancer incidence rate is growing at an alarming rate in Malaysia and the world as we know it. It is estimated that statistically one out of every four Malaysians will develop cancer by the age of 75. Conventional methods of diagnosing cancer rely solely on skilled physicians, with the help of medical imaging, to detect certain symptoms which usually appear in the late stage of cancer. Furthermore, biopsy examinations are highly invasive since tissue samples are required to be extracted from patients. There exist minimally invasive cancer biomarkers in forms of proteins from serum. Nevertheless, existing protein-based diagnosis techniques require labor-intensive analysis compounded by low diagnosis sensitivity. There have indeed been a number of studies to identify novel miRNA-based cancer biomarkers. However, the existing diagnosis techniques using miRNA suffer from low diagnosis accuracy, sensitivity, and specificity. The low diagnosis accuracy and sensitivity of the existing techniques stems from the fact that there is extremely low miRNA count in body fluids. There is also an inevitable problem of cross contamination between cells and exosomes in sample preparation steps. This paper proposes to circumvent these problems in data analysis stage with a machine learning technique called Random Forest. The proposed system achieved 93.48 % accuracy for gastric cancer and 100 % accuracy for ovarian cancer. The results are promising and encouraging. Despite much noise contaminated the sample preparation process and low miRNA count in body fluids, the proposed system able to identify miRNA markers responsible for classification of cancer.
引用
收藏
页码:187 / 190
页数:4
相关论文
共 50 条
  • [21] Cancer Classification Using Gene Expression Data
    Sonsare, Pravinkumar
    Mujumdar, Aarya
    Joshi, Pranjali
    Morayya, Nipun
    Hablani, Sachal
    Khergade, Vedant
    SMART TRENDS IN COMPUTING AND COMMUNICATIONS, VOL 1, SMARTCOM 2024, 2024, 945 : 1 - 11
  • [22] Sleep classification from wrist-worn accelerometer data using random forests
    Sundararajan, Kalaivani
    Georgievska, Sonja
    te Lindert, Bart H. W.
    Gehrman, Philip R.
    Ramautar, Jennifer
    Mazzotti, Diego R.
    Sabia, Severine
    Weedon, Michael N.
    van Someren, Eus J. W.
    Ridder, Lars
    Wang, Jian
    van Hees, Vincent T.
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [23] Classification of sensor independent point cloud data of building objects using random forests
    Bassier, Maarten
    Van Genechten, Bjorn
    Vergauwen, Maarten
    JOURNAL OF BUILDING ENGINEERING, 2019, 21 : 468 - 477
  • [24] Sleep classification from wrist-worn accelerometer data using random forests
    Kalaivani Sundararajan
    Sonja Georgievska
    Bart H. W. te Lindert
    Philip R. Gehrman
    Jennifer Ramautar
    Diego R. Mazzotti
    Séverine Sabia
    Michael N. Weedon
    Eus J. W. van Someren
    Lars Ridder
    Jian Wang
    Vincent T. van Hees
    Scientific Reports, 11
  • [25] Improvement of rainfall estimation from MSG data using Random Forests classification and regression
    Ouallouche, Fethi
    Lazri, Mourad
    Ameur, Soltane
    ATMOSPHERIC RESEARCH, 2018, 211 : 62 - 72
  • [26] Ovarian Cancer Data Classification Using Bagging and Random Forest
    Arfiani, A.
    Rustam, Z.
    PROCEEDINGS OF THE 4TH INTERNATIONAL SYMPOSIUM ON CURRENT PROGRESS IN MATHEMATICS AND SCIENCES (ISCPMS2018), 2019, 2168
  • [27] Decision tree-based classifiers for lung cancer diagnosis and subtyping using TCGA miRNA expression data
    Sherafatian, Masih
    Arjmand, Fateme
    ONCOLOGY LETTERS, 2019, 18 (02) : 2125 - 2131
  • [28] Molecular subtype classification of papillary renal cell cancer using miRNA expression
    Yu, Changwen
    Dai, Danjing
    Xie, Juan
    ONCOTARGETS AND THERAPY, 2019, 12 : 2311 - 2322
  • [29] Diagnosis of Alzheimer's Disease Using fMRI Data and Modifications of Random Forests Algorithm
    Tripoliti, Evanthia E.
    Fotiadis, Dimitrios I.
    Argyropoulou, Maria
    WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING, VOL 25, PT 2 - DIAGNOSTIC IMAGING, 2009, 25 : 754 - 757
  • [30] Classification of Linear Structures in Mammograms Using Random Forests
    Chen, Zezhi
    Berks, Michael
    Astley, Susan
    Taylor, Chris
    DIGITAL MAMMOGRAPHY, 2010, 6136 : 153 - 160