A hierarchical clustering approach for colorectal cancer molecular subtypes identification from gene expression data

被引:1
|
作者
Raghav, Shivangi [1 ]
Suri, Aastha [1 ]
Kumar, Deepika [1 ]
Aakansha, Aakansha [1 ]
Rathore, Muskan [1 ]
Roy, Sudipta [2 ]
机构
[1] Bharati Vidyapeeths Coll Engn, Dept Comp Sci & Engn, New Delhi, India
[2] Jio Inst, Artificial Intelligence & Data Sci, Navi Mumbai 410206, India
来源
INTELLIGENT MEDICINE | 2024年 / 4卷 / 01期
关键词
Machine learning; Colorectal cancer; Feature selection; Classification; Clustering; BRAIN-TISSUES; CLASSIFICATION; SEGMENTATION;
D O I
10.1016/j.imed.2023.04.002
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background Colorectal cancer (CRC) is the second leading cause of cancer fatalities and the third most common human disease. Identifying molecular subgroups of CRC and treating patients accordingly could result in better therapeutic success compared with treating all CRC patients similarly. Studies have highlighted the significance of CRC as a major cause of mortality worldwide and the potential benefits of identifying molecular subtypes to tailor treatment strategies and improve patient outcomes. Methods This study proposed an unsupervised learning approach using hierarchical clustering and feature selection to identify molecular subtypes and compares its performance with that of conventional methods. The proposed model contained gene expression data from CRC patients obtained from Kaggle and used dimension reduction techniques followed by Z-score-based outlier removal. Agglomerative hierarchy clustering was used to identify molecular subtypes, with a P -value-based approach for feature selection. The performance of the model was evaluated using various classifiers including multilayer perceptron (MLP). Results The proposed methodology outperformed conventional methods, with the MLP classifier achieving the highest accuracy of 89% after feature selection. The model successfully identified molecular subtypes of CRC and differentiated between different subtypes based on their gene expression profiles. Conclusion This method could aid in developing tailored therapeutic strategies for CRC patients, although there is a need for further validation and evaluation of its clinical significance.
引用
收藏
页码:43 / 51
页数:9
相关论文
共 50 条
  • [1] Molecular identification of bladder cancer gene expression subtypes
    Chu, In-Sun
    Song, Bic-Na
    Leem, Sun-Hee
    [J]. CANCER RESEARCH, 2018, 78 (13)
  • [2] A Hierarchical Approach for Clustering and Pattern Matching of Gene Expression Data
    Hoque, Soriful
    Istyaq, Salim
    Riaz, Md Mushir
    [J]. 2012 SIXTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING (ICGEC), 2012, : 413 - 416
  • [3] Hierarchical clustering of gene expression data
    Luo, F
    Tang, K
    Khan, L
    [J]. THIRD IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING - BIBE 2003, PROCEEDINGS, 2003, : 328 - 335
  • [4] Kernel hierarchical gene clustering from microarray expression data
    Qin, J
    Lewis, DP
    Noble, WS
    [J]. BIOINFORMATICS, 2003, 19 (16) : 2097 - 2104
  • [5] An Integrated Approach for Identifying Molecular Subtypes in Human Colon Cancer Using Gene Expression Data
    Wang, Wen-Hui
    Xie, Ting-Yan
    Xie, Guang-Lei
    Ren, Zhong-Lu
    Li, Jin-Ming
    [J]. GENES, 2018, 9 (08):
  • [6] Gene expression based subtypes of colorectal cancer
    Berg, Kaja G.
    Eilertsen, Ina A.
    Alagaratnam, Sharmini
    Danielsen, Stine A.
    Nesbakken, Arild
    Sveen, Anita
    Lothe, Ragnhild A.
    [J]. CANCER RESEARCH, 2016, 76
  • [7] Bayesian Hierarchical Clustering for Studying Cancer Gene Expression Data with Unknown Statistics
    Sirinukunwattana, Korsuk
    Savage, Richard S.
    Bari, Muhammad F.
    Snead, David R. J.
    Rajpoot, Nasir M.
    [J]. PLOS ONE, 2013, 8 (10):
  • [8] Identification of Molecular Subtypes of Breast Cancer Using Hierarchical Clustering: Analysis of Inter-Observer Agreement
    Mackay, A.
    Weigelt, B.
    Grigoriadis, A.
    Kreike, B.
    Natrajan, R.
    A'Hern, R.
    Tan, D. S.
    Dowsett, M.
    Ashworth, A.
    Reis-Filho, J. S.
    [J]. LABORATORY INVESTIGATION, 2011, 91 : 53A - 53A
  • [9] Identification and validation of gene expression subtypes in a large set of colorectal cancer samples.
    Budinska, Eva
    Popovici, Vlad Calin
    Sikora, Katarzyna Otylia
    Lapique, Nicolas
    Tejpar, Sabine
    Hodgson, John Graeme
    Weinrich, Scott
    Roth, Arnaud
    Bosman, Fred
    Delorenzi, Mauro
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2012, 30 (15)
  • [10] Unsupervised Hierarchical Clustering Identifies Immune Gene Subtypes in Gastric Cancer
    Cao, Jing
    Gong, Jiao
    Li, Xinhua
    Hu, Zhaoxia
    Xu, Yingjun
    Shi, Hong
    Li, Danyang
    Liu, Guangjian
    Jie, Yusheng
    Hu, Bo
    Chong, Yutian
    [J]. FRONTIERS IN PHARMACOLOGY, 2021, 12