Machine Learning Clustering for Cancer Analysis Employing Gene Expression Data

被引:0
|
作者
Ospino, Camilo Andres Perez [1 ]
Rivera, Jorman Arbey Castro [1 ]
Orjuela-Canon, Alvaro D. [2 ]
机构
[1] Univ Rosario, Bogota, Colombia
[2] Univ Rosario, Sch Med & Hlth Sci, Bogota, Colombia
关键词
Pan-Cancer; K-means; Data base; Genomics; Clustering;
D O I
10.1109/COLCACI59285.2023.10226026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The idea that cancer types vary in their molecular structure (DNA, RNA, proteins, and epigenetics) depending on the origin and location of the cancer, has been worked on. The Cancer Genome Atlas (TCGA) has generated an initiative to carefully create a database to ensure quality data in the profiling of different tumors to promote research, a part of this large database was called Pan-Cancer, which has the genomic, epigenetic, transcriptional and proteomic profiling of 12 different types of cancer. In this research we took one of the profiling, RNA profiling, in 5 cancer types, in order to determine the possibility of segmenting in an unsupervised manner and to evaluate the difference of them by their origin. The results indicate that the number of clusters can vary from 5 to 7, with 5 clusters being established by the database labels, however, the division of 6 or 7 clusters is due to the clustering of breast cancer (BRCA) which has several origins.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Gene reduction and machine learning algorithms for cancer classification based on microarray gene expression data: A comprehensive review
    Osama, Sarah
    Shaban, Hassan
    Ali, Abdelmgeid A.
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [42] Fuzzy Clustering Algorithm of Kernel for Gene Expression Data Analysis
    Liu, Wenyuan
    Zhang, Bin
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2009, : 553 - 556
  • [43] Identification of Robust Clustering Methods in Gene Expression Data Analysis
    Hossen, Md. Bipul
    Siraj-Ud-Doulah, Md.
    CURRENT BIOINFORMATICS, 2017, 12 (06) : 558 - 562
  • [44] A novel clustering method for analysis of gene microarray expression data
    Luo, F
    Liu, J
    DATA MINING FOR BIOMEDICAL APPLICATIONS, PROCEEDINGS, 2006, 3916 : 71 - 81
  • [45] Kernel independent component analysis for gene expression data clustering
    Jin, X
    Xu, AB
    Bie, RF
    Guo, P
    INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, PROCEEDINGS, 2006, 3889 : 454 - 461
  • [46] Learning structure in gene expression data using deep architectures, with an application to gene clustering
    Gupta, Aman
    Wang, Haohan
    Ganapathiraju, Madhavi
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 1328 - 1335
  • [47] An evolutionary clustering algorithm for gene expression microarray data analysis
    Ma, Patrick C. H.
    Chan, Keith C. C.
    Yao, Xin
    Chiu, David K. Y.
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2006, 10 (03) : 296 - 314
  • [48] Clustering analysis of microarray gene expression data by splitting algorithm
    Wang, RY
    Scharenbroich, L
    Hart, C
    Wold, B
    Mjolsness, E
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2003, 63 (7-8) : 692 - 706
  • [49] Fuzzy clustering algorithm of kernel for gene expression data analysis
    Chen, Zhiru
    Hong, Wenxue
    Wang, Changwu
    ICIC Express Letters, 2009, 3 (04): : 1435 - 1440
  • [50] Nonlinear dimensionality reduction of gene expression data for visualization and clustering analysis of cancer tissue samples
    Shi, Jinlong
    Luo, Zhigang
    COMPUTERS IN BIOLOGY AND MEDICINE, 2010, 40 (08) : 723 - 732