The Application of Bi-clustering and Bayesian Network for Gene Sets Network Construction in Breast Cancer Microarray Data

被引:0
|
作者
Sohrabi, Ahmad [1 ]
Saraygord-Afshari, Neda [2 ]
Roudbari, Masoud [1 ]
机构
[1] Iran Univ Med Sci, Sch Publ Hlth, Dept Biostat, Tehran, Iran
[2] Iran Univ Med Sci, Fac Allied Med Sci, Dept Med Biotechnol, Tehran, Iran
关键词
Breast cancer; Bi-clustering; Cluster analysis; Microarray data; Gene expression; Neoplasms; Bayesian network; EXPRESSION ANALYSIS; MODELS;
D O I
10.30476/mejc.2022.89998.1557
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background: Breast cancer is one of the most prevalent types of cancer in Iranian women and the second cause of death in women worldwide. Gene mutations are the key determinants of the disease; therefore, the genetic study of this disease is of paramount importance. One of the genetic evaluation methods of this disease is microarray technology, which allows the examination of the simultaneous expression of thousands of genes. Clustering is the method for analyzing high-dimension data, which we used in the present research for collecting similar genes in separated clusters. Method: A descriptive and inferential statistical analysis was carried out to evaluate unsupervised learning models of gene expression analysis and five bi-clustering methods (including PLAID (PL), Fabia, Bimax, Cheng & Church (CC), and Xmotif) were compared. For this purpose, we obtained the microarray gene expression data for lapatinib-resistant breast cancer cell lines from previously published research. The enrichment efficacy of the clusters was evaluated with gene ontology, and the results of these five models were compared with the Jaccard index, variance stability, least-square error, and goodness of fit indices. Furthermore, the results of the best model were assessed for building a genes sets network with Bayesian networks.Results: After preprocessing, clustering was performed on the data with the dimension (4710 x 18) of the genes. Four models, except for CC, successfully found bi-clusters in the data set. The data evaluation revealed that the results of the models were almost the same, but the PL model performed better than the others, finding 11 bi-clusters; this model was used to build the network of gene sets.Conclusion: According to the results, the PL method was suitable for clustering the data. Accordingly, it could be recommended for data analysis. In addition, the gene sets network formed on gene expression data was incompetent.
引用
收藏
页码:624 / 640
页数:17
相关论文
共 50 条
  • [21] Bayesian neural network for microarray data
    Liang, YL
    George, EO
    Kelemen, A
    [J]. PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 193 - 197
  • [22] The application of gene co-expression network reconstruction based on CNVs and gene expression microarray data in breast cancer
    Xu, Yan
    DuanMu, Huizi
    Chang, Zhiqiang
    Zhang, Shanzhen
    Li, Zhenqi
    Li, Zihui
    Liu, Yufeng
    Li, Kening
    Qiu, Fujun
    Li, Xia
    [J]. MOLECULAR BIOLOGY REPORTS, 2012, 39 (02) : 1627 - 1637
  • [23] The application of gene co-expression network reconstruction based on CNVs and gene expression microarray data in breast cancer
    Yan Xu
    Huizi DuanMu
    Zhiqiang Chang
    Shanzhen Zhang
    Zhenqi Li
    Zihui Li
    Yufeng Liu
    Kening Li
    Fujun Qiu
    Xia Li
    [J]. Molecular Biology Reports, 2012, 39 : 1627 - 1637
  • [24] Bi-clustering Gene Expression Data Using Co-similarity
    Hussain, Syed Fawad
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PT I, 2011, 7120 : 190 - 200
  • [25] LPRP: A Gene-Gene Interaction Network Construction Algorithm and Its Application in Breast Cancer Data Analysis
    Su, Lingtao
    Meng, Xiangyu
    Ma, Qingshan
    Bai, Tian
    Liu, Guixia
    [J]. INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2018, 10 (01) : 131 - 142
  • [26] Combining Clustering and Bayesian Network For Gene Network Inference
    Zainudin, Suhaila
    Deris, Safaai
    [J]. ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 2, PROCEEDINGS, 2008, : 557 - +
  • [27] Interactive gene clustering - A case study of breast cancer microarray data
    Gruzdz, A
    Ihnatowicz, A
    Slezak, D
    [J]. INFORMATION SYSTEMS FRONTIERS, 2006, 8 (01) : 21 - 27
  • [28] Construction and application of Bayesian network model for spatial data mining
    Huang, Jiejun
    Yuan, Yanbin
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION, VOLS 1-7, 2007, : 914 - 917
  • [29] Interactive Gene Clustering—A Case Study of Breast Cancer Microarray Data
    Alicja Gruźdź
    Aleksandra Ihnatowicz
    Dominik Ślʁzak
    [J]. Information Systems Frontiers, 2006, 8 : 21 - 27
  • [30] Construction and Clarification of Dynamic Gene Regulatory Network of Cancer Cell Cycle via Microarray Data
    Li, Cheng-Wei
    Chu, Yung-Hsiang
    Chen, Bor-Sen
    [J]. CANCER INFORMATICS, 2006, 2 : 223 - 241