A New Binary Biclustering Algorithm Based on Weight Adjacency Difference Matrix for Analyzing Gene Expression Data

Cited by: 2
Authors
Chu, He-Ming [1]
Kong, Xiang-Zhen [1]
Liu, Jin-Xing [1]
Zheng, Chun-Hou [1]
Zhang, Han [2]
Affiliations
[1] Qufu Normal Univ, Sch Comp Sci, Rizhao 276826, Shandong, Peoples R China
[2] Jishou Univ, Sch Informat Sci & Engn, Jishou 416000, Hunan, Peoples R China
Funding
National Science Foundation (USA); National Natural Science Foundation of China;
Keywords
Biclustering; gene expression data; weight matrix; binary matrix; HETEROGENEITY; PATHWAYS; PATTERNS;
DOI
10.1109/TCBB.2023.3283801
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification codes
071010; 081704;
Abstract
Biclustering algorithms are essential for processing gene expression data. However, most biclustering algorithms first require the data matrix to be preprocessed into a binary matrix. This type of preprocessing may introduce noise or cause information loss in the binary matrix, which reduces the biclustering algorithm's ability to obtain the optimal biclusters. In this paper, we propose a new preprocessing method named Mean-Standard Deviation (MSD) to resolve the problem. Additionally, we introduce a new biclustering algorithm called Weight Adjacency Difference Matrix Binary Biclustering (W-AMBB) to effectively process datasets containing overlapping biclusters. The basic idea is to create a weighted adjacency difference matrix by applying weights to a binary matrix that is derived from the data matrix. This allows us to identify genes with significant associations in sample data by efficiently identifying similar genes that respond to specific conditions. The performance of the W-AMBB algorithm was tested on both synthetic and real datasets and compared with other classical biclustering methods. The experimental results demonstrate that the W-AMBB algorithm is significantly more robust than the compared biclustering methods on the synthetic dataset. In addition, the results of the GO enrichment analysis show that the W-AMBB method yields biologically significant results on real datasets.
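The abstract does not define MSD or the weighted adjacency difference matrix, so the following is only a rough sketch of the general idea under stated assumptions, not the authors' method. It assumes that binarization marks a value as 1 when it exceeds the gene's mean plus `k` standard deviations, and that the difference between two genes is the sum of per-condition weights over the conditions where their binary values disagree. The function names, the parameter `k`, the weights, and the toy data are all hypothetical.

```python
from statistics import mean, pstdev


def msd_binarize(matrix, k=1.0):
    """Binarize each gene (row) against its own mean and standard deviation.

    Assumed rule (not taken from the paper): a value becomes 1 when it is
    greater than row_mean + k * row_std, otherwise 0.
    """
    binary = []
    for row in matrix:
        threshold = mean(row) + k * pstdev(row)
        binary.append([1 if value > threshold else 0 for value in row])
    return binary


def weighted_difference_matrix(binary, weights):
    """Pairwise weighted mismatch between binary gene rows.

    Assumed definition: entry [i][j] is the sum of weights[c] over every
    condition c where genes i and j have different binary values.
    """
    n = len(binary)
    diff = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = sum(w for a, b, w in zip(binary[i], binary[j], weights) if a != b)
            diff[i][j] = diff[j][i] = d
    return diff


# Toy expression matrix: 3 genes (rows) x 4 conditions (columns).
expression = [
    [1.0, 1.2, 9.0, 1.1],
    [0.9, 1.0, 8.5, 1.3],
    [5.0, 0.8, 1.0, 0.9],
]
weights = [1.0, 1.0, 2.0, 1.0]  # hypothetical per-condition weights

binary = msd_binarize(expression)
diff = weighted_difference_matrix(binary, weights)
```

In this toy example the first two genes respond to the same condition, so their weighted difference is zero, which would make them candidates for the same bicluster; the third gene differs from both.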
Pages: 2802-2809
Number of pages: 8
Related papers
50 in total
  • [1] A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis
    He-Ming Chu
    Jin-Xing Liu
    Ke Zhang
    Chun-Hou Zheng
    Juan Wang
    Xiang-Zhen Kong
    BMC Bioinformatics, 23
  • [2] A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis
    Chu, He-Ming
    Liu, Jin-Xing
    Zhang, Ke
    Zheng, Chun-Hou
    Wang, Juan
    Kong, Xiang-Zhen
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [3] Biclustering of Gene Expression Data Based on Binary Artificial Fish Swarm Algorithm
    Zhang, Rui
    Gao, Huacheng
    Liu, Yinqiu
    Lu, Yuanyuan
    Cui, Yan
    PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 247 - 251
  • [4] Evolutionary Algorithm Based on New Crossover for the Biclustering of Gene Expression Data
    Maatouk, Ons
    Ayadi, Wassim
    Bouziri, Hend
    Duval, Beatrice
    PATTERN RECOGNITION IN BIOINFORMATICS, PRIB 2014, 2014, 8626 : 48 - 59
  • [5] Binary matrix factorization for analyzing gene expression data
    Zhang, Zhong-Yuan
    Li, Tao
    Ding, Chris
    Ren, Xian-Wen
    Zhang, Xiang-Sun
    DATA MINING AND KNOWLEDGE DISCOVERY, 2010, 20 (01) : 28 - 52
  • [6] Binary matrix factorization for analyzing gene expression data
    Zhong-Yuan Zhang
    Tao Li
    Chris Ding
    Xian-Wen Ren
    Xiang-Sun Zhang
    Data Mining and Knowledge Discovery, 2010, 20 : 28 - 52
  • [7] ARBic: an all-round biclustering algorithm for analyzing gene expression data
    Liu, Xiangyu
    Yu, Ting
    Zhao, Xiaoyu
    Long, Chaoyi
    Han, Renmin
    Su, Zhengchang
    Li, Guojun
    NAR GENOMICS AND BIOINFORMATICS, 2023, 5 (01)
  • [8] Biclustering of gene expression data based on hybrid genetic algorithm
    Bagyamani, J.
    Thangavel, K.
    Rathipriya, R.
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2013, 5 (04) : 333 - 350
  • [9] Evolutionary Biclustering Algorithm of Gene Expression Data
    Ayadi, Wassim
    Maatouk, Ons
    Bouziri, Hend
    2012 23RD INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2012, : 206 - 210
  • [10] An improved biclustering algorithm for gene expression data
    Jin, Sheng-Hua
    Hua, Li
    Open Cybernetics and Systemics Journal, 2014, 8 : 1141 - 1144