Multi-view based integrative analysis of gene expression data for identifying biomarkers

被引:9
|
作者
Yang, Zi-Yi [1 ,2 ]
Liu, Xiao-Ying [3 ]
Shu, Jun [4 ]
Zhang, Hui [1 ,2 ]
Ren, Yan-Qiong [1 ,2 ]
Xu, Zong-Ben [4 ]
Liang, Yong [1 ,2 ]
机构
[1] Macau Univ Sci & Technol, Fac Informat Technol, Taipa 999078, Macao, Peoples R China
[2] Macau Univ Sci & Technol, State Key Lab Qual Res Chinese Med, Taipa 999078, Macao, Peoples R China
[3] Guangdong Polytech Sci & Technol, Comp Engn Tech Coll, Zhuhai 519090, Peoples R China
[4] Xi An Jiao Tong Univ, Sch Math & Stat, Minist Educ, Key Lab Intelligent Networks & Network Secur, Xian 710049, Shaanxi, Peoples R China
关键词
VARIABLE SELECTION; MICROARRAY; METAANALYSIS; REGULARIZATION; PROFILES;
D O I
10.1038/s41598-019-49967-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The widespread applications in microarray technology have produced the vast quantity of publicly available gene expression datasets. However, analysis of gene expression data using biostatistics and machine learning approaches is a challenging task due to (1) high noise; (2) small sample size with high dimensionality; (3) batch effects and (4) low reproducibility of significant biomarkers. These issues reveal the complexity of gene expression data, thus significantly obstructing microarray technology in clinical applications. The integrative analysis offers an opportunity to address these issues and provides a more comprehensive understanding of the biological systems, but current methods have several limitations. This work leverages state of the art machine learning development for multiple gene expression datasets integration, classification and identification of significant biomarkers. We design a novel integrative framework, MVIAm - Multi-View based Integrative Analysis of microarray data for identifying biomarkers. It applies multiple cross-platform normalization methods to aggregate multiple datasets into a multi-view dataset and utilizes a robust learning mechanism Multi-View Self-Paced Learning (MVSPL) for gene selection in cancer classification problems. We demonstrate the capabilities of MVIAm using simulated data and studies of breast cancer and lung cancer, it can be applied flexibly and is an effective tool for facing the four challenges of gene expression data analysis. Our proposed model makes microarray integrative analysis more systematic and expands its range of applications.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] A multi-view genomic data simulator
    Michele Fratello
    Angela Serra
    Vittorio Fortino
    Giancarlo Raiconi
    Roberto Tagliaferri
    Dario Greco
    [J]. BMC Bioinformatics, 16
  • [42] Clustering-Based Anomaly Detection in Multi-View Data
    Alvarez, Alejandro Marcos
    Yamada, Makoto
    Kimura, Akisato
    Iwata, Tomoharu
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1545 - 1548
  • [43] POLSAR DATA ONLINE CLASSIFICATION BASED ON MULTI-VIEW LEARNING
    Nie, Xiangli
    Ding, Shuguang
    Zhang, Bo
    Qiao, Hong
    Huang, Xiayuan
    [J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 2354 - 2358
  • [44] Efficient compression of multi-view depth data based on MVC
    Merkle, Philipp
    Smolic, Aljoscha
    Mueller, Karsten
    Wiegand, Thomas
    [J]. 2007 3DTV CONFERENCE, 2007, : 249 - 252
  • [45] Co-clustering based classification of multi-view data
    Hussain, Syed Fawad
    Khan, Mohsin
    Siddiqi, Imran
    [J]. APPLIED INTELLIGENCE, 2022, 52 (13) : 14756 - 14772
  • [46] scICML: Information-Theoretic Co-Clustering-Based Multi-View Learning for the Integrative Analysis of Single-Cell Multi-Omics Data
    Zeng, Pengcheng
    Lin, Zhixiang
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2024, 21 (01) : 200 - 207
  • [47] Multi-View Missing Data Completion
    Zhang, Lei
    Zhao, Yao
    Zhu, Zhenfeng
    Shen, Dinggang
    Ji, Shuiwang
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (07) : 1296 - 1309
  • [48] Identifying differential networks based on multi-platform gene expression data
    Le Ou-Yang
    Yan, Hong
    Zhang, Xiao-Fei
    [J]. MOLECULAR BIOSYSTEMS, 2017, 13 (01) : 183 - 192
  • [49] Multi-View Integrative Attention-Based Deep Representation Learning for Irregular Clinical Time-Series Data
    Lee, Yurim
    Jun, Eunji
    Choi, Jaehun
    Suk, Heung-Il
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (08) : 4270 - 4280
  • [50] Graph Learning With Riemannian Optimization for Multi-View Integrative Clustering
    Khan, Aparajita
    Maji, Pradipta
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,