Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization

被引:17
|
作者
Xu, Weiwei [1 ]
Jiang, Xingpeng [2 ]
Hu, Xiaohua [2 ]
Li, Guangrong [3 ]
机构
[1] Wuhan Univ, Int Sch Software, Wuhan 430079, Hubei, Peoples R China
[2] Drexel Univ, Coll Comp & Informat, Philadelphia, PA 19104 USA
[3] Hunan Univ, Sch Business, Changsha 410012, Hunan, Peoples R China
关键词
NETWORK; DISORDERS;
D O I
10.1186/1755-8794-7-S2-S1
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: From a phenotypic standpoint, certain types of diseases may prove to be difficult to accurately diagnose, due to specific combinations of confounding symptoms. Referred to as phenotypic overlap, these sets of disease-related symptoms suggest shared pathophysiological mechanisms. Few attempts have been made to visualize the phenotypic relationships between different human diseases from a machine learning perspective. The proposed research, it is anticipated, will visually assist researchers in quickly disambiguating symptoms which can confound the timely and accurate diagnosis of a disease. Methods: Our method is primarily based on multiple maps t-SNE (mm-tSNE), which is a probabilistic method for visualizing data points in multiple low dimensional spaces. We improved mm-tSNE by adding a Laplacian regularization term and subsequently provide an algorithm for optimizing the new objective function. The advantage of Laplacian regularization is that it adopts clustering structures of variables and provides more sparsity to the estimated parameters. Results: In order to further assess our modified mm-tSNE algorithm from a comparative standpoint, we reexamined two social network datasets used by the previous authors. Subsequently, we apply our method on phenotype dataset. In all these cases, our proposed method demonstrated better performance than the original version of mm-tSNE, as measured by the neighbourhood preservation ratio. Conclusions: Phenotype grouping reflects the nature of human disease genetics. Thus, phenotype visualization may be complementary to investigate candidate genes for diseases as well as functional relations between genes and proteins. These relationships can be modelled by the modified mm-tSNE method. The modified mm-tSNE can be applied directly in other domain including social and biological datasets.
引用
收藏
页数:9
相关论文
共 5 条
  • [1] Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization
    Weiwei Xu
    Xingpeng Jiang
    Xiaohua Hu
    Guangrong Li
    [J]. BMC Medical Genomics, 7
  • [2] Visualization of Disease Relationships by Multiple Maps t-SNE Regularization Based on Nesterov Accelerated Gradient
    Shen, Xianjun
    Zhu, Xianchao
    Jiang, Xingpeng
    He, Tingting
    Hu, Xiaohua
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 604 - 607
  • [3] Visualization of Non-metric Relationships by Adaptive Learning Multiple Maps t-SNE Regularization
    Shen, Xianjun
    Zhu, Xianchao
    Jiang, Xingpeng
    Gao, Li
    He, Tingting
    Hu, Xiaohua
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 3882 - 3887
  • [4] Laplacian-based Cluster-Contractive t-SNE for High-Dimensional Data Visualization
    Sun, Yan
    Han, Yi
    Fan, Jicong
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (01)
  • [5] Visualization of Multi-Objective Switched Reluctance Machine Optimization at Multiple Operating Conditions with t-SNE
    Zhang, Shen
    Zhang, Shibo
    Li, Sufei
    Du, Liang
    Habetler, Thomas G.
    [J]. 2019 IEEE ENERGY CONVERSION CONGRESS AND EXPOSITION (ECCE), 2019, : 3793 - 3798