Hierarchical classification-based pan-cancer methylation analysis to classify primary cancer

被引:0
|
作者
Yang, Youpeng [1 ]
Zeng, Qiuhong [2 ]
Liu, Gaotong [2 ]
Zheng, Shiyao [1 ]
Luo, Tianyang [1 ]
Guo, Yibin [1 ]
Tang, Jia [3 ,4 ]
Huang, Yi [2 ]
机构
[1] Sun Yat Sen Univ, Med Sch, Shenzhen 518107, Peoples R China
[2] Geneplus Shenzhen Inst, Shenzhen 518118, Peoples R China
[3] Guangdong Prov Fertil Hosp, Guangdong Prov Reprod Sci Inst, NHC Key Lab Male Reprod & Genet, Guangzhou 510062, Peoples R China
[4] Jinan Univ, Sch Med, Guangzhou 510632, Peoples R China
关键词
Cancer; Classification; Cluster analysis; Machine learning;
D O I
10.1186/s12859-023-05529-0
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Hierarchical classification offers a more specific categorization of data and breaks down large classification problems into subproblems, providing improved prediction accuracy and predictive power for undefined categories, while also mitigating the impact of poor-quality data. Despite these advantages, its application in predicting primary cancer is rare. To leverage the similarity of cancers and the specificity of methylation patterns among them, we developed the Cancer Hierarchy Classification Tool (CHCT) using the idea of hierarchical classification, with methylation data from 30 cancer types and 8239 methylome samples downloaded from publicly available databases (The Cancer Genome Atlas (TCGA) and the Gene Expression Omnibus (GEO)). We used unsupervised clustering to divide the classification subproblems and screened differentially methylated sites using Analysis of variance (ANOVA) test, Tukey-kramer test, and Boruta algorithms to construct models for each classifier module. After validation, CHCT accurately classified 1568 out of 1660 cases in the test set, with an average accuracy of 94.46%. We further curated an independent validation cohort of 677 cancer samples from GEO and assigned a diagnosis using CHCT, which showed high diagnostic potential with generally high accuracies (an average accuracy of 91.40%). Moreover, CHCT demonstrates predictive capability for additional cancer types beyond its original classifier scope as demonstrated in the medulloblastoma and pituitary tumor datasets. In summary, CHCT can hierarchically classify primary cancer by methylation profile, by splitting a large-scale classification of 30 cancer types into ten smaller classification problems. These results indicate that cancer hierarchical classification has the potential to be an accurate and robust cancer classification method.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Hierarchical classification-based pan-cancer methylation analysis to classify primary cancer
    Youpeng Yang
    Qiuhong Zeng
    Gaotong Liu
    Shiyao Zheng
    Tianyang Luo
    Yibin Guo
    Jia Tang
    Yi Huang
    [J]. BMC Bioinformatics, 24
  • [2] Pan-cancer analysis of differential DNA methylation patterns
    Shi, Mai
    Tsui, Stephen Kwok-Wing
    Wu, Hao
    Wei, Yingying
    [J]. BMC MEDICAL GENOMICS, 2020, 13 (Suppl 10)
  • [3] Pan-cancer analysis of differential DNA methylation patterns
    Mai Shi
    Stephen Kwok-Wing Tsui
    Hao Wu
    Yingying Wei
    [J]. BMC Medical Genomics, 13
  • [4] A Pan-Cancer Analysis of Predictive Methylation Signatures of Response to Cancer Immunotherapy
    Xu, Bingxiang
    Lu, Mingjie
    Yan, Linlin
    Ge, Minghui
    Ren, Yong
    Wang, Ru
    Shu, Yongqian
    Hou, Lin
    Guo, Hao
    [J]. FRONTIERS IN IMMUNOLOGY, 2021, 12
  • [5] Pan-cancer patterns of DNA methylation
    Witte, Tania
    Plass, Christoph
    Gerhauser, Clarissa
    [J]. GENOME MEDICINE, 2014, 6
  • [6] Pan-cancer patterns of DNA methylation
    Tania Witte
    Christoph Plass
    Clarissa Gerhauser
    [J]. Genome Medicine, 6
  • [7] Pan-cancer methylome analysis for cancer diagnosis and classification of cancer cell of origin
    Dai Shimizu
    Kenzui Taniue
    Yusuke Matsui
    Hiroshi Haeno
    Hiromitsu Araki
    Fumihito Miura
    Mitsuko Fukunaga
    Kenji Shiraishi
    Yuji Miyamoto
    Seiichi Tsukamoto
    Aya Komine
    Yuta Kobayashi
    Akihiro Kitagawa
    Yukihiro Yoshikawa
    Kuniaki Sato
    Tomoko Saito
    Shuhei Ito
    Takaaki Masuda
    Atsushi Niida
    Makoto Suzuki
    Hideo Baba
    Takashi Ito
    Nobuyoshi Akimitsu
    Yasuhiro Kodera
    Koshi Mimori
    [J]. Cancer Gene Therapy, 2022, 29 : 428 - 436
  • [8] Research on Specific DNA Methylation Regulatory Mechanism Based on Pan-Cancer Analysis
    Guo, Y.
    Gu, J.
    Ge, D.
    [J]. JOURNAL OF THORACIC ONCOLOGY, 2023, 18 (11) : S193 - S193
  • [9] Pan-cancer methylome analysis for cancer diagnosis and classification of cancer cell of origin
    Shimizu, Dai
    Taniue, Kenzui
    Matsui, Yusuke
    Haeno, Hiroshi
    Araki, Hiromitsu
    Miura, Fumihito
    Fukunaga, Mitsuko
    Shiraishi, Kenji
    Miyamoto, Yuji
    Tsukamoto, Seiichi
    Komine, Aya
    Kobayashi, Yuta
    Kitagawa, Akihiro
    Yoshikawa, Yukihiro
    Sato, Kuniaki
    Saito, Tomoko
    Ito, Shuhei
    Masuda, Takaaki
    Niida, Atsushi
    Suzuki, Makoto
    Baba, Hideo
    Ito, Takashi
    Akimitsu, Nobuyoshi
    Kodera, Yasuhiro
    Mimori, Koshi
    [J]. CANCER GENE THERAPY, 2022, 29 (05) : 428 - 436
  • [10] Functional and embedding feature analysis for pan-cancer classification
    Lu, Jian
    Li, JiaRui
    Ren, Jingxin
    Ding, Shijian
    Zeng, Zhenbing
    Huang, Tao
    Cai, Yu-Dong
    [J]. FRONTIERS IN ONCOLOGY, 2022, 12