Pan-cancer classification by regularized multi-task learning

Cited by: 6
Authors
Hossain, Sk Md Mosaddek [1]
Khatun, Lutfunnesa [2]
Ray, Sumanta [1]
Mukhopadhyay, Anirban [2]
Affiliations
[1] Aliah Univ, Comp Sci & Engn, Kolkata 700160, India
[2] Univ Kalyani, Comp Sci & Engn, Kalyani 741235, W Bengal, India
Keywords
INFORMATION; PROGNOSIS; MODEL
DOI
10.1038/s41598-021-03554-8
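Chinese Library Classification (CLC)
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09
Abstract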
Classifying pan-cancer samples from their gene expression patterns is a crucial challenge for the accurate diagnosis and treatment of cancer patients. Machine learning algorithms are established tools for downstream analysis and for capturing deviations in gene expression patterns across diverse diseases. In this work, we developed PC-RMTL, a pan-cancer classification model based on regularized multi-task learning (RMTL), to classify 21 cancer types and adjacent normal samples using RNA-Seq data obtained from TCGA. PC-RMTL outperforms five state-of-the-art classification algorithms: SVM with a linear kernel (SVM-Lin), SVM with a radial basis function kernel (SVM-RBF), random forest (RF), k-nearest neighbours (kNN), and decision trees (DT). PC-RMTL achieves 96.07% accuracy and a 95.80% MCC score on a completely unseen independent test set. The only real competitor is SVM-Lin, which nearly matches the prediction accuracy of PC-RMTL, but only when the complete feature set is provided for training; otherwise, PC-RMTL outperforms all other classification models. To the best of our knowledge, this is a significant improvement over existing work in pan-cancer classification, which has failed to reliably distinguish many cancer types from one another. We also compared the expression patterns of the top discriminating genes across the cancers and performed a functional enrichment analysis, which uncovers several interesting facts about distinguishing pan-cancer samples.
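The abstract reports two evaluation metrics, accuracy and the Matthews correlation coefficient (MCC). As a minimal sketch of how these can be computed for a multi-class problem, the Python below uses the common multi-class generalization of MCC built from per-class counts. This is an assumption for illustration: the abstract does not state which MCC variant the authors used, and the function names, labels, and toy predictions are hypothetical, not taken from the paper.

```python
from collections import Counter
from math import sqrt


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def multiclass_mcc(y_true, y_pred):
    """Multi-class MCC from per-class true and predicted counts.

    Assumed variant (not confirmed by the abstract):
    (c*s - sum_k t_k*p_k) / sqrt((s^2 - sum_k t_k^2) * (s^2 - sum_k p_k^2)),
    where s is the sample count, c the number of correct predictions,
    t_k the number of true samples of class k, and p_k the number of
    samples predicted as class k.
    """
    s = len(y_true)
    c = sum(t == p for t, p in zip(y_true, y_pred))
    t_counts = Counter(y_true)
    p_counts = Counter(y_pred)
    labels = set(t_counts) | set(p_counts)

    cov_tp = c * s - sum(t_counts[k] * p_counts[k] for k in labels)
    cov_tt = s * s - sum(v * v for v in t_counts.values())
    cov_pp = s * s - sum(v * v for v in p_counts.values())

    denom = sqrt(cov_tt * cov_pp)
    # Return 0.0 when the score is undefined (e.g. only one class present).
    return cov_tp / denom if denom else 0.0


# Hypothetical toy labels; one LUAD sample is misclassified as KIRC.
y_true = ["BRCA", "BRCA", "LUAD", "LUAD", "KIRC", "KIRC"]
y_pred = ["BRCA", "BRCA", "LUAD", "KIRC", "KIRC", "KIRC"]

print(f"accuracy = {accuracy(y_true, y_pred):.4f}")
print(f"MCC      = {multiclass_mcc(y_true, y_pred):.4f}")
```

Unlike accuracy, MCC accounts for how predictions are distributed across all classes, which is one reason it is often reported alongside accuracy when class sizes are imbalanced.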
Pages: 11