A comparison of rule-based and centroid single-sample multiclass predictors for transcriptomic classification

被引:19
|
作者
Eriksson, Pontus [1 ]
Marzouka, Nour-al-dain [1 ]
Sjodahl, Gottfrid [2 ]
Bernardo, Carina [1 ]
Liedberg, Fredrik [2 ]
Hoglund, Mattias [1 ]
机构
[1] Lund Univ, Dept Clin Sci, Div Oncol, Lund, Sweden
[2] Lund Univ, Wine Univ Hosp, Dept Translat Med, Urol Urothelial Canc, Malmo, Sweden
关键词
GENE-EXPRESSION; RNA-SEQ; MICROARRAYS; PACKAGE;
D O I
10.1093/bioinformatics/btab763
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Gene expression-based multiclass prediction, such as tumor subtyping, is a non-trivial bioinformatic problem. Most classifier methods operate by comparing expression levels relative to other samples. Methods that base predictions on the expression pattern within a sample have been proposed as an alternative. As these methods are invariant to the cohort composition and can be applied to a sample in isolation, they can collectively be termed single sample predictors (SSP). Such predictors could potentially be used for preprocessing-free classification of new samples and be built to function across different expression platforms where proper batch and dataset normalization is challenging. Here, we evaluate the behavior of several multiclass SSPs based on binary gene-pair rules (k-Top Scoring Pairs, Absolute Intrinsic Molecular Subtyping and a new Random Forest approach) and compare them to centroids built with centered or raw expression values, with the criteria that an optimal predictor should have high accuracy, overcome differences in tumor purity, be robust across expression platforms and provide an informative prediction output score. Results: We found that gene-pair-based SSPs showed excellent performance on many expression-based classification tasks. The three methods differed in prediction score output, handling of tied scores and behavior in low purity samples. The k-Top Scoring Pairs and Random Forest approach both achieved high classification accuracy while providing an informative prediction score. Although gene-pair-based SSPs have been touted as being crossplatform compatible (through training on mixed platform data), out-of-the-box compatibility with a new dataset remains a potential issue that warrants cohort-to-cohort verification.
引用
收藏
页码:1022 / 1029
页数:8
相关论文
共 50 条
  • [1] A Rule-based Filter Network for Multiclass Data Classification
    Tusor, Balazs
    Varkonyi-Koczy, Annamaria R.
    [J]. 2015 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC), 2015, : 1102 - 1107
  • [2] Rule-based adversarial sample generation for text classification
    Zhou, Nai
    Yao, Nianmin
    Zhao, Jian
    Zhang, Yanan
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (13): : 10575 - 10586
  • [3] Rule-based adversarial sample generation for text classification
    Nai Zhou
    Nianmin Yao
    Jian Zhao
    Yanan Zhang
    [J]. Neural Computing and Applications, 2022, 34 : 10575 - 10586
  • [4] A comparison of classification strategies in rule-based classifiers
    Wojciechowski, Szymon
    [J]. LOGIC JOURNAL OF THE IGPL, 2018, 26 (01) : 29 - 46
  • [5] Rule-based Similarity for Classification
    Janusz, Andrzej
    [J]. 2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 3, 2009, : 449 - 452
  • [6] A Micro-Extended Belief Rule-Based System for Big Data Multiclass Classification Problems
    Yang, Long-Hao
    Liu, Jun
    Wang, Ying-Ming
    Martinez, Luis
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (01): : 420 - 440
  • [7] A Design of Fuzzy Rule-Based Classifier for Multiclass Classification and Its Realization in Horizontal Federated Learning
    Hu, Xingchen
    Zhu, Xiubin
    Yang, Lan
    Pedrycz, Witold
    Li, Zhiwu
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (09) : 5098 - 5108
  • [8] Hierarchical extended collaborative representation based classification for single-sample face recognition
    Yuan, Yue-Lai
    Chen, Di-Hu
    Hu, Hai-Feng
    Du, Ling-Shuang
    [J]. IET COMPUTER VISION, 2019, 13 (07) : 651 - 658
  • [9] An evaluation of single-sample tumor subtype classification methods
    Eriksson, P.
    Marzouka, N. A. D.
    Sjodahl, G.
    Bernardo, C.
    Liedberg, F.
    Hoglund, M.
    [J]. UROLOGIC ONCOLOGY-SEMINARS AND ORIGINAL INVESTIGATIONS, 2020, 38 (12) : 906 - 907
  • [10] ANALOGICAL VERSUS RULE-BASED CLASSIFICATION
    WATTENMAKER, WD
    MCQUAID, HL
    SCHWERTZ, SJ
    [J]. MEMORY & COGNITION, 1995, 23 (04) : 495 - 509