Improving the accuracy of multiclass classification in machine learning: A case study in a cell signaling dataset

被引:3
|
作者
Pablo Gonzalez-Perez, Pedro [1 ]
Eduardo Sanchez-Gutierrez, Maximo [2 ]
机构
[1] Univ Autonoma Metropolitana Cuajimalpa, Dept Matemat Aplicadas & Sistemas, Ciudad De Mexico, Mexico
[2] Univ Autonoma Ciudad Mexico, Colegio Ciencia & Tecnol, Ciudad De Mexico, Mexico
关键词
Multiclass classification; machine learning; exploratory data analysis; dimensionality reduction; cellular signaling data; FEATURE-SELECTION; DIAGNOSIS;
D O I
10.3233/IDA-215826
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is important to make sense of the data within its context to propose a useful model to solve a problem. This domain knowledge includes information not contained in the data, but that will help us understand the data to be fed into a machine-learning algorithm and guide us on what features might help our model. Nevertheless, domain knowledge may become insufficient as the input variables increase, forcing the need to try automated feature selection techniques. In this study, we investigate whether the joint use of 1) feature selection techniques, such as Chi-square, Tree-based Feature Selection, Pearson's Correlation, LASSO, Low Variance, and Recursive Feature Elimination, 2) outlier detection methods such as Isolation-Forest, and 3) Cross-Validation techniques lead to improving the accuracy in multiclass classification in machine learning. Specifically, we address the classification of patterns representing the activation state of cell signaling components into classes that symbolize the different cellular processes triggered in cancer cells. The results presented in this work have shown an accuracy increase with up to 80% fewer input features by only using 3 out of the 16 original descriptors.
引用
收藏
页码:481 / 500
页数:20
相关论文
共 50 条
  • [41] Machine Learning Assisted Methodology for Multiclass Classification of Malignant Brain Tumors
    Vidyarthi, Ankit
    Agarwal, Ruchi
    Gupta, Deepak
    Sharma, Rahul
    Draheim, Dirk
    Tiwari, Prayag
    IEEE ACCESS, 2022, 10 : 50624 - 50640
  • [42] Machine Learning Assisted Methodology for Multiclass Classification of Malignant Brain Tumors
    Vidyarthi, Ankit
    Agarwal, Ruchi
    Gupta, Deepak
    Sharma, Rahul
    Draheim, Dirk
    Tiwari, Prayag
    IEEE Access, 2022, 10 : 50624 - 50640
  • [43] A Machine Learning Based Ensemble Method for Automatic Multiclass Classification of Decisions
    Fu, Liming
    Liang, Peng
    Li, Xueying
    Yang, Chen
    PROCEEDINGS OF EVALUATION AND ASSESSMENT IN SOFTWARE ENGINEERING (EASE 2021), 2021, : 40 - 49
  • [44] Multiclass Geospatial Object Detection using Machine Learning-Aviation Case Study
    Dhulipudi, Durga Prasad
    Rajan, K. S.
    2020 AIAA/IEEE 39TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC) PROCEEDINGS, 2020,
  • [45] Improving platelet-RNA-based diagnostics: a comparative analysis of machine learning models for cancer detection and multiclass classification
    Jopek, Maksym A.
    Pastuszak, Krzysztof
    Sieczczynski, Michal
    Cygert, Sebastian
    Zaczek, Anna J.
    Rondina, Matthew T.
    Supernat, Anna
    MOLECULAR ONCOLOGY, 2024, 18 (11) : 2743 - 2754
  • [46] Improving Multiclass Classification of Cybersecurity Breaches in Railway Infrastructure using Imbalanced Learning
    Nebaba, Aleksandr N.
    Savvas, Ilias K.
    Butakova, Maria A.
    Chernov, Andrey V.
    Shevchuk, Petr S.
    ESSE 2021: THE 2ND EUROPEAN SYMPOSIUM ON SOFTWARE ENGINEERING, 2021, : 100 - 105
  • [47] Classification of Intrusion Detection Dataset using machine learning Approaches
    Subramanyam, Doodipalli
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON COMPUTATIONAL TECHNIQUES, ELECTRONICS AND MECHANICAL SYSTEMS (CTEMS), 2018, : 280 - 283
  • [48] A Study on Multiple Factors Affecting the Accuracy of Multiclass Skin Disease Classification
    Fan, Jiayi
    Kim, Jongwook
    Jung, Insu
    Lee, Yongkeun
    APPLIED SCIENCES-BASEL, 2021, 11 (17):
  • [49] Probability based voting extreme learning machine for multiclass XML documents classification
    Zhao, Xiangguo
    Bi, Xin
    Qiao, Baiyou
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2014, 17 (05): : 1217 - 1231
  • [50] Improving accuracy of automatic optical inspection with machine learning
    Xinyu Tong
    Ziao Yu
    Xiaohua Tian
    Houdong Ge
    Xinbing Wang
    Frontiers of Computer Science, 2022, 16