A Novel feature reduction method to improve the performance of Machine Learning model

被引:1
|
作者
Mirniaharikandehei, Seyedehnafiseh [1 ]
Heidari, Morteza [1 ]
Danala, Gopichandh [1 ]
Lakshmivarahan, Sivaramakrishnan [2 ]
Zheng, Bin [1 ]
机构
[1] Univ Oklahoma, Sch Elect & Comp Engn, Norman, OK 73019 USA
[2] Univ Oklahoma, Sch Comp Sci, Norman, OK 73019 USA
关键词
GASTRIC-CANCER; RADIOMICS; DIAGNOSIS;
D O I
10.1117/12.2580732
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Developing radiomic based machine learning models has drawn considerable attention in recent years. However, identifying a small and optimal feature vector to build a robust machine learning models has always been a controversial issue. In this study, we investigated the feasibility of applying a random projection algorithm to create an optimal feature vector from the CAD-generated large feature pool and improve the performance of the machine learning model. We assemble a retrospective dataset involving abdominal computed tomography (CT) images acquired from 188 patients diagnosed with gastric cancer. Among them, 141 cases have peritoneal metastasis (PM), while 47 cases do not have PM. A computer-aided detection (CAD) scheme is applied to segment the gastric tumor area and computes 325 image features. Then, two Logistic Regression models embedded with two different feature dimensionality reduction methods, namely, the principal component analysis (PCA) and a random projection algorithm (RPA). Afterward, a synthetic minority oversampling technique (SMO1E) is used to balance the dataset. The proposed ML model is built to predict the risk of the patients having advanced gastric cancer (AGC). All Logistic Regression models are trained and tested using a leave-one-case-out cross-validation method. Results show that the logistic regression embedded with RPA yielded a significantly higher AUC value (0.69 +/- 0.025) than using PCA (0.62 +/- 0.014) (p<0.05). The study demonstrated that CT images of the gastric tumors contain discriminatory information to predict the risk of PM in AGC patients, and RPA is a promising method to generate optimal feature vector, improving the performance of ML models of medical images.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Automated Feature Reduction in Machine Learning
    Shilane, David
    [J]. 2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2022, : 45 - 49
  • [2] Heuristic Model to Improve Feature Selection Based on Machine Learning in Data Mining
    Majumdar, Jahin
    Mal, Anwesha
    Gupta, Shruti
    [J]. 2016 6TH INTERNATIONAL CONFERENCE - CLOUD SYSTEM AND BIG DATA ENGINEERING (CONFLUENCE), 2016, : 73 - 77
  • [3] Novel Feature Reduction (NFR) Model With Machine Learning and Data Mining Algorithms for Effective Disease Risk Prediction
    Pasha, Syed Javeed
    Mohamed, E. Syed
    [J]. IEEE ACCESS, 2020, 8 : 184087 - 184108
  • [4] A Novel Driver Performance Model Based on Machine Learning
    Aksjonov, Andrei
    Nedoma, Pavel
    Vodovozov, Valery
    Petlenkov, Eduard
    Herrmann, Martin
    [J]. IFAC PAPERSONLINE, 2018, 51 (09): : 267 - 272
  • [5] Employing feature engineering strategies to improve the performance of machine learning algorithms on echocardiogram dataset
    Huang, Huang-Nan
    Chen, Hong-Ming
    Lin, Wei-Wen
    Huang, Chau-Jian
    Chen, Yung-Cheng
    Wang, Yu-Huei
    Yang, Chao-Tung
    [J]. DIGITAL HEALTH, 2023, 9
  • [6] Voice Feature Selection to Improve Performance of Machine Learning Models for Voice Production Inversion
    Zhang, Zhaoyan
    [J]. JOURNAL OF VOICE, 2023, 37 (04) : 479 - 485
  • [7] Performance Evaluation of Feature Extraction and Dimensionality Reduction Techniques on Various machine learning classifiers
    Sarowar, Md. Golam
    Jamal, Arthy Anjum
    Saha, Anik
    Saha, Abir
    [J]. PROCEEDINGS OF THE 2019 IEEE 9TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (IACC 2019), 2019, : 19 - 24
  • [8] An effective model for observational learning to improve novel motor performance
    Kawasaki, Tsubasa
    Aramaki, Hidefumi
    Tozawa, Ryosuke
    [J]. JOURNAL OF PHYSICAL THERAPY SCIENCE, 2015, 27 (12) : 3829 - 3832
  • [9] DPDR: A Novel Machine Learning Method for the Decision Process for Dimensionality Reduction
    Dessureault J.-S.
    Massicotte D.
    [J]. SN Computer Science, 5 (1)
  • [10] A Machine Learning Model to Classify the Feature Model Maintainability
    Silva, Publio
    Bezerra, Carla I. M.
    Machado, Ivan
    [J]. SPLC '21: PROCEEDINGS OF THE 25TH ACM INTERNATIONAL SYSTEMS AND SOFTWARE PRODUCT LINE CONFERENCE, VOL A, 2021,