Feature Selection and Comparative Analysis of Breast Cancer Prediction Using Clinical Data and Histopathological Whole Slide Images

被引:0
|
作者
Mohammed, Sarfaraz Ahmed [1 ]
Abeysinghe, Senuka [2 ]
Ralescu, Anca [1 ]
机构
[1] Univ Cincinnati, Dept Comp Sci, Cincinnati, OH 45221 USA
[2] Indian Hill High Sch, Ohios Coll, Credit Plus Program, Cincinnati, OH 45243 USA
关键词
Breast cancer; Machine learning; Principal component analysis; Particle swarm optimization; Feature selection; Logistic regression; Na & iuml; ve bayes classification; k-NN; Support vector machines; Random forest; K-Means; Whole slide images; TCGA; Histopathology; Deep learning; Digital image analysis; Convolutional neural network; H&E-stained images; Nuclei segmentation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Breast Carcinoma is a common cancer among women, with invasive ductal carcinoma and lobular carcinoma being the two most frequent types. Early detection is critical to prevent cancer from becoming malignant. Diagnostic tests include mammogram, ultrasound, MRI, or biopsy. Machine Learning algorithms can play a key role in analyzing complex clinical datasets to predict disease outcomes. This study uses machine learning and deep learning techniques to analyze publicly available clinical and medical image data. For clinical data, Principal Component Analysis (PCA) and Particle Swarm Optimization (PSO) are applied on the Wisconsin Breast Cancer dataset (WDBC) for feature selection and evaluate the performance of each modality in distinguishing between benign and malignant tumors. The results obtained show that the Random Forest (RF) classifier outperforms other classification algorithms using both PSO and PCA feature selections, achieving predictive accuracies of 95.7% and 97.2% respectively. The first part of the paper contains a comprehensive analysis of the two feature selection methods on clinical data to optimize predictive performance. The second part of the paper is concerned with image data. Although Histopathological Whole Slide Imaging (WSI) has been validated for a variety of pathological applications for over two decades of manual detection of cancerous tumors, it remains challenging and prone to human error. With the potential of deep learning models to aid pathologists in detecting cancer subtypes, and the increasing predictive ability of current image analysis techniques in identifying the underlying genomic data and cancer-causing mutations, the second half of the paper focusses on feature extraction using a deep convolutional neural network (U-Net) trained on WSI's from The Cancer Genome Atlas (TCGA) to accurately classify and extract relevant features. The focus is on feature extraction, nuclei-based instance segmentation, H&E-stained image extraction, and quantifying intensity information for a given WSI to classify the disease type. A comprehensive analysis of feature selection methods is presented for both clinical and medical image data.
引用
收藏
页码:1494 / 1525
页数:32
相关论文
共 50 条
  • [41] Region of interest (ROI) selection using vision transformer for automatic analysis using whole slide images
    Md Shakhawat Hossain
    Galib Muhammad Shahriar
    M. M. Mahbubul Syeed
    Mohammad Faisal Uddin
    Mahady Hasan
    Shingla Shivam
    Suresh Advani
    Scientific Reports, 13
  • [42] Survival outcome prediction of breast carcinomas on whole-slide histopathology images using deep learning
    Paul, Julian
    Bossard, Celine
    Rynkiewicz, Joseph
    Molinie, Florence
    Salhi, Sanae
    Frenel, Jean-Sebastien
    Salhi, Yahia
    Chetritt, Jerome
    JOURNAL OF CLINICAL ONCOLOGY, 2024, 42 (16)
  • [43] Detection and classification of cancer in whole slide breast histopathology images using deep convolutional networks
    Gecer, Bads
    Aksoy, Selim
    Mercan, Ezgi
    Shapiro, Linda G.
    Weaver, Donald L.
    Elmore, Joann G.
    PATTERN RECOGNITION, 2018, 84 : 345 - 356
  • [44] Automated Analysis of Histopathological Whole Slide Images to Diagnose Pediatric Heart Transplant Rejection
    Bhatia, A. K.
    Phan, J. H.
    Kothari, S.
    Shehata, B.
    Wang, M.
    JOURNAL OF HEART AND LUNG TRANSPLANTATION, 2015, 34 (04): : S327 - S327
  • [45] Detecting sebaceous carcinoma in whole-histopathological slide images using deep learning
    Funatsu, Naohiko
    Akiyama, Masato
    Tanabe, Mika
    Yoshikawa, Hiroshi
    Sonoda, Koh-Hei
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2023, 64 (08)
  • [46] Detection of Breast Cancer Through Clinical Data Using Supervised and Unsupervised Feature Selection Techniques
    Ul Haq, Amin
    Li, Jian Ping
    Saboor, Abdus
    Khan, Jalaluddin
    Wali, Samad
    Ahmad, Sultan
    Ali, Amjad
    Khan, Ghufran Ahmad
    Zhou, Wang
    IEEE ACCESS, 2021, 9 : 22090 - 22105
  • [47] A comparative study of cell nuclei attributed relational graphs for knowledge description and categorization in histopathological gastric cancer whole slide images
    Sharma, Harshita
    Zerbe, Norman
    Boeger, Christine
    Wienert, Stephan
    Hellwich, Olaf
    Hufnagl, Peter
    2017 IEEE 30TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2017, : 61 - 66
  • [48] Survival prediction on intrahepatic cholangiocarcinoma with histomorphological analysis on the whole slide images
    Xie, Jiawei
    Pu, Xiaohong
    He, Jian
    Qiu, Yudong
    Lu, Cheng
    Gao, Wei
    Wang, Xiangxue
    Lu, Haoda
    Shi, Jiong
    Xu, Yuemei
    Madabhushi, Anant
    Fan, Xiangshan
    Chen, Jun
    Xu, Jun
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 146
  • [49] Generating Region of Interests for Invasive Breast Cancer in Histopathological Whole-Slide-Image
    Patil, Shreyas Malakarjun
    Tong, Li
    Wang, May D.
    2020 IEEE 44TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2020), 2020, : 723 - 728
  • [50] Visual assessment of mitotic figures in breast cancer: a comparative study between light microscopy and whole slide images
    Lashen, Ayat
    Ibrahim, Asmaa
    Katayama, Ayaka
    Ball, Graham
    Mihai, Raluca
    Toss, Michael
    Rakha, Emad
    HISTOPATHOLOGY, 2021, 79 (06) : 913 - 925