Scalable Multimodal Machine Learning for Cervical Cancer Detection

被引:0
|
作者
Thuan Nhan [2 ]
Upadhya, Jiblal [2 ]
Poudel, Samir [1 ]
Wagle, Satish [2 ]
Poudel, Khem [1 ]
机构
[1] Middle Tennessee State Univ, Dept Comp Sci, Murfreesboro, TN 37132 USA
[2] Middle Tennessee State Univ, Computat & Data Sci, Murfreesboro, TN 37132 USA
来源
2024 IEEE 5TH ANNUAL WORLD AI IOT CONGRESS, AIIOT 2024 | 2024年
关键词
Spark; Cervical Cancer; SparkML; SipakMed; Scalability;
D O I
10.1109/AIIoT61789.2024.10578984
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cervical cancer poses a significant global health concern, and machine learning techniques have shown great promise in early detection and diagnosis. However, the dominant focus on accuracy has overshadowed the crucial aspect of training time. This research paper investigates multimodal high-performance machine learning for cervical cancer detection, with a primary emphasis on scalability and training time optimization using Apache Spark. We conduct experiments on two distinct datasets CSV and Images - analyzing the trade-offs between training time and accuracy. Moreover, we discuss the challenges encountered during the implementation of Spark. Our study helps to shed light on Spark's potential benefits and limitations in high-performance machine learning for cervical cancer detection. Our findings indicate that Spark ML showcases remarkable scalability potential compared to traditional platforms like SkLearn for text datasets. The majority of the ML algorithms explored showcased exceptional accuracy rates, with many surpassing the impressive threshold of 98 % in text data. In particular, we have shown that, Spark helps to reduce the time complexity more than three folds when the data size increases.
引用
收藏
页码:0502 / 0510
页数:9
相关论文
共 50 条
  • [41] Automated invasive cervical cancer disease detection at early stage through suitable machine learning model
    Sohely Jahan
    M. D. Saimun Islam
    Linta Islam
    Tamanna Yesmin Rashme
    Ayesha Aziz Prova
    Bikash Kumar Paul
    M. D. Manowarul Islam
    Mohammed Khaled Mosharof
    SN Applied Sciences, 2021, 3
  • [42] Multimodal survival prediction in advanced pancreatic cancer using machine learning
    Keyl, J.
    Kasper, S.
    Wiesweg, M.
    Goetze, J.
    Schoenrock, M.
    Sinn, M.
    Berger, A.
    Nasca, E.
    Kostbade, K.
    Schumacher, B.
    Markus, P.
    Albers, D.
    Treckmann, J.
    Schmid, K. W.
    Schildhaus, H-U
    Siveke, J. T.
    Schuler, M.
    Kleesiek, J.
    ESMO OPEN, 2022, 7 (05)
  • [43] Breast cancer detection by leveraging Machine Learning
    Vaka, Anji Reddy
    Soni, Badal
    Reddy, Sudheer K.
    ICT EXPRESS, 2020, 6 (04): : 320 - 324
  • [44] Breast Cancer Detection Using Machine Learning
    Sivasangari, A.
    Ajitha, P.
    Bevishjenila
    Vimali, J. S.
    Jose, Jithina
    Gowri, S.
    MOBILE COMPUTING AND SUSTAINABLE INFORMATICS, 2022, 68 : 693 - 702
  • [45] Machine Learning Techniques for Breast Cancer Detection
    Hall, Karl
    Chang, Victor
    Mitchell, Paul
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON COMPLEXITY, FUTURE INFORMATION SYSTEMS AND RISK (COMPLEXIS), 2022, : 116 - 122
  • [46] Breast Cancer Detection with Topological Machine Learning
    Yadav, Ankur
    Yadav, Nisha
    Coskunuzer, Baris
    2023 10TH INTERNATIONAL CONFERENCE ON BIOMEDICAL AND BIOINFORMATICS ENGINEERING, ICBBE 2023, 2023, : 217 - 222
  • [47] An Insight into Machine Learning Techniques for Cancer Detection
    Chhillar I.
    Singh A.
    Journal of The Institution of Engineers (India): Series B, 2023, 104 (04) : 963 - 985
  • [48] A scalable distributed machine learning approach for attack detection in edge computing environments
    Kozik, Rafal
    Choras, Michal
    Ficco, Massimo
    Palmieri, Francesco
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 119 : 18 - 26
  • [49] Efficient and scalable covariate drift detection in machine learning systems with serverless computing
    Sisniega, Jaime Cespedes
    Rodriguez, Vicente
    Molto, German
    Garcia, alvaro Lopez
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 161 : 174 - 188
  • [50] An explainable unsupervised learning framework for scalable machine fault detection in Industry 4.0
    Asutkar, Supriya
    Tallur, Siddharth
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2023, 34 (10)