Machine Learning Made Easy (MLme): a comprehensive toolkit for machine learning-driven data analysis

被引:0
|
作者
Akshay, Akshay [1 ,2 ]
Katoch, Mitali [3 ]
Shekarchizadeh, Navid [4 ,5 ]
Abedi, Masoud [4 ]
Sharma, Ankush [6 ,7 ]
Burkhard, Fiona C. [1 ,8 ]
Adam, Rosalyn M. [9 ]
Monastyrskaya, Katia [1 ,8 ]
Gheinani, Ali Hashemi [1 ,8 ,9 ,10 ,11 ]
机构
[1] Univ Bern, Dept Biomed Res DBMR, Funct Urol Res Grp, CH-3008 Bern, Switzerland
[2] Univ Bern, Grad Sch Cellular & Biomed Sci, CH-3012 Bern, Switzerland
[3] Friedrich Alexander Univ Erlangen Nurnberg FAU, Inst Neuropathol, Univ Klinikum Erlangen, Erlangen, Germany
[4] Univ Leipzig, Dept Med Data Sci, Med Ctr, D-04107 Leipzig, Germany
[5] Ctr Scalable Data Analyt & Artificial Intelligence, D-04105 Leipzig, Germany
[6] Univ Oslo, Inst Clin Med, KG Jebsen Ctr Cell Malignancies B, N-0318 Oslo, Norway
[7] Oslo Univ Hosp, Inst Canc Res, Dept Canc Immunol, N-0310 Oslo, Norway
[8] Univ Hosp, Dept Urol, Inselspital, Bern, Switzerland
[9] Boston Childrens Hosp, Urol Dis Res Ctr, Boston, MA 02115 USA
[10] Harvard Med Sch, Dept Surg, Boston, MA 02115 USA
[11] Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
来源
GIGASCIENCE | 2024年 / 13卷
基金
瑞士国家科学基金会;
关键词
machine learning; classification problems; data analysis; AutoML; visualization;
D O I
10.1093/gigascience/giad111
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Machine learning (ML) has emerged as a vital asset for researchers to analyze and extract valuable information from complex datasets. However, developing an effective and robust ML pipeline can present a real challenge, demanding considerable time and effort, thereby impeding research progress. Existing tools in this landscape require a profound understanding of ML principles and programming skills. Furthermore, users are required to engage in the comprehensive configuration of their ML pipeline to obtain optimal performance.Results To address these challenges, we have developed a novel tool called Machine Learning Made Easy (MLme) that streamlines the use of ML in research, specifically focusing on classification problems at present. By integrating 4 essential functionalities-namely, Data Exploration, AutoML, CustomML, and Visualization-MLme fulfills the diverse requirements of researchers while eliminating the need for extensive coding efforts. To demonstrate the applicability of MLme, we conducted rigorous testing on 6 distinct datasets, each presenting unique characteristics and challenges. Our results consistently showed promising performance across different datasets, reaffirming the versatility and effectiveness of the tool. Additionally, by utilizing MLme's feature selection functionality, we successfully identified significant markers for CD8+ naive (BACH2), CD16+ (CD16), and CD14+ (VCAN) cell populations.Conclusion MLme serves as a valuable resource for leveraging ML to facilitate insightful data analysis and enhance research outcomes, while alleviating concerns related to complex coding scripts. The source code and a detailed tutorial for MLme are available at https://github.com/FunctionalUrology/MLme.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Machine Learning-Driven Language Assessment
    Settles, Burr
    LaFlair, Geoffrey T.
    Hagiwara, Masato
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2020, 8 : 247 - 263
  • [2] Machine Learning-Driven Biomaterials Evolution
    Suwardi, Ady
    Wang, FuKe
    Xue, Kun
    Han, Ming-Yong
    Teo, Peili
    Wang, Pei
    Wang, Shijie
    Liu, Ye
    Ye, Enyi
    Li, Zibiao
    Loh, Xian Jun
    [J]. ADVANCED MATERIALS, 2022, 34 (01)
  • [3] A machine learning-driven web application for sign language learning
    Orovwode, Hope
    Ibukun, Oduntan
    Abubakar, John Amanesi
    [J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [4] Extending OpenMP for Machine Learning-Driven Adaptation
    Liao, Chunhua
    Wang, Anjia
    Georgakoudis, Giorgis
    de Supinski, Bronis R.
    Yan, Yonghong
    Beckingsale, David
    Gamblin, Todd
    [J]. ACCELERATOR PROGRAMMING USING DIRECTIVES, WACCPD 2021, 2022, 13194 : 49 - 69
  • [5] Machine learning-driven new material discovery
    Cai, Jiazhen
    Chu, Xuan
    Xu, Kun
    Li, Hongbo
    Wei, Jing
    [J]. NANOSCALE ADVANCES, 2020, 2 (08): : 3115 - 3130
  • [6] Machine Learning-Driven SERS Nanoendoscopy and Optophysiology
    Chisanga, Malama
    Masson, Jean-Francois
    [J]. ANNUAL REVIEW OF ANALYTICAL CHEMISTRY, 2024, 17 : 313 - 338
  • [7] A Layered Quality Framework for Machine Learning-driven Data and Information Models
    Azimi, Shelernaz
    Pahl, Claus
    [J]. PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS (ICEIS), VOL 1, 2020, : 579 - 587
  • [8] Recent machine learning-driven investigations into high entropy alloys: A comprehensive review
    Yan, Yonggang
    Hu, Xunxiang
    Liao, Yalin
    Zhou, Yanyao
    He, Wenhao
    Zhou, Ting
    [J]. Journal of Alloys and Compounds, 2025, 1010
  • [9] Machine Learning-Driven RAM Analysis Using Multi-variate Sensor Data
    Gugaratshan, Guga
    Barthlow, Dakota
    Lingenfelser, Dan
    Thumati, Balaje
    [J]. 2023 ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, RAMS, 2023,
  • [10] Machine Learning-Driven Virtual Bidding With Electricity Market Efficiency Analysis
    Li, Yinglun
    Yu, Nanpeng
    Wang, Wei
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2022, 37 (01) : 354 - 364