Machine Learning Made Easy (MLme): a comprehensive toolkit for machine learning-driven data analysis

被引:0
|
作者
Akshay, Akshay [1 ,2 ]
Katoch, Mitali [3 ]
Shekarchizadeh, Navid [4 ,5 ]
Abedi, Masoud [4 ]
Sharma, Ankush [6 ,7 ]
Burkhard, Fiona C. [1 ,8 ]
Adam, Rosalyn M. [9 ]
Monastyrskaya, Katia [1 ,8 ]
Gheinani, Ali Hashemi [1 ,8 ,9 ,10 ,11 ]
机构
[1] Univ Bern, Dept Biomed Res DBMR, Funct Urol Res Grp, CH-3008 Bern, Switzerland
[2] Univ Bern, Grad Sch Cellular & Biomed Sci, CH-3012 Bern, Switzerland
[3] Friedrich Alexander Univ Erlangen Nurnberg FAU, Inst Neuropathol, Univ Klinikum Erlangen, Erlangen, Germany
[4] Univ Leipzig, Dept Med Data Sci, Med Ctr, D-04107 Leipzig, Germany
[5] Ctr Scalable Data Analyt & Artificial Intelligence, D-04105 Leipzig, Germany
[6] Univ Oslo, Inst Clin Med, KG Jebsen Ctr Cell Malignancies B, N-0318 Oslo, Norway
[7] Oslo Univ Hosp, Inst Canc Res, Dept Canc Immunol, N-0310 Oslo, Norway
[8] Univ Hosp, Dept Urol, Inselspital, Bern, Switzerland
[9] Boston Childrens Hosp, Urol Dis Res Ctr, Boston, MA 02115 USA
[10] Harvard Med Sch, Dept Surg, Boston, MA 02115 USA
[11] Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
来源
GIGASCIENCE | 2024年 / 13卷
基金
瑞士国家科学基金会;
关键词
machine learning; classification problems; data analysis; AutoML; visualization;
D O I
10.1093/gigascience/giad111
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Machine learning (ML) has emerged as a vital asset for researchers to analyze and extract valuable information from complex datasets. However, developing an effective and robust ML pipeline can present a real challenge, demanding considerable time and effort, thereby impeding research progress. Existing tools in this landscape require a profound understanding of ML principles and programming skills. Furthermore, users are required to engage in the comprehensive configuration of their ML pipeline to obtain optimal performance.Results To address these challenges, we have developed a novel tool called Machine Learning Made Easy (MLme) that streamlines the use of ML in research, specifically focusing on classification problems at present. By integrating 4 essential functionalities-namely, Data Exploration, AutoML, CustomML, and Visualization-MLme fulfills the diverse requirements of researchers while eliminating the need for extensive coding efforts. To demonstrate the applicability of MLme, we conducted rigorous testing on 6 distinct datasets, each presenting unique characteristics and challenges. Our results consistently showed promising performance across different datasets, reaffirming the versatility and effectiveness of the tool. Additionally, by utilizing MLme's feature selection functionality, we successfully identified significant markers for CD8+ naive (BACH2), CD16+ (CD16), and CD14+ (VCAN) cell populations.Conclusion MLme serves as a valuable resource for leveraging ML to facilitate insightful data analysis and enhance research outcomes, while alleviating concerns related to complex coding scripts. The source code and a detailed tutorial for MLme are available at https://github.com/FunctionalUrology/MLme.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Machine Learning-Driven Algorithms for Network Anomaly Detection
    Islam, Md Sirajul
    Rouf, Mohammad Abdur
    Parvez, A. H. M. Shahariar
    Podder, Prajoy
    [J]. INVENTIVE COMPUTATION AND INFORMATION TECHNOLOGIES, ICICIT 2021, 2022, 336 : 493 - 507
  • [22] Machine learning-driven critical care decision making
    Coates, James T.
    de Koning, Christiaan
    [J]. JOURNAL OF THE ROYAL SOCIETY OF MEDICINE, 2022, 115 (06) : 236 - 238
  • [23] Machine Learning-Driven Digital Technologies for Educational Innovation
    Campos, Esmeralda
    Reveles, Sofia
    [J]. Future of Educational Innovation Workshop Series - Machine Learning-Driven Digital Technologies for Educational Innovation Workshop 2021, 2021,
  • [24] Machine learning-driven optimization of enterprise resource planning (ERP) systems: a comprehensive review
    Jawad, Zainab Nadhim
    Balazs, Villanyi
    [J]. BENI-SUEF UNIVERSITY JOURNAL OF BASIC AND APPLIED SCIENCES, 2024, 13 (01)
  • [25] Machine learning-driven optimization of enterprise resource planning (ERP) systems: a comprehensive review
    Zainab Nadhim Jawad
    Villányi Balázs
    [J]. Beni-Suef University Journal of Basic and Applied Sciences, 13
  • [26] Special issue on machine learning-driven big data and blockchain techniques for communication
    Mumtaz, Shahid
    Srivastava, Gautam
    Wei, Wei
    [J]. DIGITAL COMMUNICATIONS AND NETWORKS, 2023, 9 (01) : 1 - 2
  • [27] Machine Learning-Driven Event Characterization under Scarce Vehicular Sensing Data
    Taherifard, Nima
    Simsek, Murat
    Lascelles, Charles
    Kantarci, Burak
    [J]. 2020 IEEE 25TH INTERNATIONAL WORKSHOP ON COMPUTER AIDED MODELING AND DESIGN OF COMMUNICATION LINKS AND NETWORKS (CAMAD), 2020,
  • [28] Study on Using Machine Learning-Driven Classification for Analysis of the Disparities between Categorized Learning Outcomes
    Kowalska, Aleksandra
    Banasiak, Robert
    Stando, Jacek
    Wrobel-Lachowska, Magdalena
    Kozlowska, Adrianna
    Romanowski, Andrzej
    [J]. ELECTRONICS, 2022, 11 (22)
  • [29] Machine learning made easy for optimizing chemical reactions
    Jason E. Hein
    [J]. Nature, 2021, 590 : 40 - 41
  • [30] Machine learning made easy for optimizing chemical reactions
    Hein, Jason E.
    [J]. NATURE, 2021, 590 (7844) : 40 - 41