RootInteractive tool for multidimensional statistical analysis, machine learning and analytical model validation

被引:0
|
作者
Ivanov, Marian [1 ,2 ]
Ivanov, Marian [1 ,2 ]
Eulisse, Giulio [3 ]
机构
[1] GSI Darmstadt, Darmstadt, Germany
[2] UK Bratislava, Bratislava, Slovakia
[3] CERN, Meyrin, Switzerland
关键词
D O I
10.1051/epjconf/202429506019
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The ALICE experiment [1] at CERN's LHC is specifically designed for investigating heavy ion collisions. The upgraded ALICE accommodates a tenfold increase in Pb-Pb luminosity and a two-order-of-magnitude surge in minimum bias events. To address the challenges of high detector occupancy and event pile-ups, advanced multidimensional data analysis techniques, including machine learning (ML), are indispensable. Despite ML's popularity, the complexity of its models presents interpretation challenges, and oversimplification in analysis often leads to inaccuracies. Our objective was to develop RootInteractive, a tool for multidimensional statistical analysis. This tool simplifies data analysis across dimensions, visualizes functions with uncertainties, and validates assumptions and approximations. In RootInteractive, it is crucial to easily define the functional composition of analytical parametric and non-parametric functions, exploit symmetries, and define multidimensional "invariant" functions and corresponding alarms. RootInteractive [2] adopts a declarative programming paradigm, ensuring user-friendliness for experts, students, and educators. It facilitates interactive visualization, n-dimensional histogramming/projection, and information extraction on both Python/C++ server and Javascript client. The tool supports client/server applications in Jupyter or standalone client-side applications. Through data compression, datasets with O(10(7)) entries and O(25) attributes can be interactively analyzed in a browser with O(0.500-1 GB) size. Representative downsampling and reweighting/pre-aggregation enable the effective analysis of one year of ALICE data for various purposes.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Machine learning and statistical analysis for BRDF data from computer graphics and multidimensional reflectometry
    Physikalisch-Technische Bundesanstalt, Berlin
    10587, Germany
    IAENG Int. J. Comput. Sci., 1 (22-30):
  • [2] MACHINE LEARNING AS A DIAGNOSTIC TOOL FOR VALIDATION OF SENSITIVITY
    Katsipis, I
    VALUE IN HEALTH, 2017, 20 (05) : A257 - A257
  • [3] Statistical Machine Learning Model for Uncertainty Analysis of Photovoltaic Power
    Fu, Xueqian
    Zhang, Chunyu
    Wu, Xianping
    FRONTIERS IN ENERGY RESEARCH, 2022, 10
  • [4] JS']JSAT: Java']Java Statistical Analysis Tool, a Library for Machine Learning
    Raff, Edward
    JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18
  • [5] Development and validation of an explainable machine learning model for predicting multidimensional frailty in hospitalized patients with cirrhosis
    Yang, Fang
    Li, Chaoqun
    Yang, Wanting
    He, Yumei
    Wu, Liping
    Jiang, Kui
    Sun, Chao
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (06)
  • [6] Validation of depression determinants in caregivers of dementia patients with machine learning algorithms and statistical model
    Cho, Kangrim
    Choi, Junggu
    Han, Sanghoon
    FRONTIERS IN MEDICINE, 2023, 10
  • [7] Suggestion of statistical validation on feature importance of machine learning
    Lee, Youngro
    Seo, Jongmo
    2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [8] Automated machine learning tool: The first stop for data science and statistical model building
    Gopagoni D.
    Lakshmi P.V.
    International Journal of Advanced Computer Science and Applications, 2020, (02): : 410 - 418
  • [9] Automated Machine Learning Tool: The First Stop for Data Science and Statistical Model Building
    Gopagoni, DeepaRani
    Lakshmi, P., V
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (02) : 410 - 418
  • [10] Investigating Statistical Machine Learning as a Tool for Software Development
    Patel, Kayur
    Fogarty, James
    Landay, James A.
    Harrison, Beverly
    CHI 2008: 26TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 2008, : 667 - 676