A practical utility-based but objective approach to model selection for regression in scientific applications

Cited by: 0
Authors
Andrea Murari
Riccardo Rossi
Luca Spolladore
Michele Lungaroni
Pasquale Gaudio
Michela Gelfusa
Affiliations
[1] Consorzio RFX (CNR, ENEA, INFN, Università di Padova, Acciaierie Venete SpA)
[2] Istituto per la Scienza e la Tecnologia dei Plasmi, CNR
[3] Department of Industrial Engineering, University of Rome “Tor Vergata”
Source
Artificial Intelligence Review, 2023, 56 (Suppl 2): 2825-2859
Keywords
Model selection criteria; Bayesian Information Criterion (BIC); Akaike Information Criterion (AIC); Shannon entropy; Goodness of fit tests; Mutual information; Feedback loops
DOI
Not available
Abstract
In many fields of science, various types of models are available to describe phenomena, observations and experimental results. In recent decades, given the enormous advances in information-gathering technologies, machine learning techniques have also been systematically deployed to extract models from the large databases now available. However, regardless of how the models originate, no universal criterion has yet been found for selecting the most appropriate model given the data. A unique solution is probably a chimera, particularly in applications involving complex systems. Consequently, this work advocates a utility-based approach. The proposed solutions are nevertheless not purely subjective: all are based on “objective” criteria, rooted in the properties of the data, to preserve generality and to allow comparative assessment of the results. Several methods have been developed and tested to improve the discrimination capability of basic Bayesian and information-theoretic criteria, with particular attention to the BIC (Bayesian Information Criterion) and AIC (Akaike Information Criterion) indicators. The proposed advances address both the quality of the fits and the evaluation of model complexity. The competitive advantages of the individual alternatives, for both cross-sectional data and time series, are clearly identified, together with their most appropriate fields of application. The improved criteria select the right models more reliably and more efficiently in terms of data requirements, and can be adjusted to very different circumstances and applications. Particular attention has been paid to ensuring that the developed versions of the indicators are easy to implement in practice, in both confirmatory and exploratory settings. Extensive numerical tests have been performed to support the conceptual and theoretical considerations.
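For orientation, the baseline indicators that the abstract refers to can be sketched in a few lines. The example below uses the standard textbook forms for least-squares regression with Gaussian residuals, AIC = n ln(RSS/n) + 2k and BIC = n ln(RSS/n) + k ln(n), where n is the number of points, k the number of fitted parameters and RSS the residual sum of squares. It does not reproduce the improved versions proposed in the paper. The synthetic data, the two candidate models and all function names are assumptions made purely for illustration.

```python
import math
import random


def aic_bic(residuals, k):
    """Return (AIC, BIC) in their Gaussian least-squares form.

    Both values omit an additive constant that is identical for every
    model fitted to the same data, so only differences are meaningful.
    k counts the fitted regression parameters (illustrative convention).
    """
    n = len(residuals)
    rss = sum(r * r for r in residuals)
    base = n * math.log(rss / n)
    return base + 2 * k, base + k * math.log(n)


def fit_constant(x, y):
    """Residuals of the constant model y = c (one parameter)."""
    mean_y = sum(y) / len(y)
    return [yi - mean_y for yi in y]


def fit_line(x, y):
    """Residuals of the straight-line model y = a + b*x (two parameters)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = my - slope * mx
    return [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]


# Hypothetical synthetic data: a straight line plus Gaussian noise.
random.seed(0)
x = [i / 99 for i in range(100)]
y = [1.0 + 2.0 * xi + random.gauss(0.0, 0.1) for xi in x]

# Candidate models: (fitting function, number of parameters).
candidates = {"constant": (fit_constant, 1), "linear": (fit_line, 2)}
scores = {name: aic_bic(fit(x, y), k) for name, (fit, k) in candidates.items()}

# The model with the lowest indicator value is preferred.
best_by_aic = min(scores, key=lambda name: scores[name][0])
best_by_bic = min(scores, key=lambda name: scores[name][1])
print(scores, best_by_aic, best_by_bic)
```

Both indicators add a complexity penalty to a goodness-of-fit term. The BIC penalty grows with ln(n), so for n larger than about 7 it penalises additional parameters more heavily than AIC.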
Pages: 2825-2859
Page count: 34
Related papers
  • [1] A practical utility-based but objective approach to model selection for regression in scientific applications
    Murari, Andrea
    Rossi, Riccardo
    Spolladore, Luca
    Lungaroni, Michele
    Gaudio, Pasquale
    Gelfusa, Michela
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (Suppl 2) : 2825 - 2859
  • [2] Utility-based regression
    Torgo, Luis
    Ribeiro, Rita
    [J]. KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2007, PROCEEDINGS, 2007, 4702 : 597 - +
  • [3] A utility-based adaptivity model for mobile applications
    Alia, Mourad
    Eide, Viktor S. Wold
    Paspallis, Nearchos
    Eliassen, Frank
    Hallsteinsen, Svein O.
    Papadopoulos, George A.
    [J]. 21ST INTERNATIONAL CONFERENCE ON ADVANCED NETWORKING AND APPLICATIONS WORKSHOPS/SYMPOSIA, VOL 2, PROCEEDINGS, 2007, : 556 - +
  • [4] A utility-based approach for customised cloud service selection
    Jrad, Foued
    Tao, Jie
    Streit, Achim
    Knapper, Rico
    Flath, Christoph
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2015, 10 (1-2) : 32 - 44
  • [6] A gid utility-based model for interior renovations selection
    Chiang, Ting-Yi
    Chu, Chien-Chien
    Shen, Hsu-Ming
    Chiu, Yu-Fa
    [J]. JOURNAL OF ASIAN ARCHITECTURE AND BUILDING ENGINEERING, 2021, 20 (03) : 249 - 259
  • [7] Utility-based sensor selection
    Bian, Fang
    Kempe, David
    Govindan, Ramesh
    [J]. IPSN 2006: THE FIFTH INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING IN SENSOR NETWORKS, 2006, : 11 - 18
  • [8] A Utility-based QoS Model for Emerging Multimedia Applications
    Mu, Mu
    Mauthe, Andreas
    Garcia, Francisco
    [J]. NGMAST 2008: SECOND INTERNATIONAL CONFERENCE ON NEXT GENERATION MOBILE APPLICATIONS, SERVICES, AND TECHNOLOGIES, PROCEEDINGS, 2008, : 521 - +
  • [9] Design of an Adaptive Framework for Utility-based Optimization of Scientific Applications in the Cloud
    Koehler, Martin
    Benkner, Siegfried
    [J]. 2012 IEEE/ACM FIFTH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC 2012), 2012, : 303 - 308
  • [10] UTILITY-BASED STATISTICAL SELECTION PROCEDURES
    Sun, Guowei
    Li, Yunchuan
    Fu, Michael C.
    [J]. 2019 WINTER SIMULATION CONFERENCE (WSC), 2019, : 3416 - 3427