The quest for the reliability of machine learning models in binary classification on tabular data

被引:0
|
作者
Vitor Cirilo Araujo Santos
Lucas Cardoso
Ronnie Alves
机构
[1] Federal University of Pará,
[2] PPGCC,undefined
[3] Vale Institute of Technology,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
In this paper we explore the reliability of contexts of machine learning (ML) models. There are several evaluation procedures commonly used to validate a model (precision, F1 Score and others); However, these procedures are not linked to the evaluation of learning itself, but only to the number of correct answers presented by the model. This characteristic makes it impossible to assess whether a model was able to learn through elements that make sense of the context in which it is inserted. Therefore, the model could achieves good results in the training stage but poor results when the model needs to be generalized. When there are many different models that achieve similar performance, the model that presented the highest number of hits in training does not mean that this model is the best. Therefore, we created a methodology based on Item Response Theory that allows us to identify whether an ML context is unreliable, providing an extra and different validation for ML models.
引用
收藏
相关论文
共 50 条
  • [1] The quest for the reliability of machine learning models in binary classification on tabular data
    Santos, Vitor Cirilo Araujo
    Cardoso, Lucas
    Alves, Ronnie
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [2] A machine learning framework for performing binary classification on tabular biomedical data
    Szijarto, Adam
    Fabian, Alexandra
    Lakatos, Balint Karoly
    Tolvaj, Matt
    Merkely, Btla
    Kovacs, Attila
    Tokodi, Marton
    [J]. IMAGING, 2023, 15 (01): : 1 - 6
  • [3] Demonstrating Quest: A Query-Driven Framework to Explain Classification Models on Tabular Data
    Geisler, Nadja
    Haettasch, Benjamin
    Binnig, Carsten
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (12): : 3722 - 3725
  • [4] Introducing Quest: A Query-Driven Framework to Explain Classification Models on Tabular Data
    Geisler, Nadja
    Binnig, Carsten
    [J]. WORKSHOP ON HUMAN-IN-THE-LOOP DATA ANALYTICS, HILDA 2022, 2022,
  • [5] ShinvLearner: A containerized benchmarking tool for machine-learning classification of tabular data
    Piccolo, Stephen R.
    Lee, Terry J.
    Suh, Erica
    Hill, Kimball
    [J]. GIGASCIENCE, 2020, 9 (04):
  • [6] Revisiting Deep Learning Models for Tabular Data
    Gorishniy, Yury
    Rubachev, Ivan
    Khrulkov, Valentin
    Babenko, Artem
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [7] Machine learning for question answering from tabular data
    Khalid, Mahboob Alam
    Jijkoun, Valentin
    de Rijke, Maarten
    [J]. DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 392 - +
  • [8] Likelihood contrasts: a machine learning algorithm for binary classification of longitudinal data
    Riku Klén
    Markku Karhunen
    Laura L. Elo
    [J]. Scientific Reports, 10
  • [9] Likelihood contrasts: a machine learning algorithm for binary classification of longitudinal data
    Klen, Riku
    Karhunen, Markku
    Elo, Laura L.
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [10] Developing comprehensive reporting guidelines for machine learning models using tabular data in medical research
    Gargari, Omid K.
    [J]. POSTGRADUATE MEDICAL JOURNAL, 2024,