The quest for the reliability of machine learning models in binary classification on tabular data

被引：0

作者：

Vitor Cirilo Araujo Santos

Lucas Cardoso

Ronnie Alves

机构：

[1] Federal University of Pará,

[2] PPGCC,undefined

[3] Vale Institute of Technology,undefined

来源：

Scientific Reports | / 13卷

关键词：

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In this paper we explore the reliability of contexts of machine learning (ML) models. There are several evaluation procedures commonly used to validate a model (precision, F1 Score and others); However, these procedures are not linked to the evaluation of learning itself, but only to the number of correct answers presented by the model. This characteristic makes it impossible to assess whether a model was able to learn through elements that make sense of the context in which it is inserted. Therefore, the model could achieves good results in the training stage but poor results when the model needs to be generalized. When there are many different models that achieve similar performance, the model that presented the highest number of hits in training does not mean that this model is the best. Therefore, we created a methodology based on Item Response Theory that allows us to identify whether an ML context is unreliable, providing an extra and different validation for ML models.

引用

共 50 条

[1] The quest for the reliability of machine learning models in binary classification on tabular data
Santos, Vitor Cirilo Araujo
Cardoso, Lucas
Alves, Ronnie
[J]. SCIENTIFIC REPORTS, 2023, 13 (01)
[2] A machine learning framework for performing binary classification on tabular biomedical data
Szijarto, Adam
Fabian, Alexandra
Lakatos, Balint Karoly
Tolvaj, Matt
Merkely, Btla
Kovacs, Attila
Tokodi, Marton
[J]. IMAGING, 2023, 15 (01): : 1 - 6
[3] Demonstrating Quest: A Query-Driven Framework to Explain Classification Models on Tabular Data
Geisler, Nadja
Haettasch, Benjamin
Binnig, Carsten
[J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (12): : 3722 - 3725
[4] Introducing Quest: A Query-Driven Framework to Explain Classification Models on Tabular Data
Geisler, Nadja
Binnig, Carsten
[J]. WORKSHOP ON HUMAN-IN-THE-LOOP DATA ANALYTICS, HILDA 2022, 2022,
[5] ShinvLearner: A containerized benchmarking tool for machine-learning classification of tabular data
Piccolo, Stephen R.
Lee, Terry J.
Suh, Erica
Hill, Kimball
[J]. GIGASCIENCE, 2020, 9 (04):
[6] Revisiting Deep Learning Models for Tabular Data
Gorishniy, Yury
Rubachev, Ivan
Khrulkov, Valentin
Babenko, Artem
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[7] Machine learning for question answering from tabular data
Khalid, Mahboob Alam
Jijkoun, Valentin
de Rijke, Maarten
[J]. DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 392 - +
[8] Likelihood contrasts: a machine learning algorithm for binary classification of longitudinal data
Riku Klén
Markku Karhunen
Laura L. Elo
[J]. Scientific Reports, 10
[9] Likelihood contrasts: a machine learning algorithm for binary classification of longitudinal data
Klen, Riku
Karhunen, Markku
Elo, Laura L.
[J]. SCIENTIFIC REPORTS, 2020, 10 (01)
[10] Developing comprehensive reporting guidelines for machine learning models using tabular data in medical research
Gargari, Omid K.
[J]. POSTGRADUATE MEDICAL JOURNAL, 2024,

← 1 2 3 4 5 →