Evaluating Machine Learning Algorithms for Applications with Humans in the Loop

被引：0

作者：

Gopalakrishna, Aravind Kota ^{[1
]}

Ozcelebi, Tanir ^{[1
]}

Lukkien, Johan J. ^{[1
]}

Liotta, Antonio ^{[2
]}

机构：

[1] Eindhoven Univ Technol, Dept Math & Comp Sci, Syst Architecture & Networking Grp, Eindhoven, Netherlands

[2] Eindhoven Univ Technol, Dept Elect Engn, Electroopt Commun, Eindhoven, Netherlands

来源：

PROCEEDINGS OF THE 2017 IEEE 14TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC 2017) | 2017年

关键词：

SCALES;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Applications employing data classification such as smart lighting that involve human factors such as perception lead to non-deterministic input-output relationships where more than one output may be acceptable for a given input. For these so called non-deterministic multiple output classification (nDMOC) problems, the relationship between the input and output may change over time making it difficult for the machine learning (ML) algorithms in a batch setting to make predictions for a given context. In this paper, we describe the nature of nDMOC problems and discuss the Relevance Score (RS) that is suitable in this context as a performance metric. RS determines the extent by which a predicted output is relevant to the user's context and behaviors, taking into account the inconsistencies that come with human (perception) factors. We tailor the RS metric so that it can be used to evaluate ML algorithms in an online setting at run-time. We assess the performance of a number of ML algorithms, using a smart lighting dataset with non-deterministic one-to-many input-output relationships. The results indicate that using RS instead of classification accuracy (CA) is suitable to analyze the performance of conventional ML algorithms applied to the category of nDMOC problems. Instance-based online ML gives the best RS performance. An interesting finding is that the RS keeps increasing with increasing number of samples, even after the CA performance converges.

引用

页码：459 / 464

页数：6

共 50 条

[31] An Analytical Framework for Evaluating Successful Poisoning Attacks on Machine Learning Algorithms
M. Surekha
Anil Kumar Sagar
Vineeta Khemchandani
SN Computer Science, 6 (4)
[32] Evaluating QoE in VoIP networks with QoS mapping and machine learning algorithms
ZhiGuo Hu
HongRen Yan
Tao Yan
HaiJun Geng
GuoQing Liu
NEUROCOMPUTING, 2020, 386 : 63 - 83
[33] Evaluating the Factors and Forecasting Childhood Anemia Through Machine Learning Algorithms
Salma, Nahid
Ali, Majid Khan Majahar
MALAYSIAN JOURNAL OF FUNDAMENTAL AND APPLIED SCIENCES, 2025, 21 (01): : 1529 - 1541
[34] Evaluating the Performance of Machine Learning Algorithms in Gaze Gesture Recognition Systems
Li, Jiayao
Ray, Samantha
Rajanna, Vijay
Hammond, Tracy
IEEE ACCESS, 2022, 10 : 1020 - 1035
[35] Human-in-the-loop machine learning with applications for population health
Chen, Long
Wang, Jiangtao
Guo, Bin
Chen, Liming
CCF TRANSACTIONS ON PERVASIVE COMPUTING AND INTERACTION, 2023, 5 (01) : 1 - 12
[36] Evaluating the Performance of Machine Learning Algorithms in Predicting the Best Bank Customers
Ehsanifar, Mohammad
Dekamini, Fatemeh
Mehdiabadi, Amir
Khazaei, Moein
Spulbar, Cristi
Birau, Ramona
Filip, Robert dorin
ANNALS OF THE UNIVERSITY OF CRAIOVA-MATHEMATICS AND COMPUTER SCIENCE SERIES, 2023, 50 (02): : 464 - 475
[37] Evaluating the Performance of Machine Learning Sentiment Analysis Algorithms in Software Engineering
Shen, Jingyi
Baysal, Olga
Shafiq, M. Omair
IEEE 17TH INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP / IEEE 17TH INT CONF ON PERVAS INTELLIGENCE AND COMP / IEEE 5TH INT CONF ON CLOUD AND BIG DATA COMP / IEEE 4TH CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2019, : 1023 - 1030
[38] Evaluating different machine learning algorithms for snow water equivalent prediction
Vafakhah, Mehdi
Khiavi, Ali Nasiri
Janizadeh, Saeid
Ganjkhanlo, Hojatolah
EARTH SCIENCE INFORMATICS, 2022, 15 (04) : 2431 - 2445
[39] Evaluating the Performance of Various Machine Learning Algorithms to Detect Subclinical Keratoconus
Cao, Ke
Verspoor, Karin
Sahebjada, Srujana
Baird, Paul N.
TRANSLATIONAL VISION SCIENCE & TECHNOLOGY, 2020, 9 (02):
[40] Evaluating the Accuracy of Machine Learning, Deep Learning and Hybrid Algorithms for Flood Routing Calculations
Sarigol, Metin
PURE AND APPLIED GEOPHYSICS, 2024, 181 (12) : 3485 - 3506

← 1 2 3 4 5 →