Evaluating Machine Learning Algorithms for Applications with Humans in the Loop

Cited by: 0
Authors
Gopalakrishna, Aravind Kota [1]
Ozcelebi, Tanir [1]
Lukkien, Johan J. [1]
Liotta, Antonio [2]
Affiliations
[1] Eindhoven Univ Technol, Dept Math & Comp Sci, Syst Architecture & Networking Grp, Eindhoven, Netherlands
[2] Eindhoven Univ Technol, Dept Elect Engn, Electroopt Commun, Eindhoven, Netherlands
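Keywords
SCALES;
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract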
Applications that employ data classification and involve human factors such as perception, smart lighting being one example, lead to non-deterministic input-output relationships in which more than one output may be acceptable for a given input. For these so-called non-deterministic multiple output classification (nDMOC) problems, the relationship between input and output may change over time, making it difficult for machine learning (ML) algorithms in a batch setting to make predictions for a given context. In this paper, we describe the nature of nDMOC problems and discuss the Relevance Score (RS), a performance metric suited to this context. RS determines the extent to which a predicted output is relevant to the user's context and behaviors, taking into account the inconsistencies that come with human (perception) factors. We tailor the RS metric so that it can be used to evaluate ML algorithms in an online setting at run-time. We assess the performance of a number of ML algorithms using a smart lighting dataset with non-deterministic one-to-many input-output relationships. The results indicate that RS, rather than classification accuracy (CA), is suitable for analyzing the performance of conventional ML algorithms applied to the category of nDMOC problems. Instance-based online ML gives the best RS performance. An interesting finding is that the RS keeps increasing as the number of samples increases, even after the CA performance has converged.
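To illustrate why exact-match accuracy can understate performance when several outputs are acceptable for one input, the following is a minimal Python sketch. It does not reproduce the paper's Relevance Score, whose formula is not given in this record. The function `acceptance_weighted_score`, the toy lighting log, and the `predict` rule are all hypothetical, introduced only for illustration: the stand-in score credits a prediction by how often the user chose that output in the same context, relative to the most frequent choice.

```python
from collections import Counter, defaultdict


def classification_accuracy(samples, predict):
    """Fraction of samples whose prediction equals the single logged output."""
    hits = sum(1 for context, chosen in samples if predict(context) == chosen)
    return hits / len(samples)


def acceptance_weighted_score(samples, predict):
    """Illustrative stand-in, NOT the paper's RS formula.

    Each prediction is credited by how often the user chose that output in
    the same context, relative to the most frequently chosen output there.
    """
    counts = defaultdict(Counter)
    for context, chosen in samples:
        counts[context][chosen] += 1

    total = 0.0
    for context, _ in samples:
        seen = counts[context]
        # Counter returns 0 for outputs never chosen in this context.
        total += seen[predict(context)] / max(seen.values())
    return total / len(samples)


# Hypothetical log: in the evening the user accepts either of two settings.
samples = (
    [("evening", "warm")] * 3
    + [("evening", "dim")] * 2
    + [("morning", "bright")] * 5
)


def predict(context):
    """Hypothetical fixed rule standing in for a trained classifier."""
    return {"evening": "warm", "morning": "bright"}[context]


ca = classification_accuracy(samples, predict)
score = acceptance_weighted_score(samples, predict)
print(f"CA = {ca:.2f}, acceptance-weighted score = {score:.2f}")
```

In this toy log, exact-match accuracy penalizes the two evening samples where the user happened to pick "dim", even though "warm" was the user's most frequent choice in that context. The stand-in score does not, which is the kind of gap between CA and a relevance-style metric that the abstract describes.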
Pages: 459-464
Number of pages: 6