Evaluating Machine Learning Algorithms for Applications with Humans in the Loop

被引:0
|
作者
Gopalakrishna, Aravind Kota [1 ]
Ozcelebi, Tanir [1 ]
Lukkien, Johan J. [1 ]
Liotta, Antonio [2 ]
机构
[1] Eindhoven Univ Technol, Dept Math & Comp Sci, Syst Architecture & Networking Grp, Eindhoven, Netherlands
[2] Eindhoven Univ Technol, Dept Elect Engn, Electroopt Commun, Eindhoven, Netherlands
关键词
SCALES;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Applications employing data classification such as smart lighting that involve human factors such as perception lead to non-deterministic input-output relationships where more than one output may be acceptable for a given input. For these so called non-deterministic multiple output classification (nDMOC) problems, the relationship between the input and output may change over time making it difficult for the machine learning (ML) algorithms in a batch setting to make predictions for a given context. In this paper, we describe the nature of nDMOC problems and discuss the Relevance Score (RS) that is suitable in this context as a performance metric. RS determines the extent by which a predicted output is relevant to the user's context and behaviors, taking into account the inconsistencies that come with human (perception) factors. We tailor the RS metric so that it can be used to evaluate ML algorithms in an online setting at run-time. We assess the performance of a number of ML algorithms, using a smart lighting dataset with non-deterministic one-to-many input-output relationships. The results indicate that using RS instead of classification accuracy (CA) is suitable to analyze the performance of conventional ML algorithms applied to the category of nDMOC problems. Instance-based online ML gives the best RS performance. An interesting finding is that the RS keeps increasing with increasing number of samples, even after the CA performance converges.
引用
收藏
页码:459 / 464
页数:6
相关论文
共 50 条
  • [21] Quantum Machine Learning Algorithms for Drug Discovery Applications
    Batra, Kushal
    Zorn, Kimberley M.
    Foil, Daniel H.
    Minerali, Eni
    Gawriljuk, Victor O.
    Lane, Thomas R.
    Ekins, Sean
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (06) : 2641 - 2647
  • [22] Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications
    Brown, Daniel S.
    Niekum, Scott
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 7749 - 7758
  • [23] Applications of Artificial Intelligence and Machine Learning Algorithms to Crystallization
    Xiouras, Christos
    Cameli, Fabio
    Quillo, Gustavo Lunardon
    Kavousanakis, Mihail E.
    Vlachos, Dionisios G.
    Stefanidis, Georgios D.
    CHEMICAL REVIEWS, 2022, 122 (15) : 13006 - 13042
  • [24] Spatial conditioning of machine learning algorithms for geoscience applications
    Nwaila, Glen T.
    Zhang, Steven E.
    Bourdeau, Julie E.
    16TH SGA BIENNIAL MEETING, 2022, VOL 1, 2022, : 279 - 282
  • [25] Evaluating the Role of Machine Learning in Defense Applications and Industry
    Alcantara Suarez, Evaldo Jorge
    Monzon Baeza, Victor
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2023, 5 (04): : 1557 - 1569
  • [26] Machine Learning With Neuroimaging: Evaluating Its Applications in Psychiatry
    Nielsen, Ashley N.
    Barch, Deanna M.
    Petersen, Steven E.
    Schlaggar, Bradley L.
    Greene, Deanna J.
    BIOLOGICAL PSYCHIATRY-COGNITIVE NEUROSCIENCE AND NEUROIMAGING, 2020, 5 (08) : 791 - 798
  • [27] Evaluating Parallel Minibatch Training for Machine Learning Applications
    Dreiseitl, Stephan
    COMPUTER AIDED SYSTEMS THEORY - EUROCAST 2017, PT I, 2018, 10671 : 400 - 407
  • [28] Simulated playground for evaluating machine-learning algorithms for bioactivity prediction
    Thompson, Jared
    Schrodl, Stefan
    Mysinger, Michael
    Wallach, Izhar
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256
  • [29] Evaluating different machine learning algorithms for snow water equivalent prediction
    Mehdi Vafakhah
    Ali Nasiri Khiavi
    Saeid Janizadeh
    Hojatolah Ganjkhanlo
    Earth Science Informatics, 2022, 15 : 2431 - 2445
  • [30] Human-in-the-loop machine learning with applications for population health
    Long Chen
    Jiangtao Wang
    Bin Guo
    Liming Chen
    CCF Transactions on Pervasive Computing and Interaction, 2023, 5 : 1 - 12