The Chatbot Usability Scale: the Design and Pilot of a Usability Scale for Interaction with AI-Based Conversational Agents

被引:0
|
作者
Borsci, Simone [1 ,2 ]
Malizia, Alessio [3 ,4 ]
Schmettow, Martin [1 ]
van der Velde, Frank [1 ]
Tariverdiyeva, Gunay [5 ]
Balaji, Divyaa [6 ]
Chamberlain, Alan [7 ]
机构
[1] Department of Learning, Data analysis, and Technology, Cognition, Data and Education (CODE) group, Faculty of Behavioural Management and Social sciences, University of Twente, Enschede, Netherlands
[2] NIHR London In-Vitro Diagnostics Cooperative, Imperial College of London, London, United Kingdom
[3] Computer Science Department, University of Pisa, Pisa, Italy
[4] Molde University College, Molde, Norway
[5] Backbase, Amsterdam, Netherlands
[6] Faculty of Social and Behavioural Sciences, University of Amsterdam, Amsterdam, Netherlands
[7] School of Computer Science, University of Nottingham, Nottingham, United Kingdom
基金
英国科研创新办公室; 英国工程与自然科学研究理事会;
关键词
Buses - Artificial intelligence - Autonomous agents - Surveys - Usability engineering;
D O I
暂无
中图分类号
学科分类号
摘要
Standardised tools to assess a user’s satisfaction with the experience of using chatbots and conversational agents are currently unavailable. This work describes four studies, including a systematic literature review, with an overall sample of 141 participants in the survey (experts and novices), focus group sessions and testing of chatbots to (i) define attributes to assess the quality of interaction with chatbots and (ii) the designing and piloting a new scale to measure satisfaction after the experience with chatbots. Two instruments were developed: (i) A diagnostic tool in the form of a checklist (BOT-Check). This tool is a development of previous works which can be used reliably to check the quality of a chatbots experience in line with commonplace principles. (ii) A 15-item questionnaire (BOT Usability Scale, BUS-15) with estimated reliability between.76 and.87 distributed in five factors. BUS-15 strongly correlates with UMUX-LITE by enabling designers to consider a broader range of aspects usually not considered in satisfaction tools for non-conversational agents, e.g. conversational efficiency and accessibility, quality of the chatbot’s functionality and so on. Despite the convincing psychometric properties, BUS-15 requires further testing and validation. Designers can use it as a tool to assess products, thus building independent databases for future evaluation of its reliability, validity and sensitivity. © 2021, The Author(s).
引用
收藏
页码:95 / 119
相关论文
共 50 条
  • [31] An AI-based Framework for Supporting Large Scale Automated Analysis of Video Capsule Endoscopy
    Giordano, D.
    Murabito, F.
    Palazzo, S.
    Pino, C.
    Spampinato, C.
    2019 IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL & HEALTH INFORMATICS (BHI), 2019,
  • [32] Integrating Players' Perspectives in AI-Based Games: Case Studies of Player-AI Interaction Design
    Villareale, Jennifer
    Maram, Sai Siddartha
    El-Nasr, Magy Seif
    Zhu, Jichen
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON THE FOUNDATIONS OF DIGITAL GAMES, FDG 2023, 2023,
  • [33] N-Gain and System Usability Scale Analysis on Game Based Learning for Adult Learners
    Aziz, Azlan Abdul
    Adam, Intan Nur Hafawati
    Jasmis, Jamaluddin
    Elias, Shamsul Jamel
    Mansor, Shaifizat
    6TH IEEE INTERNATIONAL CONFERENCE ON RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (ICRAIE), 2021,
  • [34] SVEA: A Small-scale Benchmark for Validating the Usability of Post-hoc Explainable AI Solutions in Image and Signal Recognition
    Sattarzadeh, Sam
    Sudhakar, Mahesh
    Plataniotis, Konstantinos N.
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 4141 - 4150
  • [35] Students' attitudes towards an AI-based interview for university admissions: Scale development and intervention effects
    Shin, Namin
    Choi, Eunyoung
    Chang, Sejin
    EDUCATION AND INFORMATION TECHNOLOGIES, 2024, 29 (15) : 20055 - 20076
  • [36] AI-Based Nano-Scale Material Property Prediction for Li-Ion Batteries
    Lal, Mohit Anil
    Singh, Akashdeep
    Mzik, Ryan
    Lanjan, Amirmasoud
    Srinivasan, Seshasai
    BATTERIES-BASEL, 2024, 10 (02):
  • [37] Shaping and evaluating a system for affective computing in online higher education using a participatory design and the system usability scale
    Shingjergji, Krist
    Urlings, Corrie
    Iren, Deniz
    Klemke, Roland
    FOURTEENTH INTERNATIONAL CONFERENCE ON LEARNING ANALYTICS & KNOWLEDGE, LAK 2024, 2024, : 382 - 391
  • [38] Designing Virtual Reality-Based Conversational Agents to Train Clinicians in Verbal De-escalation Skills: Exploratory Usability Study
    Moore, Nathan
    Ahmadpour, Naseem
    Brown, Martin
    Poronnik, Philip
    Davids, Jennifer
    JMIR SERIOUS GAMES, 2022, 10 (03):
  • [39] Introducing CASUX: A Standardized Scale for Measuring the User Experience of Artificial Intelligence Based Conversational Agents
    Faruk, Lawal Ibrahim Dutsinma
    Pal, Debajyoti
    Funilkul, Suree
    Perumal, Thinagaran
    Mongkolnam, Pornchai
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2024,
  • [40] Multimodal Usability of Human-Computer Interaction Based on Task-Human-Computer-Centered Design
    Shi, Yanbin
    Li, Xiaoqi
    Ouyang, Dantong
    Jiang, Hongtao
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT I, 2017, 10361 : 459 - 468