Interactive probes: Towards action-level evaluation for dialogue systems

被引:3
|
作者
Liesenfeld, Andreas [1 ]
Dingemanse, Mark [2 ]
机构
[1] Radboud Univ Nijmegen, Ctr Language Studies, Erasmuspl 1, NL-6525 HT Nijmegen, Netherlands
[2] Radboud Univ Nijmegen, Nijmegen, Netherlands
关键词
Applied conversation analysis; conversational user interfaces; dialogue systems; usability testing; REPAIR;
D O I
10.1177/17504813241267071
中图分类号
G2 [信息与知识传播];
学科分类号
05 ; 0503 ;
摘要
Measures of 'humanness', 'coherence' or 'fluency' are the mainstay of dialogue system evaluation, but they don't target system capabilities and rarely offer actionable feedback. Reviewing recent work in this domain, we identify an opportunity for evaluation at the level of action sequences, rather than the more commonly targeted levels of whole conversations or single responses. We introduce interactive probes, an evaluation framework inspired by empirical work on social interaction that can help to systematically probe the capabilities of dialogue systems. We sketch some first probes in the domains of tellings and repair, two sequence types ubiquitous in human interaction and challenging for dialogue systems. We argue interactive probing can offer the requisite flexibility to keep up with developments in interactive language technologies and do justice to the open-endedness of action formation and ascription in interaction.
引用
收藏
页码:954 / 964
页数:11
相关论文
共 50 条
  • [21] Japanese Youth: An Interactive Dialogue: Towards Comparative Youth Research
    Toivonen, Tuukka
    Furuichi, Noritoshi
    Terachi, Mikito
    Ogawa, Tomu
    ASIA-PACIFIC JOURNAL-JAPAN FOCUS, 2012, 10 (35):
  • [22] Towards a UML for interactive systems
    Paternò, F
    ENGINEERING FOR HUMAN-COMPUTER INTERACTION, 2001, 2254 : 7 - 18
  • [23] Multimodal dialogue systems: A case study for interactive TV
    Ibrahim, A
    Johansson, P
    UNIVERSAL ACCESS: THEORETICAL PERSPECTIVES, PRACTICE, AND EXPERIENCE, 2003, 2615 : 209 - 218
  • [24] Evaluation of Multimodal Dialogue Systems
    Bavarian Archive for Speech Signals, c/o Institut für Phonetik und Sprachliche Kommunikation, Ludwig-Maximilians-Universität Münchenn, Germany
    Cogn. Technol., 2006, (617-643):
  • [25] Towards Fair Evaluation of Dialogue State Tracking by Flexible Incorporation of Turn-level Performances
    Dey, Suvodip
    Kummara, Ramamohan
    Desarkar, Maunendra Sankar
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, 2022, : 318 - 324
  • [26] Towards the Layered Evaluation of Interactive Adaptive Systems using ELECTRE TRI Method
    Dhouib, Amira
    Trabelsi, Abdelwaheb
    Kolski, Christophe
    Neji, Mahmoud
    ICSOFT: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES, 2017, : 163 - 170
  • [27] Towards Modelling Elaborateness in Argumentative Dialogue Systems
    Aicher, Annalena
    Fuchs, Marc
    Minker, Wolfgang
    Ultes, Stefan
    ARTIFICIAL INTELLIGENCE IN HCI, AI-HCI 2023, PT II, 2023, 14051 : 3 - 22
  • [28] Towards Developing Dialogue Systems with Entertaining Conversations
    Hai-Long Trieu
    Iida, Hiroyuki
    Nhien Pham Hoang Bao
    Le-Minh Nguyen
    ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2017, : 511 - 518
  • [29] Predicting microcystin concentration action-level exceedances resulting from cyanobacterial blooms in selected lake sites in Ohio
    Francy, Donna S.
    Brady, Amie M. G.
    Stelzer, Erin A.
    Cicale, Jessica R.
    Hackney, Courtney
    Dalby, Harrison D.
    Struffolino, Pamela
    Dwyer, Daryl F.
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2020, 192 (08)
  • [30] Towards Compensation Correctness in Interactive Systems
    Vaz, Catia
    Ferreira, Carla
    WEB SERVICES AND FORMAL METHODS, 2010, 6194 : 161 - +