TaskMAD: A Platform for Multimodal Task-Centric Knowledge-Grounded Conversational Experimentation

被引:5
|
作者
Speggiorin, Alessandro [1 ]
Dalton, Jeffrey [1 ]
Leuski, Anton [2 ]
机构
[1] Univ Glasgow, Glasgow, Lanark, Scotland
[2] Univ Southern Calif, Inst Creat Technol, Los Angeles, CA 90007 USA
关键词
interactive search; data collection; wizard-of-oz;
D O I
10.1145/3477495.3531679
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The role of conversational assistants continues to evolve, beyond simple voice commands to ones that support rich and complex tasks in the home, car, and even virtual reality. Going beyond simple voice command and control requires agents and datasets blending structured dialogue, information seeking, grounded reasoning, and contextual question-answering in a multimodal environment with rich image and video content. In this demo, we introduce Taskoriented Multimodal Agent Dialogue (TaskMAD), a new platform that supports the creation of interactive multimodal and taskcentric datasets in a Wizard-of-Oz experimental setup. TaskMAD includes support for text and voice, federated retrieval from text and knowledge bases, and structured logging of interactions for offline labeling. Its architecture supports a spectrum of tasks that span open-domain exploratory search to traditional frame-based dialogue tasks. It's open-source and offers rich capability as a platform used to collect data for the Amazon Alexa Prize Taskbot challenge, TREC Conversational Assistance track, undergraduate student research, and others. TaskMAD is distributed under the MIT license.
引用
收藏
页码:3240 / 3244
页数:5
相关论文
共 7 条
  • [1] COMEX: A Multi-task Benchmark for Knowledge-grounded COnversational Media EXploration
    Tun, Zay Yar
    Speggiorin, Alessandro
    Dalton, Jeffrey
    Stamper, Megan
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON CONVERSATIONAL USER INTERFACES, CUI 2022, 2022,
  • [2] Knowledge-Grounded Dialogue Flow Management for Social Robots and Conversational Agents
    Grassi, Lucrezia
    Recchiuto, Carmine Tommaso
    Sgorbissa, Antonio
    INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2022, 14 (05) : 1273 - 1293
  • [3] Knowledge-Grounded Dialogue Flow Management for Social Robots and Conversational Agents
    Lucrezia Grassi
    Carmine Tommaso Recchiuto
    Antonio Sgorbissa
    International Journal of Social Robotics, 2022, 14 : 1273 - 1293
  • [4] Dicer: Dialogue-Centric Representation for Knowledge-Grounded Dialogue through Contrastive Learning
    Cho, Junhee
    Ko, Youngjoong
    PATTERN RECOGNITION LETTERS, 2023, 172 : 151 - 157
  • [5] A Knowledge-Grounded Task-Oriented Dialogue System with Hierarchical Structure for Enhancing Knowledge Selection
    Lee, Hayoung
    Jeong, Okran
    SENSORS, 2023, 23 (02)
  • [6] A task-centric knowledge graph construction method based on multi-modal representation learning for industrial maintenance automation
    Liu, Zengkun
    Lu, Yuqian
    ENGINEERING REPORTS, 2024, 6 (12)
  • [7] OmEGa(Ω): Ontology-based information extraction framework for constructing task-centric knowledge graph from manufacturing documents with large language model
    Shim, Midan
    Choi, Hyojun
    Koo, Heeyeon
    Um, Kaehyun
    Lee, Kyong-Ho
    Lee, Sanghyun
    ADVANCED ENGINEERING INFORMATICS, 2025, 64