The role of conversational assistants continues to evolve beyond simple voice commands toward agents that support rich and complex tasks in the home, the car, and even virtual reality. Moving past voice-based command and control requires agents and datasets that blend structured dialogue, information seeking, grounded reasoning, and contextual question answering in a multimodal environment with rich image and video content. In this demo, we introduce Task-oriented Multimodal Agent Dialogue (TaskMAD), a new platform that supports the creation of interactive, multimodal, and task-centric datasets in a Wizard-of-Oz experimental setup. TaskMAD includes support for text and voice, federated retrieval from text collections and knowledge bases, and structured logging of interactions for offline labeling. Its architecture supports a spectrum of tasks spanning open-domain exploratory search to traditional frame-based dialogue tasks. TaskMAD is open-source and has served as a data collection platform for the Amazon Alexa Prize Taskbot challenge, the TREC Conversational Assistance track, undergraduate student research, and other efforts. TaskMAD is distributed under the MIT license.