MMConv: An Environment for Multimodal Conversational Search across Multiple Domains

被引:23
|
作者
Liao, Lizi [1 ,2 ]
Long, Le Hong [2 ]
Zhang, Zheng [3 ]
Huang, Minlie [3 ]
Chua, Tat-Seng [2 ]
机构
[1] Sea NExT Joint Lab, Singapore, Singapore
[2] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[3] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
关键词
datasets; multimodal dialogue; conversational search;
D O I
10.1145/3404835.3462970
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although conversational search has become a hot topic in both dialogue research and IR community, the real breakthrough has been limited by the scale and quality of datasets available. To address this fundamental obstacle, we introduce the Multimodal Multi-domain Conversational dataset (MMConv), a fully annotated collection of human-to-human role-playing dialogues spanning over multiple domains and tasks. The contribution is two-fold. First, beyond the task-oriented multimodal dialogues among user and agent pairs, dialogues are fully annotated with dialogue belief states and dialogue acts. More importantly, we create a relatively comprehensive environment for conducting multimodal conversational search with real user settings, structured venue database, annotated image repository as well as crowd-sourced knowledge database. A detailed description of the data collection procedure along with a summary of data structure and analysis is provided. Second, a set of benchmark results for dialogue state tracking, conversational recommendation, response generation as well as a unified model for multiple tasks are reported. We adopt the state-of-the-art methods for these tasks respectively to demonstrate the usability of the data, discuss limitations of current methods and set baselines for future studies.
引用
收藏
页码:675 / 684
页数:10
相关论文
共 50 条
  • [1] Priming exploration across domains: does search in a spatial environment influence search in a cognitive environment?
    Anvari, Farid
    Marchiori, Davide
    [J]. ROYAL SOCIETY OPEN SCIENCE, 2021, 8 (08):
  • [2] The Next Generation Multimodal Conversational Search and Recommendation
    Magalhaes, Joao
    Chua, Tat-Seng
    Mei, Tao
    Smeaton, Alan
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 953 - 954
  • [3] Multimodal Across Domains Gaze Target Detection
    Tonini, Francesco
    Beyan, Cigdem
    Ricci, Elisa
    [J]. PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2022, 2022, : 420 - 431
  • [4] Topic-Guided Conversational Recommender in Multiple Domains
    Liao, Lizi
    Takanobu, Ryuichi
    Ma, Yunshan
    Yang, Xun
    Huang, Minlie
    Tat-Seng Chua
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (05) : 2485 - 2496
  • [5] Discovering Experts across Multiple Domains
    Pal, Aditya
    [J]. SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 923 - 926
  • [6] Multiple feature interpretation across domains
    Sha, K
    Gurumoorthy, B
    [J]. COMPUTERS IN INDUSTRY, 2000, 42 (01) : 13 - 32
  • [7] Secure Resource Provisioning Across Multiple Domains
    Mano, Toru
    Mizutani, Kimihiro
    Akashi, Osamu
    [J]. 2013 IFIP/IEEE INTERNATIONAL SYMPOSIUM ON INTEGRATED NETWORK MANAGEMENT (IM 2013), 2013, : 1129 - 1134
  • [8] Exploring Consumer Risk Across Multiple Domains
    Tumbat, Gulnur
    [J]. ADVANCES IN CONSUMER RESEARCH, VOL XXXVI, 2009, 36 : 926 - 927
  • [9] Building Virtual Networks Across Multiple Domains
    Werle, Christoph
    Bless, Roland
    Papadimitriou, Panagiotis
    Houidi, Ines
    Louati, Wajdi
    Zeghlache, Djamal
    Mathy, Laurent
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2011, 41 (04) : 412 - 413
  • [10] Scalable quality of service across multiple domains
    Cobb, JA
    [J]. COMPUTER COMMUNICATIONS, 2005, 28 (18) : 1997 - 2008