Conversational Composed Retrieval with Iterative Sequence Refinement

被引:1
|
作者
Wei, Hao [1 ,3 ]
Wang, Shuhui [1 ]
Xue, Zhe [2 ]
Chen, Shengbo [4 ]
Huang, Qingming [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Comput Tech, Key Lab Intell Info Proc, Beijing, Peoples R China
[2] BUPT, Beijing Key Lab Intelligent Telecommun Software, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Henan Univ, Sch Comp & Informat Engn, Kaifeng, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Cross-modal Retrieval; Conversational Search; Sequence Modeling;
D O I
10.1145/3581783.3611885
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the progress of large-scale multimodal model pretraining, existing cross-modal retrieval techniques is accurate to align text description to the target image when they show close and clear semantic correspondence. However, in real situations, users only provide ambiguous text queries, making it difficult to retrieve the desired images. To address this issue, we introduce the conversational composed retrieval paradigm, inspired by conversational search which models complex user intent through iterative interaction. This paradigm enhances the model capacity in learning fine-grained correspondences. To train the cross-modal conversational retrieval, we propose the Iterative Refining Retrieval (IRR) framework. It formalizes the reference images and modification texts in each session as a multimodal sequence, which is fed into the generative model to predict the information in the sequence autoregressively, and ultimately predicting the target image feature. In the conversational retrieval paradigm, the model refines the learned correspondences based on the interaction in the later stage of the retrieval session, thus captures fine-grained semantic correspondence to enforce the cross-modal representation. We propose a domain-specific multimodal pretraining method and the full sequence sampling augmentation method to fully utilize the session information. Extensive experiments demonstrate that the iterative refining retrieval method achieves state-of-the-art performance on sessions of varying lengths.
引用
收藏
页码:6390 / 6399
页数:10
相关论文
共 50 条
  • [1] Iterative Refinement Methods for Enhanced Information Retrieval
    Zhou, Dong
    Truran, Mark
    Liu, Jianxun
    Li, Wei
    Jones, Gareth
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2014, 29 (04) : 341 - 364
  • [2] Incremental iterative retrieval and browsing for efficient conversational CBR systems
    Jurisica, I
    Glasgow, J
    Mylopoulos, J
    APPLIED INTELLIGENCE, 2000, 12 (03) : 251 - 268
  • [3] Incremental Iterative Retrieval and Browsing for Efficient Conversational CBR Systems
    Igor Jurisica
    Janice Glasgow
    John Mylopoulos
    Applied Intelligence, 2000, 12 : 251 - 268
  • [4] Iterative refinement of structure-based sequence alignments by Seed Extension
    Changhoon Kim
    Chin-Hsien Tai
    Byungkook Lee
    BMC Bioinformatics, 10
  • [5] Iterative refinement of structure-based sequence alignments by Seed Extension
    Kim, Changhoon
    Tai, Chin-Hsien
    Lee, Byungkook
    BMC BIOINFORMATICS, 2009, 10
  • [6] Iterative refinement of repeat sequence specification using constrained pattern matching
    He, Dan
    Arslan, Abdullah N.
    He, Yu
    Wu, Xindong
    PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 1199 - 1203
  • [7] Deterministic Non-Autoregressive Neural Sequence Modeling by Iterative Refinement
    Lee, Jason
    Mansimov, Elman
    Cho, Kyunghyun
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 1173 - 1182
  • [8] Sequence-structure homology recognition by iterative alignment refinement and comparative modeling
    Williams, MG
    Shirai, H
    Shi, J
    Nagendra, HG
    Mueller, J
    Mizuguchi, K
    Miguel, RN
    Lovell, SC
    Innis, CA
    Deane, CM
    Chen, L
    Campillo, N
    Burke, DF
    Blundell, TL
    de Bakker, PIW
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2001, : 92 - 97
  • [9] Iterative template refinement: Protein-fold prediction using iterative search and hybrid sequence/structure templates
    Yi, TM
    Lander, ES
    COMPUTER METHODS FOR MACROMOLECULAR SEQUENCE ANALYSIS, 1996, 266 : 322 - 339
  • [10] Recent Advances in Conversational Information Retrieval
    Gao, Jianfeng
    Xiong, Chenyan
    Bennett, Paul
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 2421 - 2424