Conversational Composed Retrieval with Iterative Sequence Refinement

被引:1
|
作者
Wei, Hao [1 ,3 ]
Wang, Shuhui [1 ]
Xue, Zhe [2 ]
Chen, Shengbo [4 ]
Huang, Qingming [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Comput Tech, Key Lab Intell Info Proc, Beijing, Peoples R China
[2] BUPT, Beijing Key Lab Intelligent Telecommun Software, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Henan Univ, Sch Comp & Informat Engn, Kaifeng, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Cross-modal Retrieval; Conversational Search; Sequence Modeling;
D O I
10.1145/3581783.3611885
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the progress of large-scale multimodal model pretraining, existing cross-modal retrieval techniques is accurate to align text description to the target image when they show close and clear semantic correspondence. However, in real situations, users only provide ambiguous text queries, making it difficult to retrieve the desired images. To address this issue, we introduce the conversational composed retrieval paradigm, inspired by conversational search which models complex user intent through iterative interaction. This paradigm enhances the model capacity in learning fine-grained correspondences. To train the cross-modal conversational retrieval, we propose the Iterative Refining Retrieval (IRR) framework. It formalizes the reference images and modification texts in each session as a multimodal sequence, which is fed into the generative model to predict the information in the sequence autoregressively, and ultimately predicting the target image feature. In the conversational retrieval paradigm, the model refines the learned correspondences based on the interaction in the later stage of the retrieval session, thus captures fine-grained semantic correspondence to enforce the cross-modal representation. We propose a domain-specific multimodal pretraining method and the full sequence sampling augmentation method to fully utilize the session information. Extensive experiments demonstrate that the iterative refining retrieval method achieves state-of-the-art performance on sessions of varying lengths.
引用
收藏
页码:6390 / 6399
页数:10
相关论文
共 50 条
  • [41] Towards Filling the Gap in Conversational Search: From Passage Retrieval to Conversational Response Generation
    Lajewska, Weronika
    Balog, Krisztian
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 5326 - 5330
  • [42] Iterative Refinement of an AIS Rewards System
    Wang, Karen
    Ma, Zhenjun
    Baker, Ryan S.
    Li, Yuanyuan
    ADAPTIVE INSTRUCTIONAL SYSTEMS, AIS 2022, 2022, 13332 : 113 - 125
  • [43] IS THE ITERATIVE REFINEMENT OF EIGENELEMENTS AN EXPENSIVE TECHNIQUE
    DALMEIDA, FD
    RODRIGUES, MJ
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 1987, 20 : 159 - 166
  • [44] Combining indexing and learning in iterative refinement
    Li, CS
    Castelli, V
    Smith, JR
    Bergman, L
    STORAGE AND RETRIEVAL FOR IMAGE AND VIDEO DATABASES VII, 1998, 3656 : 390 - 400
  • [45] A refinement of an iterative orthogonal projection method
    Jamil, Noreen
    Mirza, Farhaan
    Naeem, M. Asif
    Baghaei, Nilufar
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2018, 341 : 31 - 41
  • [46] Iterative refinement for symmetric eigenvalue decomposition
    Takeshi Ogita
    Kensuke Aishima
    Japan Journal of Industrial and Applied Mathematics, 2018, 35 : 1007 - 1035
  • [47] Multistage mixed precision iterative refinement
    Oktay, Eda
    Carson, Erin
    NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2022, 29 (04)
  • [48] Iterative refinement using splitting methods
    Yuan, JY
    LINEAR ALGEBRA AND ITS APPLICATIONS, 1998, 273 : 199 - 214
  • [49] Iterative refinement for linear systems and LAPACK
    Higham, NJ
    IMA JOURNAL OF NUMERICAL ANALYSIS, 1997, 17 (04) : 495 - 509
  • [50] Iterative Refinement Quantum Amplitude Estimation
    Saito, Yoshiyuki
    Lee, Xinwei
    Xie, Ningyi
    Cai, Dongsheng
    Shin, Jungpil
    Asai, Nobuyoshi
    2023 IEEE 16TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP, MCSOC, 2023, : 202 - 209