Conversational Composed Retrieval with Iterative Sequence Refinement

被引:1
|
作者
Wei, Hao [1 ,3 ]
Wang, Shuhui [1 ]
Xue, Zhe [2 ]
Chen, Shengbo [4 ]
Huang, Qingming [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Comput Tech, Key Lab Intell Info Proc, Beijing, Peoples R China
[2] BUPT, Beijing Key Lab Intelligent Telecommun Software, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Henan Univ, Sch Comp & Informat Engn, Kaifeng, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Cross-modal Retrieval; Conversational Search; Sequence Modeling;
D O I
10.1145/3581783.3611885
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the progress of large-scale multimodal model pretraining, existing cross-modal retrieval techniques is accurate to align text description to the target image when they show close and clear semantic correspondence. However, in real situations, users only provide ambiguous text queries, making it difficult to retrieve the desired images. To address this issue, we introduce the conversational composed retrieval paradigm, inspired by conversational search which models complex user intent through iterative interaction. This paradigm enhances the model capacity in learning fine-grained correspondences. To train the cross-modal conversational retrieval, we propose the Iterative Refining Retrieval (IRR) framework. It formalizes the reference images and modification texts in each session as a multimodal sequence, which is fed into the generative model to predict the information in the sequence autoregressively, and ultimately predicting the target image feature. In the conversational retrieval paradigm, the model refines the learned correspondences based on the interaction in the later stage of the retrieval session, thus captures fine-grained semantic correspondence to enforce the cross-modal representation. We propose a domain-specific multimodal pretraining method and the full sequence sampling augmentation method to fully utilize the session information. Extensive experiments demonstrate that the iterative refining retrieval method achieves state-of-the-art performance on sessions of varying lengths.
引用
收藏
页码:6390 / 6399
页数:10
相关论文
共 50 条
  • [31] Few-Shot Conversational Dense Retrieval
    Yu, Shi
    Liu, Zhenghao
    Xiong, Chenyan
    Feng, Tao
    Liu, Zhiyuan
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 829 - 838
  • [32] Using recommendation summaries for retrieval in conversational recommenders
    McGinty, L
    Shanley, D
    Smyth, B
    ICAI '05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, 2005, : 802 - 808
  • [33] Trilingual conversational intent decoding for response retrieval
    Godslove, Julius Femi
    Nayak, Ajit Kumar
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (01) : 535 - 556
  • [34] Towards retrieval-based conversational recommendation
    Manzoor, Ahtsham
    Jannach, Dietmar
    INFORMATION SYSTEMS, 2022, 109
  • [35] CoSPLADE: Contextualizing SPLADE for Conversational Information Retrieval
    Le, Nam Hai
    Gerald, Thomas
    Formal, Thibault
    Nie, Jian-Yun
    Piwowarski, Benjamin
    Soulier, Laure
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT I, 2023, 13980 : 537 - 552
  • [36] Trilingual conversational intent decoding for response retrieval
    Julius Femi Godslove
    Ajit Kumar Nayak
    Knowledge and Information Systems, 2024, 66 (1) : 535 - 556
  • [37] Significant improvement in accuracy of multiple protein sequence alignments by iterative refinement as assessed by reference to structural alignments
    Gotoh, O
    JOURNAL OF MOLECULAR BIOLOGY, 1996, 264 (04) : 823 - 838
  • [38] History-Aware Conversational Dense Retrieval
    Mo, Fengran
    Qu, Chen
    Mao, Kelong
    Zhu, Tianyu
    Su, Zhan
    Huang, Kaiyu
    Nie, Jian-Yun
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 13366 - 13378
  • [39] Iterative refinement of computing inverse matrix
    Wu, Xinyuan
    Wu, Jun
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2009, 86 (07) : 1126 - 1134
  • [40] Information filtering via iterative refinement
    Laureti, P.
    Moret, L.
    Zhang, Y. -C.
    Yu, Y. -K.
    EUROPHYSICS LETTERS, 2006, 75 (06): : 1006 - 1012