Leveraging Style and Content features for Text Conditioned Image Retrieval

被引:4
|
作者
Chawla, Pranit [1 ]
Jandial, Surgan [2 ]
Badjatiya, Pinkesh [3 ]
Chopra, Ayush [4 ]
Sarkar, Mausoom [3 ]
Krishnamurthy, Balaji [3 ]
机构
[1] IIT Kharagpur, Kharagpur, W Bengal, India
[2] IIT Hyderabad, Kandi, Telangana, India
[3] Adobe, Media & Data Sci Res Lab, San Jose, CA USA
[4] MIT, Cambridge, MA 02139 USA
关键词
D O I
10.1109/CVPRW53098.2021.00448
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image Search is a fundamental task playing a significant role in the success of wide variety of frameworks and applications. However, with the increasing sizes of product catalogues and the number of attributes per product, it has become difficult for users to express their needs effectively. Therefore, we focus on the problem of Image Retrieval with Text Feedback, which involves retrieving modified images according to the natural language feedback provided by users. In this work, we hypothesise that since an image can be delineated by its content and style features, modifications to the image can also take place in the two sub spaces respectively. Hence, we decompose an input image into its corresponding style and content features, apply modification of the text feedback individually in both the style and content spaces and finally fuse them for retrieval. Our experiments show that our approach outperforms a recent state of the art method in this task, TIRG, that seeks to use a single vector in contrast to leveraging the modification via text over style and content spaces separately.
引用
收藏
页码:3973 / 3977
页数:5
相关论文
共 50 条
  • [21] Combining novel features for content based image retrieval
    Prakash, K. Satya Sai
    Sundaram, R. M. D.
    2007 14TH INTERNATIONAL WORKSHOP ON SYSTEMS, SIGNALS, & IMAGE PROCESSING & EURASIP CONFERENCE FOCUSED ON SPEECH & IMAGE PROCESSING, MULTIMEDIA COMMUNICATIONS & SERVICES, 2007, : 125 - +
  • [22] Content based image retrieval using image features information fusion
    Ahmed, Khawaja Tehseen
    Ummesafi, Shahida
    Iqbal, Amjad
    INFORMATION FUSION, 2019, 51 : 76 - 99
  • [23] Fuzzy aggregation of image features in content-based image retrieval
    Kushki, A
    Androutsos, P
    Plataniotis, KN
    Venetsanopoulos, AN
    2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2002, : 909 - 912
  • [24] Leveraging Deep Features Enhance and Semantic-Preserving Hashing for Image Retrieval
    Zhao, Xusheng
    Liu, Jinglei
    ELECTRONICS, 2022, 11 (15)
  • [25] Leveraging Image Visual Features in Content-Based Recommender System
    Deng, Fuhu
    Ren, Panlong
    Qin, Zhen
    Huang, Gu
    Qin, Zhiguang
    SCIENTIFIC PROGRAMMING, 2018, 2018
  • [26] Effective conditioned and composed image retrieval combining CLIP-based features
    Baldrati, Alberto
    Bertini, Marco
    Uricchio, Tiberio
    Del Bimbo, Alberto
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 21434 - 21442
  • [27] CATIRI: An Efficient Method for Content-and-Text Based Image Retrieval
    Mengqi Zeng
    Bin Yao
    Zhi-Jie Wang
    Yanyan Shen
    Feifei Li
    Jianfeng Zhang
    Hao Lin
    Minyi Guo
    Journal of Computer Science and Technology, 2019, 34 : 287 - 304
  • [28] Predicting Visual Features From Text for Image and Video Caption Retrieval
    Dong, Jianfeng
    Li, Xirong
    Snoek, Cees G. M.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (12) : 3377 - 3388
  • [29] CATIRI: An Efficient Method for Content-and-Text Based Image Retrieval
    Zeng, Mengqi
    Yao, Bin
    Wang, Zhi-Jie
    Shen, Yanyan
    Li, Feifei
    Zhang, Jianfeng
    Lin, Hao
    Guo, Minyi
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2019, 34 (02) : 287 - 304
  • [30] Text and Content Based Image Retrieval Via Locality Sensitive Hashing
    Zhang, Nan
    Man, Ka Lok
    Yu, Tianlin
    Lei, Chi-Un
    ENGINEERING LETTERS, 2011, 19 (03) : 228 - 234