Leveraging Style and Content features for Text Conditioned Image Retrieval

被引:4
|
作者
Chawla, Pranit [1 ]
Jandial, Surgan [2 ]
Badjatiya, Pinkesh [3 ]
Chopra, Ayush [4 ]
Sarkar, Mausoom [3 ]
Krishnamurthy, Balaji [3 ]
机构
[1] IIT Kharagpur, Kharagpur, W Bengal, India
[2] IIT Hyderabad, Kandi, Telangana, India
[3] Adobe, Media & Data Sci Res Lab, San Jose, CA USA
[4] MIT, Cambridge, MA 02139 USA
关键词
D O I
10.1109/CVPRW53098.2021.00448
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image Search is a fundamental task playing a significant role in the success of wide variety of frameworks and applications. However, with the increasing sizes of product catalogues and the number of attributes per product, it has become difficult for users to express their needs effectively. Therefore, we focus on the problem of Image Retrieval with Text Feedback, which involves retrieving modified images according to the natural language feedback provided by users. In this work, we hypothesise that since an image can be delineated by its content and style features, modifications to the image can also take place in the two sub spaces respectively. Hence, we decompose an input image into its corresponding style and content features, apply modification of the text feedback individually in both the style and content spaces and finally fuse them for retrieval. Our experiments show that our approach outperforms a recent state of the art method in this task, TIRG, that seeks to use a single vector in contrast to leveraging the modification via text over style and content spaces separately.
引用
收藏
页码:3973 / 3977
页数:5
相关论文
共 50 条
  • [1] CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback
    Lee, Seungmin
    Kim, Dongwan
    Han, Bohyung
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 802 - 812
  • [2] Interactive Image Retrieval Using Text and Image Content
    Dinakaran, B.
    Annapurna, J.
    Kumar, Ch. Aswani
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2010, 10 (03) : 20 - 30
  • [3] Query expansion by text and image features in image retrieval
    Zhou, H
    Chan, SY
    Kok, FL
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 1998, 9 (04) : 287 - 299
  • [4] Text-Image Retrieval With Salient Features
    Feng, Xia
    Hu, Zhiyi
    Liu, Caihua
    Ip, W. H.
    Chen, Huiying
    JOURNAL OF DATABASE MANAGEMENT, 2021, 32 (04) : 1 - 13
  • [5] Image-retrieval agent: integrating image content and text
    Favela, J
    Meza, V
    IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1999, 14 (05): : 36 - 39
  • [6] Combining Image and Text Features for Medicinal Plants Image Retrieval
    Madam, Oki
    Herdiyeni, Yeni
    2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2013, : 273 - 277
  • [7] LEVERAGING IMPLICIT SPATIAL INFORMATION IN GLOBAL FEATURES FOR IMAGE RETRIEVAL
    Jacob, Pierre
    Picard, David
    Histace, Aymeric
    Klein, Edouard
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2002 - 2006
  • [8] SAC: Semantic Attention Composition for Text-Conditioned Image Retrieval
    Jandial, Surgan
    Badjatiya, Pinkesh
    Chawla, Pranit
    Chopra, Ayush
    Sarkar, Mausoom
    Krishnamurthy, Balaji
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 597 - 606
  • [9] Weighted Semantic Fusion of Text and Content for Image Retrieval
    Goel, Nidhi
    Sehgal, Priti
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 681 - 687
  • [10] Image Features Optimizing for Content-Based Image Retrieval
    Shi, Zhiping
    Liu, Xi
    He, Qing
    Shi, Zhongzhi
    2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 4, 2009, : 260 - 264