Spatial-Semantic Image Search by Visual Feature Synthesis

被引:21
|
作者
Mai, Long [1 ]
Jin, Hailin [2 ]
Lin, Zhe [2 ]
Fang, Chen [2 ]
Brandt, Jonathan [2 ]
Liu, Feng [1 ]
机构
[1] Portland State Univ, Portland, OR 97207 USA
[2] Adobe Res, San Jose, CA USA
基金
美国国家科学基金会;
关键词
OF-THE-ART; RETRIEVAL;
D O I
10.1109/CVPR.2017.125
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The performance of image retrieval has been improved tremendously in recent years through the use of deep feature representations. Most existing methods, however, aim to retrieve images that are visually similar or semantically relevant to the query, irrespective of spatial configuration. In this paper, we develop a spatial-semantic image search technology that enables users to search for images with both semantic and spatial constraints by manipulating concept text-boxes on a 2D query canvas. We train a convolutional neural network to synthesize appropriate visual features that captures the spatial-semantic constraints from the user canvas query. We directly optimize the retrieval performance of the visual features when training our deep neural network. These visual features then are used to retrieve images that are both spatially and semantically relevant to the user query. The experiments on large-scale datasets such as MS-COCO and Visual Genome show that our method outperforms other baseline and state-of-the-art methods in spatial-semantic image search.
引用
收藏
页码:1121 / 1130
页数:10
相关论文
共 50 条
  • [1] SPATIAL-SEMANTIC ATTENTION FOR GROUNDED IMAGE CAPTIONING
    Hu, Wenzhe
    Wang, Lanxiao
    Xu, Linfeng
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 61 - 65
  • [2] Efficient and interactive spatial-semantic image retrieval
    Ryosuke Furuta
    Naoto Inoue
    Toshihiko Yamasaki
    Multimedia Tools and Applications, 2019, 78 : 18713 - 18733
  • [3] Efficient and Interactive Spatial-Semantic Image Retrieval
    Furuta, Ryosuke
    Inoue, Naoto
    Yamasaki, Toshihiko
    MULTIMEDIA MODELING, MMM 2018, PT I, 2018, 10704 : 190 - 202
  • [4] Efficient and interactive spatial-semantic image retrieval
    Furuta, Ryosuke
    Inoue, Naoto
    Yamasaki, Toshihiko
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (13) : 18713 - 18733
  • [5] Automated Spatial-Semantic Modeling with Applications to Place Labeling and Informed Search
    Viswanathan, Pooja
    Meger, David
    Southey, Tristram
    Little, James J.
    Mackworth, Alan
    2009 CANADIAN CONFERENCE ON COMPUTER AND ROBOT VISION, 2009, : 284 - 291
  • [6] Optimized voxel transformer for 3D detection with spatial-semantic feature aggregation
    Li, Yingfei
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 112
  • [7] Individual differences in a spatial-semantic virtual environment
    Chen, CM
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 2000, 51 (06): : 529 - 542
  • [8] Geometric Boundary Guided Feature Fusion and Spatial-Semantic Context Aggregation for Semantic Segmentation of Remote Sensing Images
    Wang, Yupei
    Zhang, Haoran
    Hu, Yongkang
    Hu, Xiaoxing
    Chen, Liang
    Hu, Shanqing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 6373 - 6385
  • [9] Bi-Directional Spatial-Semantic Attention Networks for Image-Text Matching
    Huang, Feiran
    Zhang, Xiaoming
    Zhao, Zhonghua
    Li, Zhoujun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (04) : 2008 - 2020
  • [10] Spatial-semantic display processing: The role of spatial structure on recall
    Newbern, D
    Dansereau, DF
    Patterson, ME
    CONTEMPORARY EDUCATIONAL PSYCHOLOGY, 1997, 22 (03) : 319 - 337