Spatial-Semantic Image Search by Visual Feature Synthesis

被引:21
|
作者
Mai, Long [1 ]
Jin, Hailin [2 ]
Lin, Zhe [2 ]
Fang, Chen [2 ]
Brandt, Jonathan [2 ]
Liu, Feng [1 ]
机构
[1] Portland State Univ, Portland, OR 97207 USA
[2] Adobe Res, San Jose, CA USA
基金
美国国家科学基金会;
关键词
OF-THE-ART; RETRIEVAL;
D O I
10.1109/CVPR.2017.125
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The performance of image retrieval has been improved tremendously in recent years through the use of deep feature representations. Most existing methods, however, aim to retrieve images that are visually similar or semantically relevant to the query, irrespective of spatial configuration. In this paper, we develop a spatial-semantic image search technology that enables users to search for images with both semantic and spatial constraints by manipulating concept text-boxes on a 2D query canvas. We train a convolutional neural network to synthesize appropriate visual features that captures the spatial-semantic constraints from the user canvas query. We directly optimize the retrieval performance of the visual features when training our deep neural network. These visual features then are used to retrieve images that are both spatially and semantically relevant to the user query. The experiments on large-scale datasets such as MS-COCO and Visual Genome show that our method outperforms other baseline and state-of-the-art methods in spatial-semantic image search.
引用
收藏
页码:1121 / 1130
页数:10
相关论文
共 50 条
  • [41] SaFe: A general framework for integrated spatial and feature image search
    Smith, JR
    Chang, SF
    1997 IEEE FIRST WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1997, : 301 - 306
  • [42] Image Feature Matching Based on Semantic Fusion Description and Spatial Consistency
    Zhang, Wei
    Zhang, Guoying
    SYMMETRY-BASEL, 2018, 10 (12):
  • [43] Spatial Structure Preserving Feature Pyramid Network for Semantic Image Segmentation
    Yuan, Yuan
    Fang, Jie
    Lu, Xiaoqiang
    Feng, Yachuang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (03)
  • [44] Deep Object Co-Segmentation via Spatial-Semantic Network Modulation
    Zhang, Kaihua
    Chen, Jin
    Liu, Bo
    Liu, Qingshan
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12813 - 12820
  • [45] SALIENCY-AWARE SEMANTIC IMAGE CODING FOR MOBILE VISUAL SEARCH
    Sun, Cuirong
    Li, Houqiang
    Li, Weiping
    2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 544 - 548
  • [46] Identifying vehicle types from trajectory data based on spatial-semantic information
    Zhang, Yunfei
    Xie, Yajun
    Shi, Chaoyang
    Li, Qiuping
    Yang, Bisheng
    Hao, Wei
    GEO-SPATIAL INFORMATION SCIENCE, 2024,
  • [47] Three-dimensional object detection with spatial-semantic features of point clouds
    Chen, Tianxiang
    Han, Chao
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (05)
  • [48] SKGHOI: Spatial-Semantic Knowledge Graph for Human-Object Interaction Detection
    Zhu, Lijing
    Lan, Qizhen
    Velasquez, Alvaro
    Song, Houbing
    Kamal, Acharya
    Tian, Qing
    Niu, Shuteng
    2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, : 1186 - 1193
  • [49] Evidence for Negative Feature Guidance in Visual Search Is Explained by Spatial Recoding
    Beck, Valerie M.
    Hollingworth, Andrew
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2015, 41 (05) : 1190 - 1196
  • [50] Visual Saliency Fusion Based Multi-feature for Semantic Image Retrieval
    Chen, Jianan
    Bai, Cong
    Huang, Ling
    Liu, Zhi
    Chen, Shengyong
    COMPUTER VISION, PT II, 2017, 772 : 126 - 136