Hierarchical semantic image matching using CNN feature pyramid

被引:29
|
作者
Yu, Wei [1 ]
Sun, Xiaoshuai [1 ]
Yang, Kuiyuan [2 ]
Rui, Yong [2 ]
Yao, Hongxun [1 ]
机构
[1] Harbin Inst Technol, Sch Comp & Technol, Harbin 150090, Heilongjiang, Peoples R China
[2] Microsoft Res, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
CNN feature; Image matching; Hierarchical framework; Dense correspondence; Visualization; MODELS; FLOW; STEREO;
D O I
10.1016/j.cviu.2018.01.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image matching remains an important and challenging problem in computer vision, especially for the dense correspondence estimation between images with high category-level similarity. The effectiveness of image matching largely depends on the advance of image descriptors. Inspired by the success of Convolutional Neural Network (CNN), we propose a hierarchal image matching method using the CNN feature pyramid, named as CNN Flow. The feature maps output by different layers of CNN tend to encode different information of the input image, such as the semantic information extracted from higher layers and the structural information extracted from lower layers. This nature of CNN feature pyramid is suitable to build the hierarchical image matching framework, which detects the patterns of different levels in an implicit coarse-to-fine manner. In particular, we take advantage of the complementarity of different layers using guidance from higher layer to lower layer. The high-layer features present semantic patterns to cope with the intra-class variations, and the guidance from high layers can resist the semantic ambiguity of low-layer features due to small receptive fields. The bottom-level matching utilize the low-layer features with more structural information to achieve finer matching. On one hand, extensive experiments and analysis demonstrate the superiority of CNN Flow in image dense matching under challenging variations. On the other hand, CNN Flow is demonstrated through various applications, such as fine alignment for intra-class object, scene label transfer and facial expression transfer.
引用
收藏
页码:40 / 51
页数:12
相关论文
共 50 条
  • [21] Semantic Stereo Matching with Pyramid Cost Volumes
    Wu, Zhenyao
    Wu, Xinyi
    Zhang, Xiaoping
    Wang, Song
    Ju, Lili
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7483 - 7492
  • [22] Semantic matching in hierarchical ontologies
    Khan, Sharifullah
    Safyan, Muhammad
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2014, 26 (03) : 247 - 257
  • [23] Image Feature Matching Based on Semantic Fusion Description and Spatial Consistency
    Zhang, Wei
    Zhang, Guoying
    [J]. SYMMETRY-BASEL, 2018, 10 (12):
  • [24] Image Classification Using Sparse Coding and Spatial Pyramid Matching
    Wang, Xiaofang
    Ma, Jun
    Xu, Ming
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON E-EDUCATION, E-BUSINESS AND INFORMATION MANAGEMENT, 2014, 91 : 81 - 84
  • [25] Fast stereo matching using image pyramid for lunar rover
    Li, Haichao
    Li, Feng
    Chen, Liang
    [J]. OPTOELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY V, 2018, 10817
  • [26] A pyramid image coder using Block template Matching algorithm
    Keissarian, F
    Daemi, MF
    [J]. VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2000, PTS 1-3, 2000, 4067 : 568 - 575
  • [27] HIERARCHICAL FEATURE REPRESENTATION OF GEOSPATIAL OBJECTS USING MORPHOLOGICAL PYRAMID EXPLOITATION
    Wang, Jun
    Qin, Qiming
    Ye, Xin
    Gao, Zhongling
    [J]. 2014 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2014, : 1789 - 1792
  • [28] Integrating SIFT and CNN Feature Matching for Partial-Duplicate Image Detection
    Zhou, Zhili
    Wu, Q. M. Jonathan
    Wan, Shaohua
    Sun, Wendi
    Sun, Xingming
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2020, 4 (05): : 593 - 604
  • [29] Hierarchical Feature Aggregation Based on Transformer for Image-Text Matching
    Dong, Xinfeng
    Zhang, Huaxiang
    Zhu, Lei
    Nie, Liqiang
    Liu, Li
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6437 - 6447
  • [30] Deep Semantic Feature Matching
    Ufer, Nikolai
    Ommer, Bjoern
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5929 - 5938