Nonparametric Scene Parsing via Label Transfer

被引:241
|
作者
Liu, Ce [1 ,2 ]
Yuen, Jenny [2 ]
Torralba, Antonio [2 ]
机构
[1] Microsoft Res New England, Cambridge, MA 02142 USA
[2] MIT, CSAIL, Cambridge, MA 02139 USA
基金
美国国家科学基金会;
关键词
Object recognition; scene parsing; label transfer; SIFT flow; Markov random fields; OBJECT; TEXTURE;
D O I
10.1109/TPAMI.2011.131
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While there has been a lot of recent work on object recognition and image understanding, the focus has been on carefully establishing mathematical models for images, scenes, and objects. In this paper, we propose a novel, nonparametric approach for object recognition and scene parsing using a new technology we name label transfer. For an input image, our system first retrieves its nearest neighbors from a large database containing fully annotated images. Then, the system establishes dense correspondences between the input image and each of the nearest neighbors using the dense SIFT flow algorithm [28], which aligns two images based on local image structures. Finally, based on the dense scene correspondences obtained from SIFT flow, our system warps the existing annotations and integrates multiple cues in a Markov random field framework to segment and recognize the query image. Promising experimental results have been achieved by our nonparametric scene parsing system on challenging databases. Compared to existing object recognition approaches that require training classifiers or appearance models for each object category, our system is easy to implement, has few parameters, and embeds contextual information naturally in the retrieval/alignment procedure.
引用
收藏
页码:2368 / 2382
页数:15
相关论文
共 50 条
  • [41] RANUS: RGB and NIR Urban Scene Dataset for Deep Scene Parsing
    Choe, Gyeongmin
    Kim, Seong-Heum
    Im, Sunghoon
    Lee, Joon-Young
    Narasimhan, Srinivasa G.
    Kweon, In So
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (03): : 1808 - 1815
  • [42] VRT-Net: Real-Time Scene Parsing via Variable Resolution Transform
    Kundu, Jogendra Nath
    Rajput, Gaurav Singh
    Babu, R. Venkatesh
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2038 - 2045
  • [43] Enhancing scene parsing by transferring structures via efficient low-rank graph matching
    Yu, Tianshu
    Wang, Ruisheng
    24TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2016), 2016,
  • [44] Weakly supervised image parsing via label propagation over discriminatively semantic graph
    Xu, Xiaocheng
    Ma, Jun
    Nie, Liqiang
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 40 : 808 - 815
  • [45] Padding Investigations for CNNs in Scene Parsing Tasks
    Huang, Yu-Hui
    Proesmans, Marc
    Van Gool, Luc
    2023 18TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, MVA, 2023,
  • [46] HSNet: hierarchical semantics network for scene parsing
    Tan, Xin
    Xu, Jiachen
    Cao, Ying
    Xu, Ke
    Ma, Lizhuang
    Lau, Rynson W. H.
    VISUAL COMPUTER, 2023, 39 (07): : 2543 - 2554
  • [47] HSNet: hierarchical semantics network for scene parsing
    Xin Tan
    Jiachen Xu
    Ying Cao
    Ke Xu
    Lizhuang Ma
    Rynson W. H. Lau
    The Visual Computer, 2023, 39 : 2543 - 2554
  • [48] Global Aggregation Then Local Distribution for Scene Parsing
    Li, Xiangtai
    Zhang, Li
    Cheng, Guangliang
    Yang, Kuiyuan
    Tong, Yunhai
    Zhu, Xiatian
    Xiang, Tao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 6829 - 6842
  • [49] Predicting Scene Parsing and Motion Dynamics in the Future
    Jin, Xiaojie
    Xiao, Huaxin
    Shen, Xiaohui
    Yang, Jimei
    Lin, Zhe
    Chen, Yunpeng
    Jie, Zequn
    Feng, Jiashi
    Yan, Shuicheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [50] Video Scene Parsing with Predictive Feature Learning
    Jin, Xiaojie
    Li, Xin
    Xiao, Huaxin
    Shen, Xiaohui
    Lin, Zhe
    Yang, Jimei
    Chen, Yunpeng
    Dong, Jian
    Liu, Luoqi
    Jie, Zequn
    Feng, Jiashi
    Yan, Shuicheng
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5581 - 5589