Nonparametric Scene Parsing via Label Transfer

Cited by: 241
Authors
Liu, Ce [1, 2]
Yuen, Jenny [2]
Torralba, Antonio [2]
Affiliations
[1] Microsoft Res New England, Cambridge, MA 02142 USA
[2] MIT, CSAIL, Cambridge, MA 02139 USA
Funding
National Science Foundation (USA)
Keywords
Object recognition; scene parsing; label transfer; SIFT flow; Markov random fields; OBJECT; TEXTURE;
DOI
10.1109/TPAMI.2011.131
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
While there has been a lot of recent work on object recognition and image understanding, the focus has been on carefully establishing mathematical models for images, scenes, and objects. In this paper, we propose a novel, nonparametric approach for object recognition and scene parsing using a new technology we name label transfer. For an input image, our system first retrieves its nearest neighbors from a large database containing fully annotated images. Then, the system establishes dense correspondences between the input image and each of the nearest neighbors using the dense SIFT flow algorithm [28], which aligns two images based on local image structures. Finally, based on the dense scene correspondences obtained from SIFT flow, our system warps the existing annotations and integrates multiple cues in a Markov random field framework to segment and recognize the query image. Promising experimental results have been achieved by our nonparametric scene parsing system on challenging databases. Compared to existing object recognition approaches that require training classifiers or appearance models for each object category, our system is easy to implement, has few parameters, and embeds contextual information naturally in the retrieval/alignment procedure.
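The three stages named in the abstract (retrieval, alignment-based warping, label integration) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the Euclidean distance on global descriptors, and the integer flow convention are all assumptions made for illustration, the flow fields are taken as given rather than computed by SIFT flow, and a per-pixel majority vote stands in for the Markov random field inference described above.

```python
import numpy as np


def retrieve_neighbors(query_desc, db_descs, k):
    """Return indices of the k database images closest to the query.

    Assumption: each image is summarized by one global descriptor and
    similarity is plain Euclidean distance.
    """
    dists = np.linalg.norm(db_descs - query_desc, axis=1)
    return np.argsort(dists)[:k]


def warp_labels(labels, flow):
    """Warp a neighbor's annotation map into the query frame.

    Assumed convention: flow[y, x] = (dy, dx) is an integer displacement,
    so query pixel (y, x) corresponds to neighbor pixel (y + dy, x + dx).
    Coordinates that fall outside the image are clamped to the border.
    """
    h, w = labels.shape
    ys, xs = np.mgrid[0:h, 0:w]
    src_y = np.clip(ys + flow[..., 0], 0, h - 1)
    src_x = np.clip(xs + flow[..., 1], 0, w - 1)
    return labels[src_y, src_x]


def transfer_labels(warped_maps, num_classes):
    """Fuse warped annotation maps by per-pixel majority vote.

    This vote is a simplified stand-in for the MRF, which would also
    combine priors and spatial smoothness.
    """
    votes = np.zeros((num_classes,) + warped_maps[0].shape, dtype=int)
    for warped in warped_maps:
        for c in range(num_classes):
            votes[c] += (warped == c)
    return votes.argmax(axis=0)


# Toy usage: three 2x2 annotation maps, all with zero flow except the last.
db_descs = np.array([[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]])
nearest = retrieve_neighbors(np.array([0.9, 0.9]), db_descs, k=2)

annotations = [
    np.array([[0, 1], [1, 1]]),
    np.array([[0, 1], [0, 1]]),
    np.array([[1, 1], [0, 0]]),
]
zero_flow = np.zeros((2, 2, 2), dtype=int)
warped = [warp_labels(a, zero_flow) for a in annotations]
parsed = transfer_labels(warped, num_classes=2)
```

In this toy setup every database image votes; in a pipeline following the abstract, only the maps of the retrieved neighbors (the indices in `nearest`) would be warped and fused.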
Pages: 2368-2382
Page count: 15
Related papers
50 in total
  • [31] Unified Perceptual Parsing for Scene Understanding
    Xiao, Tete
    Liu, Yingcheng
    Zhou, Bolei
    Jiang, Yuning
    Sun, Jian
    COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 432 - 448
  • [32] Consensus Feature Network for Scene Parsing
    Wu, Tianyi
    Tang, Sheng
    Zhang, Rui
    Guo, Guodong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 24 : 3208 - 3217
  • [33] Label transfer via sparse representation
    An, Taeg-Hyun
    Hong, Ki-Sang
    PATTERN RECOGNITION LETTERS, 2016, 70 : 1 - 7
  • [34] Interaction via Bi-directional Graph of Semantic Region Affinity for Scene Parsing
    Ding, Henghui
    Zhang, Hui
    Liu, Jun
    Li, Jiaxin
    Feng, Zijian
    Jiang, Xudong
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15828 - 15838
  • [35] Interactive Natural Language Grounding via Referring Expression Comprehension and Scene Graph Parsing
    Mi, Jinpeng
    Lyu, Jianzhi
    Tang, Song
    Li, Qingdu
    Zhang, Jianwei
    FRONTIERS IN NEUROROBOTICS, 2020, 14
  • [36] Cross-Domain Human Parsing via Adversarial Feature and Label Adaptation
    Liu, Si
    Sun, Yao
    Zhu, Defa
    Ren, Guanghui
    Chen, Yu
    Feng, Jiashi
    Han, Jizhong
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7146 - 7153
  • [37] Graphonomy: Universal Human Parsing via Graph Transfer Learning
    Gong, Ke
    Gao, Yiming
    Liang, Xiaodan
    Shen, Xiaohui
    Wang, Meng
    Lin, Liang
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7442 - 7451
  • [38] Graphonomy: Universal Image Parsing via Graph Reasoning and Transfer
    Lin, Liang
    Gao, Yiming
    Gong, Ke
    Wang, Meng
    Liang, Xiaodan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (05) : 2504 - 2518
  • [39] Improving Segmentation Boundaries with Nonparametric Image Parsing
    Pan, Hong
    Lang, Jochen
    2015 12TH CONFERENCE ON COMPUTER AND ROBOT VISION CRV 2015, 2015, : 328 - 335
  • [40] SuperParsing: Scalable Nonparametric Image Parsing with Superpixels
    Tighe, Joseph
    Lazebnik, Svetlana
    COMPUTER VISION-ECCV 2010, PT V, 2010, 6315 : 352 - 365