Nonparametric Scene Parsing via Label Transfer

被引：241

作者：

Liu, Ce ^{[1
,2
]}

Yuen, Jenny ^{[2
]}

Torralba, Antonio ^{[2
]}

机构：

[1] Microsoft Res New England, Cambridge, MA 02142 USA

[2] MIT, CSAIL, Cambridge, MA 02139 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2011年 / 33卷 / 12期

基金：

美国国家科学基金会;

关键词：

Object recognition; scene parsing; label transfer; SIFT flow; Markov random fields; OBJECT; TEXTURE;

D O I：

10.1109/TPAMI.2011.131

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

While there has been a lot of recent work on object recognition and image understanding, the focus has been on carefully establishing mathematical models for images, scenes, and objects. In this paper, we propose a novel, nonparametric approach for object recognition and scene parsing using a new technology we name label transfer. For an input image, our system first retrieves its nearest neighbors from a large database containing fully annotated images. Then, the system establishes dense correspondences between the input image and each of the nearest neighbors using the dense SIFT flow algorithm [28], which aligns two images based on local image structures. Finally, based on the dense scene correspondences obtained from SIFT flow, our system warps the existing annotations and integrates multiple cues in a Markov random field framework to segment and recognize the query image. Promising experimental results have been achieved by our nonparametric scene parsing system on challenging databases. Compared to existing object recognition approaches that require training classifiers or appearance models for each object category, our system is easy to implement, has few parameters, and embeds contextual information naturally in the retrieval/alignment procedure.

引用

页码：2368 / 2382

页数：15

共 50 条

[41] RANUS: RGB and NIR Urban Scene Dataset for Deep Scene Parsing
Choe, Gyeongmin
Kim, Seong-Heum
Im, Sunghoon
Lee, Joon-Young
Narasimhan, Srinivasa G.
Kweon, In So
IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (03): : 1808 - 1815
[42] VRT-Net: Real-Time Scene Parsing via Variable Resolution Transform
Kundu, Jogendra Nath
Rajput, Gaurav Singh
Babu, R. Venkatesh
2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 2038 - 2045
[43] Enhancing scene parsing by transferring structures via efficient low-rank graph matching
Yu, Tianshu
Wang, Ruisheng
24TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2016), 2016,
[44] Weakly supervised image parsing via label propagation over discriminatively semantic graph
Xu, Xiaocheng
Ma, Jun
Nie, Liqiang
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 40 : 808 - 815
[45] Padding Investigations for CNNs in Scene Parsing Tasks
Huang, Yu-Hui
Proesmans, Marc
Van Gool, Luc
2023 18TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, MVA, 2023,
[46] HSNet: hierarchical semantics network for scene parsing
Tan, Xin
Xu, Jiachen
Cao, Ying
Xu, Ke
Ma, Lizhuang
Lau, Rynson W. H.
VISUAL COMPUTER, 2023, 39 (07): : 2543 - 2554
[47] HSNet: hierarchical semantics network for scene parsing
Xin Tan
Jiachen Xu
Ying Cao
Ke Xu
Lizhuang Ma
Rynson W. H. Lau
The Visual Computer, 2023, 39 : 2543 - 2554
[48] Global Aggregation Then Local Distribution for Scene Parsing
Li, Xiangtai
Zhang, Li
Cheng, Guangliang
Yang, Kuiyuan
Tong, Yunhai
Zhu, Xiatian
Xiang, Tao
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 6829 - 6842
[49] Predicting Scene Parsing and Motion Dynamics in the Future
Jin, Xiaojie
Xiao, Huaxin
Shen, Xiaohui
Yang, Jimei
Lin, Zhe
Chen, Yunpeng
Jie, Zequn
Feng, Jiashi
Yan, Shuicheng
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[50] Video Scene Parsing with Predictive Feature Learning
Jin, Xiaojie
Li, Xin
Xiao, Huaxin
Shen, Xiaohui
Lin, Zhe
Yang, Jimei
Chen, Yunpeng
Dong, Jian
Liu, Luoqi
Jie, Zequn
Feng, Jiashi
Yan, Shuicheng
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5581 - 5589

← 1 2 3 4 5 →