Nonparametric Scene Parsing via Label Transfer

被引：241

作者：

Liu, Ce ^{[1
,2
]}

Yuen, Jenny ^{[2
]}

Torralba, Antonio ^{[2
]}

机构：

[1] Microsoft Res New England, Cambridge, MA 02142 USA

[2] MIT, CSAIL, Cambridge, MA 02139 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2011年 / 33卷 / 12期

基金：

美国国家科学基金会;

关键词：

Object recognition; scene parsing; label transfer; SIFT flow; Markov random fields; OBJECT; TEXTURE;

D O I：

10.1109/TPAMI.2011.131

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

While there has been a lot of recent work on object recognition and image understanding, the focus has been on carefully establishing mathematical models for images, scenes, and objects. In this paper, we propose a novel, nonparametric approach for object recognition and scene parsing using a new technology we name label transfer. For an input image, our system first retrieves its nearest neighbors from a large database containing fully annotated images. Then, the system establishes dense correspondences between the input image and each of the nearest neighbors using the dense SIFT flow algorithm [28], which aligns two images based on local image structures. Finally, based on the dense scene correspondences obtained from SIFT flow, our system warps the existing annotations and integrates multiple cues in a Markov random field framework to segment and recognize the query image. Promising experimental results have been achieved by our nonparametric scene parsing system on challenging databases. Compared to existing object recognition approaches that require training classifiers or appearance models for each object category, our system is easy to implement, has few parameters, and embeds contextual information naturally in the retrieval/alignment procedure.

引用

页码：2368 / 2382

页数：15

共 50 条

[31] Unified Perceptual Parsing for Scene Understanding
Xiao, Tete
Liu, Yingcheng
Zhou, Bolei
Jiang, Yuning
Sun, Jian
COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 432 - 448
[32] Consensus Feature Network for Scene Parsing
Wu, Tianyi
Tang, Sheng
Zhang, Rui
Guo, Guodong
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 24 : 3208 - 3217
[33] Label transfer via sparse representation
An, Taeg-Hyun
Hong, Ki-Sang
PATTERN RECOGNITION LETTERS, 2016, 70 : 1 - 7
[34] Interaction via Bi-directional Graph of Semantic Region Affinity for Scene Parsing
Ding, Henghui
Zhang, Hui
Liu, Jun
Li, Jiaxin
Feng, Zijian
Jiang, Xudong
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15828 - 15838
[35] Interactive Natural Language Grounding via Referring Expression Comprehension and Scene Graph Parsing
Mi, Jinpeng
Lyu, Jianzhi
Tang, Song
Li, Qingdu
Zhang, Jianwei
FRONTIERS IN NEUROROBOTICS, 2020, 14
[36] Cross-Domain Human Parsing via Adversarial Feature and Label Adaptation
Liu, Si
Sun, Yao
Zhu, Defa
Ren, Guanghui
Chen, Yu
Feng, Jiashi
Han, Jizhong
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7146 - 7153
[37] Graphonomy: Universal Human Parsing via Graph Transfer Learning
Gong, Ke
Gao, Yiming
Liang, Xiaodan
Shen, Xiaohui
Wang, Meng
Lin, Liang
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7442 - 7451
[38] Graphonomy: Universal Image Parsing via Graph Reasoning and Transfer
Lin, Liang
Gao, Yiming
Gong, Ke
Wang, Meng
Liang, Xiaodan
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (05) : 2504 - 2518
[39] Improving Segmentation Boundaries with Nonparametric Image Parsing
Pan, Hong
Lang, Jochen
2015 12TH CONFERENCE ON COMPUTER AND ROBOT VISION CRV 2015, 2015, : 328 - 335
[40] SuperParsing: Scalable Nonparametric Image Parsing with Superpixels
Tighe, Joseph
Lazebnik, Svetlana
COMPUTER VISION-ECCV 2010, PT V, 2010, 6315 : 352 - 365

← 1 2 3 4 5 →