Image parsing: Unifying segmentation, detection, and recognition

被引：260

作者：

Tu, ZW ^{[1
]}

Chen, XG

Yuille, AL

Zhu, SC

机构：

[1] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA

[2] Univ Calif Los Angeles, Dept Psychol, Los Angeles, CA 90095 USA

[3] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2005年 / 63卷 / 02期

基金：

美国国家卫生研究院; 美国国家科学基金会;

关键词：

image parsing; image segmentation; object detection; object recognition; data driven Markov Chain Monte Carlo; AdaBoost;

D O I：

10.1007/s11263-005-6642-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we present a Bayesian framework for parsing images into their constituent visual patterns. The parsing algorithm optimizes the posterior probability and outputs a scene representation as a "parsing graph", in a spirit similar to parsing sentences in speech and natural language. The algorithm constructs the parsing graph and re-configures it dynamically using a set of moves, which are mostly reversible Markov chain jumps. This computational framework integrates two popular inference approaches-generative (top-down) methods and discriminative (bottom-up) methods. The former formulates the posterior probability in terms of generative models for images defined by likelihood functions and priors. The latter computes discriminative probabilities based on a sequence (cascade) of bottom-up tests/filters. In our Markov chain algorithm design, the posterior probability, defined by the generative models, is the invariant (target) probability for the Markov chain, and the discriminative probabilities are used to construct proposal probabilities to drive the Markov chain. Intuitively, the bottom-up discriminative probabilities activate top-down generative models. In this paper, we focus on two types of visual patterns-generic visual patterns, such as texture and shading, and object patterns including human faces and text. These types of patterns compete and cooperate to explain the image and so image parsing unifies image segmentation, object detection, and recognition of we use generic visual patterns only then image parsing will correspond to image segmentation (Tu and Zhu, 2002. IEEE Trans. PAM1, 24(5):657-673). We illustrate our algorithm on natural images of complex city scenes and show examples where image segmentation can be improved by allowing object specific knowledge to disambiguate low-level segmentation cues, and conversely where object detection can be improved by using generic visual patterns to explain away shadows and occlusions.

引用

页码：113 / 140

页数：28

共 50 条

[41] A SYNTACTIC METHOD FOR IMAGE SEGMENTATION AND OBJECT RECOGNITION
DON, HS
FU, KS
PATTERN RECOGNITION, 1985, 18 (01) : 73 - 87
[42] Segmentation and recognition in sub-image system
Tung, GS
Liu, JH
Lin, YN
Sun, ZY
Wang, QQ
ELECTRONIC IMAGING AND MULTIMEDIA SYSTEMS II, 1998, 3561 : 135 - 140
[43] Image segmentation and object recognition by Bayesian grouping
Kalitzin, SN
Staal, JJ
Romeny, BMT
Viergever, MA
2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 580 - 583
[44] Color Recognition Method Based on Image Segmentation
Hua, Minghui
Zhou, Haixiang
Li, Wanlei
Lou, Yunjiang
INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2021, PT IV, 2021, 13016 : 576 - 586
[45] Adaptive integrated image segmentation and object recognition
Bhanu, B
Peng, J
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2000, 30 (04): : 427 - 441
[46] Leaf Recognition and Segmentation by Using Depth Image
Shao, Xiaowei
Shi, Yun
Wu, Wenbing
Yang, Peng
Chen, Zhongxin
Shibasaki, Ryosuke
THIRD INTERNATIONAL CONFERENCE ON AGRO-GEOINFORMATICS (AGRO-GEOINFORMATICS 2014), 2014, : 195 - 198
[47] Research on segmentation and recognition of marrow cells image
Hou Zhenjie
Ma Shuoshi
Pei Xichun
Pan Xin
ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 2, 2006, : 329 - +
[48] Adaptive classification for image segmentation and target recognition
Bargel, B
Bers, KH
Jäger, K
Schwan, G
AUTOMATIC TARGET RECOGNITION XII, 2002, 4726 : 230 - 240
[49] Thermal Hand Image Segmentation for Biometric Recognition
Font-Aragones, Xavier
Faundez-Zanuy, Marcos
Mekyska, Jiri
IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2013, 28 (06) : 4 - 14
[50] A segmentation and recognition method of rigid image targets
Cao, Jian
Li, Haisheng
Cai, Qiang
Guo, Shilong
Research Journal of Applied Sciences, Engineering and Technology, 2012, 4 (16) : 2728 - 2734

← 1 2 3 4 5 →