Image parsing: Unifying segmentation, detection, and recognition

被引:260
|
作者
Tu, ZW [1 ]
Chen, XG
Yuille, AL
Zhu, SC
机构
[1] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Psychol, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
image parsing; image segmentation; object detection; object recognition; data driven Markov Chain Monte Carlo; AdaBoost;
D O I
10.1007/s11263-005-6642-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a Bayesian framework for parsing images into their constituent visual patterns. The parsing algorithm optimizes the posterior probability and outputs a scene representation as a "parsing graph", in a spirit similar to parsing sentences in speech and natural language. The algorithm constructs the parsing graph and re-configures it dynamically using a set of moves, which are mostly reversible Markov chain jumps. This computational framework integrates two popular inference approaches-generative (top-down) methods and discriminative (bottom-up) methods. The former formulates the posterior probability in terms of generative models for images defined by likelihood functions and priors. The latter computes discriminative probabilities based on a sequence (cascade) of bottom-up tests/filters. In our Markov chain algorithm design, the posterior probability, defined by the generative models, is the invariant (target) probability for the Markov chain, and the discriminative probabilities are used to construct proposal probabilities to drive the Markov chain. Intuitively, the bottom-up discriminative probabilities activate top-down generative models. In this paper, we focus on two types of visual patterns-generic visual patterns, such as texture and shading, and object patterns including human faces and text. These types of patterns compete and cooperate to explain the image and so image parsing unifies image segmentation, object detection, and recognition of we use generic visual patterns only then image parsing will correspond to image segmentation (Tu and Zhu, 2002. IEEE Trans. PAM1, 24(5):657-673). We illustrate our algorithm on natural images of complex city scenes and show examples where image segmentation can be improved by allowing object specific knowledge to disambiguate low-level segmentation cues, and conversely where object detection can be improved by using generic visual patterns to explain away shadows and occlusions.
引用
收藏
页码:113 / 140
页数:28
相关论文
共 50 条
  • [41] A SYNTACTIC METHOD FOR IMAGE SEGMENTATION AND OBJECT RECOGNITION
    DON, HS
    FU, KS
    PATTERN RECOGNITION, 1985, 18 (01) : 73 - 87
  • [42] Segmentation and recognition in sub-image system
    Tung, GS
    Liu, JH
    Lin, YN
    Sun, ZY
    Wang, QQ
    ELECTRONIC IMAGING AND MULTIMEDIA SYSTEMS II, 1998, 3561 : 135 - 140
  • [43] Image segmentation and object recognition by Bayesian grouping
    Kalitzin, SN
    Staal, JJ
    Romeny, BMT
    Viergever, MA
    2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 580 - 583
  • [44] Color Recognition Method Based on Image Segmentation
    Hua, Minghui
    Zhou, Haixiang
    Li, Wanlei
    Lou, Yunjiang
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2021, PT IV, 2021, 13016 : 576 - 586
  • [45] Adaptive integrated image segmentation and object recognition
    Bhanu, B
    Peng, J
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2000, 30 (04): : 427 - 441
  • [46] Leaf Recognition and Segmentation by Using Depth Image
    Shao, Xiaowei
    Shi, Yun
    Wu, Wenbing
    Yang, Peng
    Chen, Zhongxin
    Shibasaki, Ryosuke
    THIRD INTERNATIONAL CONFERENCE ON AGRO-GEOINFORMATICS (AGRO-GEOINFORMATICS 2014), 2014, : 195 - 198
  • [47] Research on segmentation and recognition of marrow cells image
    Hou Zhenjie
    Ma Shuoshi
    Pei Xichun
    Pan Xin
    ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 2, 2006, : 329 - +
  • [48] Adaptive classification for image segmentation and target recognition
    Bargel, B
    Bers, KH
    Jäger, K
    Schwan, G
    AUTOMATIC TARGET RECOGNITION XII, 2002, 4726 : 230 - 240
  • [49] Thermal Hand Image Segmentation for Biometric Recognition
    Font-Aragones, Xavier
    Faundez-Zanuy, Marcos
    Mekyska, Jiri
    IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2013, 28 (06) : 4 - 14
  • [50] A segmentation and recognition method of rigid image targets
    Cao, Jian
    Li, Haisheng
    Cai, Qiang
    Guo, Shilong
    Research Journal of Applied Sciences, Engineering and Technology, 2012, 4 (16) : 2728 - 2734