Image parsing: Unifying segmentation, detection, and recognition

被引:260
|
作者
Tu, ZW [1 ]
Chen, XG
Yuille, AL
Zhu, SC
机构
[1] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Psychol, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
image parsing; image segmentation; object detection; object recognition; data driven Markov Chain Monte Carlo; AdaBoost;
D O I
10.1007/s11263-005-6642-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a Bayesian framework for parsing images into their constituent visual patterns. The parsing algorithm optimizes the posterior probability and outputs a scene representation as a "parsing graph", in a spirit similar to parsing sentences in speech and natural language. The algorithm constructs the parsing graph and re-configures it dynamically using a set of moves, which are mostly reversible Markov chain jumps. This computational framework integrates two popular inference approaches-generative (top-down) methods and discriminative (bottom-up) methods. The former formulates the posterior probability in terms of generative models for images defined by likelihood functions and priors. The latter computes discriminative probabilities based on a sequence (cascade) of bottom-up tests/filters. In our Markov chain algorithm design, the posterior probability, defined by the generative models, is the invariant (target) probability for the Markov chain, and the discriminative probabilities are used to construct proposal probabilities to drive the Markov chain. Intuitively, the bottom-up discriminative probabilities activate top-down generative models. In this paper, we focus on two types of visual patterns-generic visual patterns, such as texture and shading, and object patterns including human faces and text. These types of patterns compete and cooperate to explain the image and so image parsing unifies image segmentation, object detection, and recognition of we use generic visual patterns only then image parsing will correspond to image segmentation (Tu and Zhu, 2002. IEEE Trans. PAM1, 24(5):657-673). We illustrate our algorithm on natural images of complex city scenes and show examples where image segmentation can be improved by allowing object specific knowledge to disambiguate low-level segmentation cues, and conversely where object detection can be improved by using generic visual patterns to explain away shadows and occlusions.
引用
收藏
页码:113 / 140
页数:28
相关论文
共 50 条
  • [21] Salient region detection and segmentation for general object recognition and image understanding
    TieJun Huang
    YongHong Tian
    Jia Li
    HaoNan Yu
    Science China Information Sciences, 2011, 54 : 2461 - 2470
  • [22] Salient region detection and segmentation for general object recognition and image understanding
    HUANG TieJun 1
    2 Key Laboratory of Intelligent Information Processing
    ScienceChina(InformationSciences), 2011, 54 (12) : 2481 - 2490
  • [23] Image Segmentation For Fingerprint Recognition
    El-Hajj-Chehade, Wassim
    Kader, Riham Abdel
    Kassem, Rola
    El-Zaart, Ali
    PROCEEDINGS OF 2018 IEEE APPLIED SIGNAL PROCESSING CONFERENCE (ASPCON), 2018, : 314 - 319
  • [24] BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video
    Athar, Ali
    Luiten, Jonathon
    Voigtlaender, Paul
    Khurana, Tarasha
    Dave, Achal
    Leibe, Bastian
    Ramanan, Deva
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 1674 - 1683
  • [25] Clothing Co-Parsing by Joint Image Segmentation and Labeling
    Yang, Wei
    Luo, Ping
    Lin, Liang
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3182 - 3189
  • [26] A UNIFYING MODEL FOR LOOKAHEAD LR PARSING
    BERMUDEZ, ME
    COMPUTER LANGUAGES, 1991, 16 (02): : 167 - 178
  • [27] Fully Convolutional Network with Superpixel Parsing for Fashion Web Image Segmentation
    Yang, Lixuan
    Rodriguez, Helena
    Crucianu, Michel
    Ferecatu, Marin
    MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 : 139 - 151
  • [28] Novel Technique for Image Segmentation Based on Grammar Parsing and Hilbert Transform
    Hamdi, Salah
    Ben Abdallah, Asma
    Bedoui, Mohamed Hedi
    IMAGE ANALYSIS AND RECOGNITION, 2013, 7950 : 346 - 353
  • [29] A Survey on Image Segmentation for Handwriting Recognition
    Dutta, Prarthana
    Muppalaneni, Naresh Babu
    THIRD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND CAPSULE NETWORKS (ICIPCN 2022), 2022, 514 : 491 - 506
  • [30] Highlight area recognition and image segmentation
    Zhou, Xiaokuan
    Zhou, Fugen
    Zhu, Xiaoyu
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 1996, 22 (04): : 495 - 499