Image parsing: Unifying segmentation, detection, and recognition

被引:260
|
作者
Tu, ZW [1 ]
Chen, XG
Yuille, AL
Zhu, SC
机构
[1] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Psychol, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
image parsing; image segmentation; object detection; object recognition; data driven Markov Chain Monte Carlo; AdaBoost;
D O I
10.1007/s11263-005-6642-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a Bayesian framework for parsing images into their constituent visual patterns. The parsing algorithm optimizes the posterior probability and outputs a scene representation as a "parsing graph", in a spirit similar to parsing sentences in speech and natural language. The algorithm constructs the parsing graph and re-configures it dynamically using a set of moves, which are mostly reversible Markov chain jumps. This computational framework integrates two popular inference approaches-generative (top-down) methods and discriminative (bottom-up) methods. The former formulates the posterior probability in terms of generative models for images defined by likelihood functions and priors. The latter computes discriminative probabilities based on a sequence (cascade) of bottom-up tests/filters. In our Markov chain algorithm design, the posterior probability, defined by the generative models, is the invariant (target) probability for the Markov chain, and the discriminative probabilities are used to construct proposal probabilities to drive the Markov chain. Intuitively, the bottom-up discriminative probabilities activate top-down generative models. In this paper, we focus on two types of visual patterns-generic visual patterns, such as texture and shading, and object patterns including human faces and text. These types of patterns compete and cooperate to explain the image and so image parsing unifies image segmentation, object detection, and recognition of we use generic visual patterns only then image parsing will correspond to image segmentation (Tu and Zhu, 2002. IEEE Trans. PAM1, 24(5):657-673). We illustrate our algorithm on natural images of complex city scenes and show examples where image segmentation can be improved by allowing object specific knowledge to disambiguate low-level segmentation cues, and conversely where object detection can be improved by using generic visual patterns to explain away shadows and occlusions.
引用
收藏
页码:113 / 140
页数:28
相关论文
共 50 条
  • [31] Stochastic texture recognition for image segmentation
    Mueller, Thomas
    Erdnuess, Bastian
    TM-TECHNISCHES MESSEN, 2019, 86 (7-8) : 384 - 398
  • [32] Image segmentation and recognition of lunar rover
    Shi, De-Le
    Ye, Pei-Jian
    Jia, Yang
    Wang, Rong-Ben
    Guo, Lie
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2007, 37 (01): : 212 - 217
  • [33] Simultaneous Image Segmentation and Object Recognition
    Bansal, Gaurav
    2018 7TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (TRENDS AND FUTURE DIRECTIONS) (ICRITO) (ICRITO), 2018, : 873 - 876
  • [34] IMAGE SEGMENTATION BY OBJECT COLOR - A UNIFYING FRAMEWORK AND CONNECTION TO COLOR CONSTANCY
    BRILL, MH
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1990, 7 (10): : 2041 - 2047
  • [35] Table Structure Recognition and Form Parsing by End-to-End Object Detection and Relation Parsing
    Li, Xiao-Hui
    Yin, Fei
    Dai, He-Sen
    Liu, Cheng-Lin
    PATTERN RECOGNITION, 2022, 132
  • [36] IMAGE SEGMENTATION BASED PRIVACY-PRESERVING HUMAN ACTION RECOGNITION FOR ANOMALY DETECTION
    Yan, Jiawei
    Angelini, Federico
    Naqvi, Syed Mohsen
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8931 - 8935
  • [37] Robust ROI localization based on image segmentation and outlier detection in finger vein recognition
    Yanan Gao
    Jianxin Wang
    Liping Zhang
    Multimedia Tools and Applications, 2020, 79 : 20039 - 20059
  • [38] Robust ROI localization based on image segmentation and outlier detection in finger vein recognition
    Gao, Yanan
    Wang, Jianxin
    Zhang, Liping
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (27-28) : 20039 - 20059
  • [39] Object detection combining recognition and segmentation
    Wang, Liming
    Shi, Jianbo
    Song, Gang
    Shen, I-fan
    COMPUTER VISION - ACCV 2007, PT I, PROCEEDINGS, 2007, 4843 : 189 - +
  • [40] CT Brain Image:Abnormalities Recognition and Segmentation
    TONG Hau-Lee
    Mohammad Faizal Ahmad Fauzi
    Ryoichi Komiya
    HAW Su-Cheng
    JournalofDonghuaUniversity(EnglishEdition), 2010, 27 (02) : 246 - 249