Robust object detection with interleaved categorization and segmentation

被引：578

作者：

Leibe, Bastian ^{[1
]}

Leonardis, Ales ^{[2
]}

Schiele, Bernt ^{[3
]}

机构：

[1] ETH, Comp Vis Lab, Zurich, Switzerland

[2] Univ Ljubljana, Fac Comp & Informat Sci, Ljubljana, Slovenia

[3] Tech Univ Darmstadt, Dept Comp Sci, Darmstadt, Germany

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2008年 / 77卷 / 1-3期

关键词：

object categorization; object detection; segmentation; clustering; hough transform; hypothesis selection; MDL;

D O I：

10.1007/s11263-007-0095-3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a novel method for detecting and localizing objects of a visual category in cluttered real-world scenes. Our approach considers object categorization and figure-ground segmentation as two interleaved processes that closely collaborate towards a common goal. As shown in our work, the tight coupling between those two processes allows them to benefit from each other and improve the combined performance. The core part of our approach is a highly flexible learned representation for object shape that can combine the information observed on different training examples in a probabilistic extension of the Generalized Hough Transform. The resulting approach can detect categorical objects in novel images and automatically infer a probabilistic segmentation from the recognition result. This segmentation is then in turn used to again improve recognition by allowing the system to focus its efforts on object pixels and to discard misleading influences from the background. Moreover, the information from where in the image a hypothesis draws its support is employed in an MDL based hypothesis verification stage to resolve ambiguities between overlapping hypotheses and factor out the effects of partial occlusion. An extensive evaluation on several large data sets shows that the proposed system is applicable to a range of different object categories, including both rigid and articulated objects. In addition, its flexible representation allows it to achieve competitive object detection performance already from training sets that are between one and two orders of magnitude smaller than those used in comparable systems.

引用

页码：259 / 289

页数：31

共 50 条

[41] Object detection combining recognition and segmentation
Wang, Liming
Shi, Jianbo
Song, Gang
Shen, I-fan
COMPUTER VISION - ACCV 2007, PT I, PROCEEDINGS, 2007, 4843 : 189 - +
[42] Simultaneous Object Detection and Semantic Segmentation
Salscheider, Niels Ole
ICPRAM: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2020, : 555 - 561
[43] OBJECT DETECTION AND IDENTIFICATION BY HIERARCHICAL SEGMENTATION
HANUSSE, P
GUILLATAUD, P
LECTURE NOTES IN COMPUTER SCIENCE, 1990, 427 : 583 - 585
[44] Fast and Robust Object Segmentation with the Integral Linear Classifier
Aldavert, David
Ramisa, Arnau
Lopez de Mantaras, Ramon
Toledo, Ricardo
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 1046 - 1053
[45] Robust and Efficient Memory Network for Video Object Segmentation
Chen, Yadang
Zhang, Dingwei
Yang, Zhi-Xin
Wu, Enhua
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1769 - 1774
[46] Robust object segmentation using low resolution stereo
Morimoto, M
Fujii, K
IMAGE PROCESSING, BIOMEDICINE, MULTIMEDIA, FINANCIAL ENGINEERING AND MANUFACTURING, VOL 18, 2004, 18 : 75 - 80
[47] Multi-scale object segmentation for robust recognition
Mokhtarian, F
VISION INTERFACE '97, PROCEEDINGS, 1997, : 8 - 15
[48] A ROBUST FRAMEWORK FOR REGION BASED VIDEO OBJECT SEGMENTATION
Escudero-Vinolo, Marcos
Bescos, Jesus
2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 3461 - 3464
[49] FAST AND ROBUST OBJECT SEGMENTATION APPROACH FOR MPEG VIDEOS
Ahmad, Ashraf M. A.
COMPUTER VISION AND GRAPHICS (ICCVG 2004), 2006, 32 : 746 - 751
[50] Robust Object Segmentation using Split-and-Merge
Faruquzzaman, A. B. M.
Paiker, Nafize Rabbani
Arafat, Jahidul
Ali, M. Ameer
Sorwar, Golam
INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2009, 2 (1-2) : 70 - 80

← 1 2 3 4 5 →