Figure-Ground Segmentation Improves Handled Object Recognition in Egocentric Video

被引：67

作者：

Ren, Xiaofeng ^{[1
]}

Gu, Chunhui ^{[2
]}

机构：

[1] Intel Labs Seattle, 1100 NE 45th St, Seattle, WA 98105 USA

[2] Univ Calif Berkeley, Berkeley, CA 94720 USA

来源：

2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2010年

关键词：

D O I：

10.1109/CVPR.2010.5540074

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Identifying handled objects, i.e. objects being manipulated by a user, is essential for recognizing the person's activities. An egocentric camera as worn on the body enjoys many advantages such as having a natural first-person view and not needing to instrument the environment. It is also a challenging setting, where background clutter is known to be a major source of problems and is difficult to handle with the camera constantly and arbitrarily moving. In this work we develop a bottom-up motion-based approach to robustly segment out foreground objects in egocentric video and show that it greatly improves object recognition accuracy. Our key insight is that egocentric video of object manipulation is a special domain and many domain-specific cues can readily help. We compute dense optical flow and fit it into multiple affine layers. We then use a max-margin classifier to combine motion with empirical knowledge of object location and background movement as well as temporal cues of support region and color appearance. We evaluate our segmentation algorithm on the large Intel Egocentric Object Recognition dataset with 42 objects and 100K frames. We show that, when combined with temporal integration, figure-ground segmentation improves the accuracy of a SIFT-based recognition system from 33% to 60%, and that of a latent-HOG system from 64% to 86%.

引用

页码：3137 / 3144

页数：8

共 50 条

[41] Probabilistic Joint Image Segmentation and Labeling by Figure-Ground Composition
Ion, Adrian
Carreira, Joao
Sminchisescu, Cristian
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 107 (01) : 40 - 57
[42] Deficit in figure-ground segmentation following closed head injury
Baylis, GC
Bayliss, LL
[J]. NEUROPSYCHOLOGIA, 1997, 35 (08) : 1133 - 1138
[43] A Unified Contour-Pixel Model for Figure-Ground Segmentation
Packer, Ben
Gould, Stephen
Koller, Daphne
[J]. COMPUTER VISION-ECCV 2010, PT V, 2010, 6315 : 338 - +
[44] Spiking model of fixational eye movements and figure-ground segmentation
Romeo, August
Super, Hans
[J]. NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2022, 33 (1-2) : 143 - 166
[45] A Supervised Figure-Ground Segmentation Method Using Genetic Programming
Liang, Yuyu
Zhang, Mengjie
Browne, Will N.
[J]. APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2015, 2015, 9028 : 491 - 503
[46] Learning to approximate global shape priors for figure-ground segmentation
Kuettel, Daniel
Ferrari, Vittorio
[J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
[47] Online figure-ground segmentation with adaptive metrics in generalized LVQ
Denecke, Alexander
Wersing, Heiko
Steil, Jochen J.
Koerner, Edgar
[J]. NEUROCOMPUTING, 2009, 72 (7-9) : 1470 - 1482
[48] Image feature selection using genetic programming for figure-ground segmentation
Liang, Yuyu
Zhang, Mengjie
Browne, Will N.
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 62 : 96 - 108
[49] Spatial suppression promotes rapid figure-ground segmentation of moving objects
Duje Tadin
Woon Ju Park
Kevin C. Dieter
Michael D. Melnick
Joseph S. Lappin
Randolph Blake
[J]. Nature Communications, 10
[50] Spatial suppression promotes rapid figure-ground segmentation of moving objects
Tadin, Duje
Park, Woon Ju
Dieter, Kevin C.
Melnick, Michael D.
Lappin, Joseph S.
Blake, Randolph
[J]. NATURE COMMUNICATIONS, 2019, 10 (1)

← 1 2 3 4 5 →