Figure-Ground Segmentation Improves Handled Object Recognition in Egocentric Video

被引:67
|
作者
Ren, Xiaofeng [1 ]
Gu, Chunhui [2 ]
机构
[1] Intel Labs Seattle, 1100 NE 45th St, Seattle, WA 98105 USA
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
关键词
D O I
10.1109/CVPR.2010.5540074
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identifying handled objects, i.e. objects being manipulated by a user, is essential for recognizing the person's activities. An egocentric camera as worn on the body enjoys many advantages such as having a natural first-person view and not needing to instrument the environment. It is also a challenging setting, where background clutter is known to be a major source of problems and is difficult to handle with the camera constantly and arbitrarily moving. In this work we develop a bottom-up motion-based approach to robustly segment out foreground objects in egocentric video and show that it greatly improves object recognition accuracy. Our key insight is that egocentric video of object manipulation is a special domain and many domain-specific cues can readily help. We compute dense optical flow and fit it into multiple affine layers. We then use a max-margin classifier to combine motion with empirical knowledge of object location and background movement as well as temporal cues of support region and color appearance. We evaluate our segmentation algorithm on the large Intel Egocentric Object Recognition dataset with 42 objects and 100K frames. We show that, when combined with temporal integration, figure-ground segmentation improves the accuracy of a SIFT-based recognition system from 33% to 60%, and that of a latent-HOG system from 64% to 86%.
引用
收藏
页码:3137 / 3144
页数:8
相关论文
共 50 条
  • [1] Object Recognition by Sequential Figure-Ground Ranking
    Carreira, Joao
    Li, Fuxin
    Sminchisescu, Cristian
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 98 (03) : 243 - 262
  • [2] Object Recognition by Sequential Figure-Ground Ranking
    João Carreira
    Fuxin Li
    Cristian Sminchisescu
    [J]. International Journal of Computer Vision, 2012, 98 : 243 - 262
  • [3] Figure-Ground Segmentation Using Object Oriented Descriptor
    Ambulkar, Snehal P.
    Sakhare, Nikhil S.
    Gaikwad, Vishesh P.
    [J]. PROCEEDINGS OF 2ND IEEE INTERNATIONAL CONFERENCE ON ENGINEERING & TECHNOLOGY ICETECH-2016, 2016, : 223 - 226
  • [4] Video Segmentation by Tracking Many Figure-Ground Segments
    Li, Fuxin
    Kim, Taeyoung
    Humayun, Ahmad
    Tsai, David
    Rehg, James M.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2192 - 2199
  • [5] Object Recognition as Ranking Holistic Figure-Ground Hypotheses
    Li, Fuxin
    Carreira, Joao
    Sminchisescu, Cristian
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 1712 - 1719
  • [6] Noise destroys feedback enhanced figure-ground segmentation but not feedforward figure-ground segmentation
    Romeo, August
    Arall, Marina
    Super, Hans
    [J]. FRONTIERS IN PHYSIOLOGY, 2012, 3
  • [7] Evaluating Superpixels in Video: Metrics Beyond Figure-Ground Segmentation
    Neubert, Peer
    Protzel, Peter
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
  • [8] Figure-ground organization and object recognition processes: An interactive account
    Vecera, SP
    O'Reilly, RC
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 1998, 24 (02) : 441 - 462
  • [9] Object Figure-Ground Segmentation Using Zero-Shot Learning
    Naha, Shujon
    Wang, Yang
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2842 - 2847
  • [10] Intact figure-ground segmentation in schizophrenia
    Herzog, MH
    Kopmann, S
    Brand, A
    [J]. PSYCHIATRY RESEARCH, 2004, 129 (01) : 55 - 63