Video-based descriptors for object recognition

Cited by: 13
Authors
Lee, Taehee [1]
Soatto, Stefano [1]
Affiliations
[1] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
Keywords
Feature tracking; Video-based descriptors; Object recognition; Multi-view recognition; Mobile devices; Visual recognition; Active vision; SHAPE; PERSISTENCE
DOI
10.1016/j.imavis.2011.08.003
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We describe a visual recognition system operating on a hand-held device, based on a video-based feature descriptor, and characterize its invariance and discriminative properties. Feature selection and tracking are performed in real time, and used to train a template-based classifier during a capture phase prompted by the user. During normal operation, the system recognizes objects in the field of view based on their ranking. Severe resource constraints have prompted a re-evaluation of existing algorithms, improving both their performance (accuracy and robustness) and their computational efficiency. We motivate the design choices in the implementation with a characterization of the stability properties of local invariant detectors, and of the conditions under which a template-based descriptor is optimal. The analysis also highlights the role of time as a "weak supervisor" during training, which we exploit in our implementation. (C) 2011 Elsevier B.V. All rights reserved.
Pages: 639-652
Page count: 14