A Computational Learning Theory of Active Object Recognition Under Uncertainty

被引:26
|
作者
Andreopoulos, Alexander [1 ]
Tsotsos, John K. [2 ]
机构
[1] IBM Res Almaden, San Jose, CA 95120 USA
[2] York Univ, Dept Comp Sci & Engn, Ctr Vis Res, Toronto, ON M3J 2R7, Canada
关键词
Object recognition; Visual search; Active vision; Attention; Computational complexity of vision; VISUAL-ATTENTION; MODEL; COMPLEXITY; SALIENCY; SEARCH; TASK;
D O I
10.1007/s11263-012-0551-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present some theoretical results related to the problem of actively searching a 3D scene to determine the positions of one or more pre-specified objects. We investigate the effects that input noise, occlusion, and the VC-dimensions of the related representation classes have in terms of localizing all objects present in the search region, under finite computational resources and a search cost constraint. We present a number of bounds relating the noise-rate of low level feature detection to the VC-dimension of an object representable by an architecture satisfying the given computational constraints. We prove that under certain conditions, the corresponding classes of object localization and recognition problems are efficiently learnable in the presence of noise and under a purposive learning strategy, as there exists a polynomial upper bound on the minimum number of examples necessary to correctly localize the targets under the given models of uncertainty. We also use these arguments to show that passive approaches to the same problem do not necessarily guarantee that the problem is efficiently learnable. Under this formulation, we prove the existence of a number of emergent relations between the object detection noise-rate, the scene representation length, the object class complexity, and the representation class complexity, which demonstrate that selective attention is not only necessary due to computational complexity constraints, but it is also necessary as a noise-suppression mechanism and as a mechanism for efficient object class learning. These results concretely demonstrate the advantages of active, purposive and attentive approaches for solving complex vision problems.
引用
收藏
页码:95 / 142
页数:48
相关论文
共 50 条
  • [31] Learning Hybrid Object Kinematics for Efficient Hierarchical Planning Under Uncertainty
    Jain, Ajinkya
    Niekum, Scott
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5253 - 5260
  • [32] A Facial Expression Recognition Method Integrating Uncertainty Estimation and Active Learning
    Wang, Yujian
    Zhang, Jianxun
    Sun, Renhao
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (01): : 533 - 548
  • [33] Learning temporal context in active object recognition using Bayesian analysis
    Paletta, L
    Prantl, M
    Pinz, A
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 695 - 699
  • [34] Active Object Perceiver: Recognition-guided Policy Learning for Object Searching on Mobile Robots
    Ye, Xin
    Lin, Zhe
    Li, Haoxiang
    Zheng, Shibin
    Yang, Yezhou
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 6857 - 6863
  • [35] Transinformation for active object recognition
    Schiele, B
    Crowley, JL
    SIXTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, 1998, : 249 - 254
  • [36] Object recognition by active fusion
    Prantl, M
    Borotschnig, H
    Ganster, H
    Sinclair, D
    Pinz, A
    INTELLIGENT ROBOTS AND COMPUTER VISION XV: ALGORITHMS, TECHNIQUES, ACTIVE VISION, AND MATERIALS HANDLING, 1996, 2904 : 320 - 330
  • [37] A computational model for learning from repeated traumatic experiences under uncertainty
    Kaye, Alfred P.
    Rao, Manasa G.
    Kwan, Alex C.
    Ressler, Kerry J.
    Krystal, John H.
    COGNITIVE AFFECTIVE & BEHAVIORAL NEUROSCIENCE, 2023, 23 (03) : 894 - 904
  • [38] Interval deep learning for computational mechanics problems under input uncertainty
    Betancourt, David
    Muhanna, Rafi L.
    PROBABILISTIC ENGINEERING MECHANICS, 2022, 70
  • [39] A computational model for learning from repeated traumatic experiences under uncertainty
    Alfred P. Kaye
    Manasa G. Rao
    Alex C. Kwan
    Kerry J. Ressler
    John H. Krystal
    Cognitive, Affective, & Behavioral Neuroscience, 2023, 23 : 894 - 904
  • [40] Uncertainty Fusion based Object Recognition and Tracking in Maritime Scenes using Spatiotemporal Active Contours
    Bechar, Ikhlef
    Bouchara, Frederic
    Lelore, Thibault
    Guis, Vincente
    Grimaldi, Michel
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS (VISAPP), VOL 1, 2014, : 682 - 689