Ideal observers of visual object recognition

Author
Liu, ZL [1]
Affiliation
[1] NEC Res Inst, Princeton, NJ 08540 USA
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Converging evidence has shown that human object recognition depends on the observers' familiarity with objects' appearance. The more similar the objects are, the stronger this dependence will be, and the more important two-dimensional (2D) image information will be for discriminating these objects from one another. The degree to which 3D structural information is used, however, remains an area of strong debate. Previously, we showed that no model that allows rotations in the image plane of independent 2D templates could account for human performance in discriminating novel object views that result from 3D rotation. We now present results from three models: generalized radial basis functions (GRBF), 2D closest template matching that allows 2D affine transformations of independent 2D templates, and a Bayesian statistical estimator that integrates over all possible 2D affine transformations. The performance of the human observers relative to each of the models is better for the novel views than for the learned template views, which implies that human observers generalize from learned views to novel views better than the models do. The Bayesian estimator provably yields the optimal performance among all models of 2D affine transformations with independent 2D templates. Therefore, no model of 2D affine operations with independent 2D templates can account for the human observers' performance. We suggest that the human observers used 3D structural information of the objects, a conclusion that is also supported by the improved performance as the objects' 3D structural regularity increases.
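As a rough illustration of the "2D closest template matching with 2D affine transformations" idea named in the abstract, the sketch below fits a least-squares affine map from each stored template to a stimulus and picks the template with the smallest residual. It is not the author's implementation: the point-set representation, the assumption of known point correspondences, the squared-error criterion, and the names `affine_residual` and `closest_template` are all assumptions made for this example.

```python
import numpy as np

def affine_residual(template, target):
    """Squared error left after the best least-squares 2D affine fit.

    template, target: (n, 2) arrays of corresponding 2D feature points
    (known correspondence is an assumption of this sketch).
    """
    n = template.shape[0]
    # Homogeneous coordinates so one solve covers the linear part and the shift.
    A = np.hstack([template, np.ones((n, 1))])          # shape (n, 3)
    params, _, _, _ = np.linalg.lstsq(A, target, rcond=None)  # shape (3, 2)
    fitted = A @ params
    return float(np.sum((fitted - target) ** 2))

def closest_template(templates, target):
    """Return the index of the template with the smallest affine residual,
    together with the list of all residuals."""
    residuals = [affine_residual(t, target) for t in templates]
    return int(np.argmin(residuals)), residuals

# Hypothetical data: two 5-point templates and a stimulus that is an
# exact affine image of the first template.
t0 = np.array([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0], [0.0, 1.0], [2.0, 0.5]])
t1 = np.array([[0.0, 0.0], [1.0, 0.0], [0.5, 2.0], [3.0, 1.0], [1.0, 1.0]])
M = np.array([[2.0, 0.5], [-0.3, 1.5]])   # linear part of the affine map
b = np.array([1.0, -2.0])                 # translation
stimulus = t0 @ M.T + b

best, residuals = closest_template([t0, t1], stimulus)
```

Here the residual for the first template is zero up to rounding, so it is selected. A novel view produced by 3D rotation is generally not an affine image of any stored 2D template, which is the situation in which the abstract reports that such models fall short of human performance.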
Pages: 145-154
Page count: 10