Learning Internal Representations of 3D Transformations From 2D Projected Inputs

被引:0
|
作者
Connor, Marissa [1 ]
Olshausen, Bruno [2 ,3 ]
Rozell, Christopher [1 ]
机构
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
[2] Univ Calif Berkeley, Helen Wills Neurosci Inst, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Sch Optometry, Berkeley, CA 94720 USA
关键词
MENTAL ROTATION; KINETIC DEPTH; 3-DIMENSIONAL STRUCTURE; LIE-GROUPS; MOTION; RECONSTRUCTION; MODEL; SHAPE;
D O I
10.1162/neco_a_01695
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a computational model for inferring 3D structure from the motion of projected 2D points in an image, with the aim of understanding how biological vision systems learn and internally represent 3D transformations from the statistics of their input. The model uses manifold transport operators to describe the action of 3D points in a scene as they undergo transformation. We show that the model can learn the generator of the Lie group for these transformations from purely 2D input, providing a proof-of-concept demonstration for how biological systems could adapt their internal representations based on sensory input. Focusing on a rotational model, we evaluate the ability of the model to infer depth from moving 2D projected points and to learn rotational transformations from 2D training stimuli. Finally, we compare the model performance to psychophysical performance on structure-from-motion tasks.
引用
收藏
页码:2505 / 2539
页数:35
相关论文
共 50 条
  • [31] Mathematically provable correct implementation of integrated 2D and 3D representations
    Thompson, Rodney
    van Oosterom, Peter
    ADVANCES IN 3D GEOINFORMATION SYSTEMS, 2008, : 247 - 278
  • [32] Exploring 3D DTI Fiber Tracts with Linked 2D Representations
    Jianu, Radu
    Demiralp, Cagatay
    Laidlaw, David H.
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2009, 15 (06) : 1449 - 1456
  • [33] Unified 2D and 3D Pre-Training of Molecular Representations
    Zhu, Jinhua
    Xia, Yingce
    Wu, Lijun
    Xie, Shufang
    Qin, Tao
    Zhou, Wengang
    Li, Houqiang
    Liu, Tie-Yan
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2626 - 2636
  • [34] Continuous medial representations for geometric object modeling in 2D and 3D
    Yushkevich, P
    Fletcher, PT
    Joshi, S
    Thall, A
    Pizer, SM
    IMAGE AND VISION COMPUTING, 2003, 21 (01) : 17 - 27
  • [35] Augmented Reality to Scaffold 2D Representations of 3D Models in Magnetism
    McColgan, Michele W.
    Hassel, George E.
    Stagnitti, Natalie C.
    Morphew, Jason W.
    Lindell, Rebecca
    2023 PHYSICS EDUCATION RESEARCH CONFERENCE, PERC, 2023, : 211 - 216
  • [36] Scenes in the Human Brain: Comparing 2D versus 3D Representations
    Groen, Iris I. A.
    Baker, Chris I.
    NEURON, 2019, 101 (01) : 8 - 10
  • [37] 2D and 3D Representations of the Noise in a PCB Using Analytical Methods
    Fizesan, Raul
    Pitica, Dan
    Pop, Ovidiu
    2013 PROCEEDINGS OF THE 36TH INTERNATIONAL SPRING SEMINAR ON ELECTRONICS TECHNOLOGY (ISSE), 2013, : 419 - 422
  • [38] Learning 2D to 3D Lifting for Object Detection in 3D for Autonomous Vehicles
    Srivastava, Siddharth
    Jurie, Frederic
    Sharma, Gaurav
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 4504 - 4511
  • [39] Prediction of 3D size and shape descriptors of irregular granular particles from projected 2D images
    Su, D.
    Yan, W. M.
    ACTA GEOTECHNICA, 2020, 15 (06) : 1533 - 1555
  • [40] Prediction of 3D size and shape descriptors of irregular granular particles from projected 2D images
    D. Su
    W. M. Yan
    Acta Geotechnica, 2020, 15 : 1533 - 1555