Learning Internal Representations of 3D Transformations From 2D Projected Inputs

被引：0

作者：

Connor, Marissa ^{[1
]}

Olshausen, Bruno ^{[2
,3
]}

Rozell, Christopher ^{[1
]}

机构：

[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA

[2] Univ Calif Berkeley, Helen Wills Neurosci Inst, Berkeley, CA 94720 USA

[3] Univ Calif Berkeley, Sch Optometry, Berkeley, CA 94720 USA

来源：

NEURAL COMPUTATION | 2024年 / 36卷 / 11期

关键词：

MENTAL ROTATION; KINETIC DEPTH; 3-DIMENSIONAL STRUCTURE; LIE-GROUPS; MOTION; RECONSTRUCTION; MODEL; SHAPE;

D O I：

10.1162/neco_a_01695

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We describe a computational model for inferring 3D structure from the motion of projected 2D points in an image, with the aim of understanding how biological vision systems learn and internally represent 3D transformations from the statistics of their input. The model uses manifold transport operators to describe the action of 3D points in a scene as they undergo transformation. We show that the model can learn the generator of the Lie group for these transformations from purely 2D input, providing a proof-of-concept demonstration for how biological systems could adapt their internal representations based on sensory input. Focusing on a rotational model, we evaluate the ability of the model to infer depth from moving 2D projected points and to learn rotational transformations from 2D training stimuli. Finally, we compare the model performance to psychophysical performance on structure-from-motion tasks.

引用

页码：2505 / 2539

页数：35

共 50 条

[31] Mathematically provable correct implementation of integrated 2D and 3D representations
Thompson, Rodney
van Oosterom, Peter
ADVANCES IN 3D GEOINFORMATION SYSTEMS, 2008, : 247 - 278
[32] Exploring 3D DTI Fiber Tracts with Linked 2D Representations
Jianu, Radu
Demiralp, Cagatay
Laidlaw, David H.
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2009, 15 (06) : 1449 - 1456
[33] Unified 2D and 3D Pre-Training of Molecular Representations
Zhu, Jinhua
Xia, Yingce
Wu, Lijun
Xie, Shufang
Qin, Tao
Zhou, Wengang
Li, Houqiang
Liu, Tie-Yan
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2626 - 2636
[34] Continuous medial representations for geometric object modeling in 2D and 3D
Yushkevich, P
Fletcher, PT
Joshi, S
Thall, A
Pizer, SM
IMAGE AND VISION COMPUTING, 2003, 21 (01) : 17 - 27
[35] Augmented Reality to Scaffold 2D Representations of 3D Models in Magnetism
McColgan, Michele W.
Hassel, George E.
Stagnitti, Natalie C.
Morphew, Jason W.
Lindell, Rebecca
2023 PHYSICS EDUCATION RESEARCH CONFERENCE, PERC, 2023, : 211 - 216
[36] Scenes in the Human Brain: Comparing 2D versus 3D Representations
Groen, Iris I. A.
Baker, Chris I.
NEURON, 2019, 101 (01) : 8 - 10
[37] 2D and 3D Representations of the Noise in a PCB Using Analytical Methods
Fizesan, Raul
Pitica, Dan
Pop, Ovidiu
2013 PROCEEDINGS OF THE 36TH INTERNATIONAL SPRING SEMINAR ON ELECTRONICS TECHNOLOGY (ISSE), 2013, : 419 - 422
[38] Learning 2D to 3D Lifting for Object Detection in 3D for Autonomous Vehicles
Srivastava, Siddharth
Jurie, Frederic
Sharma, Gaurav
2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 4504 - 4511
[39] Prediction of 3D size and shape descriptors of irregular granular particles from projected 2D images
Su, D.
Yan, W. M.
ACTA GEOTECHNICA, 2020, 15 (06) : 1533 - 1555
[40] Prediction of 3D size and shape descriptors of irregular granular particles from projected 2D images
D. Su
W. M. Yan
Acta Geotechnica, 2020, 15 : 1533 - 1555

← 1 2 3 4 5 →