Learning Internal Representations of 3D Transformations From 2D Projected Inputs

被引：0

作者：

Connor, Marissa ^{[1
]}

Olshausen, Bruno ^{[2
,3
]}

Rozell, Christopher ^{[1
]}

机构：

[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA

[2] Univ Calif Berkeley, Helen Wills Neurosci Inst, Berkeley, CA 94720 USA

[3] Univ Calif Berkeley, Sch Optometry, Berkeley, CA 94720 USA

来源：

NEURAL COMPUTATION | 2024年 / 36卷 / 11期

关键词：

MENTAL ROTATION; KINETIC DEPTH; 3-DIMENSIONAL STRUCTURE; LIE-GROUPS; MOTION; RECONSTRUCTION; MODEL; SHAPE;

D O I：

10.1162/neco_a_01695

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We describe a computational model for inferring 3D structure from the motion of projected 2D points in an image, with the aim of understanding how biological vision systems learn and internally represent 3D transformations from the statistics of their input. The model uses manifold transport operators to describe the action of 3D points in a scene as they undergo transformation. We show that the model can learn the generator of the Lie group for these transformations from purely 2D input, providing a proof-of-concept demonstration for how biological systems could adapt their internal representations based on sensory input. Focusing on a rotational model, we evaluate the ability of the model to infer depth from moving 2D projected points and to learn rotational transformations from 2D training stimuli. Finally, we compare the model performance to psychophysical performance on structure-from-motion tasks.

引用

页码：2505 / 2539

页数：35

共 50 条

[21] Learning Transferable and Discriminative Representations for 2D Image-Based 3D Model Retrieval
Zhou, Yaqian
Liu, Yu
Zhou, Heyu
Cheng, Zhiyong
Li, Xuanya
Liu, An-An
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 7147 - 7159
[22] LEARNING 3D WHITE MATTER MICROSTRUCTURE FROM 2D HISTOLOGY
Nath, Vishwesh
Schilling, Kurt G.
Remedios, Samuel
Bayrak, Roza G.
Gao, Yurui
Blaber, Justin A.
Huo, Yuankai
Landman, Bennett A.
Anderson, A. W.
2019 IEEE 16TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2019), 2019, : 186 - 190
[23] Learning to Produce 3D Media From a Captured 2D Video
Park, Minwoo
Luo, Jiebo
Gallagher, Andrew C.
Rabbani, Majid
IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (07) : 1569 - 1578
[24] CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP
Zhang, Junbo
Dong, Runpei
Ma, Kaisheng
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2040 - 2051
[25] Exploring rich intermediate representations for reconstructing 3D shapes from 2D images
Yang, Yang
Han, Junwei
Zhang, Dingwen
Tian, Qi
PATTERN RECOGNITION, 2022, 122
[26] HIERARCHICAL REPRESENTATIONS OF 2D/3D GRAY-SCALE IMAGES AND THEIR 2D/3D TWO-WAY CONVERSION.
Mao, Xiaoyang
Kunii, Tosiyasu L.
Fujishiro, Issei
Noma, Tsukasa
IEEE Computer Graphics and Applications, 1987, 7 (11) : 37 - 44
[27] Design in 2D, model in 3D: Live 3D pose generation from 2D sketches
Tosco, Paolo
Mackey, Mark
Cheeseright, Tim
ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 258
[28] Shape features for projected 2D images based 3D object recognition
Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
Qinghua Daxue Xuebao, 2009, 10 (1646-1650): : 1646 - 1650
[29] 2D or 3D?
Mills, R
COMPUTER-AIDED ENGINEERING, 1996, 15 (08): : 4 - 4
[30] Learning 3D Scene Priors with 2D Supervision
Nie, Yinyu
Dai, Angela
Han, Xiaoguang
Niessner, Matthias
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 792 - 802

← 1 2 3 4 5 →