Learning Internal Representations of 3D Transformations From 2D Projected Inputs

被引:0
|
作者
Connor, Marissa [1 ]
Olshausen, Bruno [2 ,3 ]
Rozell, Christopher [1 ]
机构
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
[2] Univ Calif Berkeley, Helen Wills Neurosci Inst, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Sch Optometry, Berkeley, CA 94720 USA
关键词
MENTAL ROTATION; KINETIC DEPTH; 3-DIMENSIONAL STRUCTURE; LIE-GROUPS; MOTION; RECONSTRUCTION; MODEL; SHAPE;
D O I
10.1162/neco_a_01695
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a computational model for inferring 3D structure from the motion of projected 2D points in an image, with the aim of understanding how biological vision systems learn and internally represent 3D transformations from the statistics of their input. The model uses manifold transport operators to describe the action of 3D points in a scene as they undergo transformation. We show that the model can learn the generator of the Lie group for these transformations from purely 2D input, providing a proof-of-concept demonstration for how biological systems could adapt their internal representations based on sensory input. Focusing on a rotational model, we evaluate the ability of the model to infer depth from moving 2D projected points and to learn rotational transformations from 2D training stimuli. Finally, we compare the model performance to psychophysical performance on structure-from-motion tasks.
引用
收藏
页码:2505 / 2539
页数:35
相关论文
共 50 条
  • [21] Learning Transferable and Discriminative Representations for 2D Image-Based 3D Model Retrieval
    Zhou, Yaqian
    Liu, Yu
    Zhou, Heyu
    Cheng, Zhiyong
    Li, Xuanya
    Liu, An-An
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 7147 - 7159
  • [22] LEARNING 3D WHITE MATTER MICROSTRUCTURE FROM 2D HISTOLOGY
    Nath, Vishwesh
    Schilling, Kurt G.
    Remedios, Samuel
    Bayrak, Roza G.
    Gao, Yurui
    Blaber, Justin A.
    Huo, Yuankai
    Landman, Bennett A.
    Anderson, A. W.
    2019 IEEE 16TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2019), 2019, : 186 - 190
  • [23] Learning to Produce 3D Media From a Captured 2D Video
    Park, Minwoo
    Luo, Jiebo
    Gallagher, Andrew C.
    Rabbani, Majid
    IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (07) : 1569 - 1578
  • [24] CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP
    Zhang, Junbo
    Dong, Runpei
    Ma, Kaisheng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2040 - 2051
  • [25] Exploring rich intermediate representations for reconstructing 3D shapes from 2D images
    Yang, Yang
    Han, Junwei
    Zhang, Dingwen
    Tian, Qi
    PATTERN RECOGNITION, 2022, 122
  • [26] HIERARCHICAL REPRESENTATIONS OF 2D/3D GRAY-SCALE IMAGES AND THEIR 2D/3D TWO-WAY CONVERSION.
    Mao, Xiaoyang
    Kunii, Tosiyasu L.
    Fujishiro, Issei
    Noma, Tsukasa
    IEEE Computer Graphics and Applications, 1987, 7 (11) : 37 - 44
  • [27] Design in 2D, model in 3D: Live 3D pose generation from 2D sketches
    Tosco, Paolo
    Mackey, Mark
    Cheeseright, Tim
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 258
  • [28] Shape features for projected 2D images based 3D object recognition
    Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
    Qinghua Daxue Xuebao, 2009, 10 (1646-1650): : 1646 - 1650
  • [29] 2D or 3D?
    Mills, R
    COMPUTER-AIDED ENGINEERING, 1996, 15 (08): : 4 - 4
  • [30] Learning 3D Scene Priors with 2D Supervision
    Nie, Yinyu
    Dai, Angela
    Han, Xiaoguang
    Niessner, Matthias
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 792 - 802