Unsupervised Object Learning via Common Fate

被引:0
|
作者
Tangemann, Matthias [1 ,2 ]
Schneider, Steffen [1 ,2 ,3 ]
von Kuegelgen, Julius [4 ,5 ]
Locatello, Francesco [6 ]
Gehler, Peter [6 ]
Brox, Thomas [6 ]
Kuemmerer, Matthias [1 ,2 ]
Bethge, Matthias [1 ,2 ]
Schoelkopf, Bernhard [6 ]
机构
[1] Tubingen AI Ctr, Tubingen, Germany
[2] Univ Tubingen, Tubingen, Germany
[3] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[4] Max Planck Inst Intelligent Syst, Tubingen, Germany
[5] Univ Cambridge, Cambridge, England
[6] Amazon, Seattle, WA USA
关键词
object learning; scene modeling; scene generation; causal modeling; causal representation learning; generative modeling; common fate;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning generative object models from unlabelled videos is a long standing problem relevant for causal scene modeling. We decompose this task into three easier subtasks, and provide candidate solutions for each of them. Inspired by the Common Fate Principle of Gestalt Psychology, we first extract (noisy) masks of moving objects via unsupervised motion segmentation. Second, generative models are trained on the masks of the background and the moving objects, respectively. Third, background and foreground models are combined in a conditional "dead leaves" scene model to sample novel scene configurations where occlusions and depth layering arise naturally. To evaluate the individual stages, we introduce the FISHBOWL dataset positioned between complex real-world scenes and common object-centric benchmarks of simplistic objects. We show that our approach learns generative models that generalize beyond occlusions present in the input videos and represents scenes in a modular fashion, allowing generation of plausible scenes outside the training distribution by permitting, for instance, object numbers or densities not observed during training. Code: https://github.com/mtangemann/common_fate_object_learning
引用
收藏
页码:281 / 327
页数:47
相关论文
共 50 条
  • [1] Unsupervised Multiple Object Cosegmentation via Ensemble MIML Learning
    Yang, Weichen
    Sun, Zhengxing
    Li, Bo
    Hu, Jiagao
    Yang, Kewei
    [J]. MULTIMEDIA MODELING, MMM 2017, PT II, 2017, 10133 : 393 - 404
  • [2] Unsupervised Learning of Object Landmarks via Self-Training Correspondence
    Mallis, Dimitrios
    Sanchez, Enrique
    Bell, Matt
    Tzimiropoulos, Georgios
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [3] Object-agnostic Affordance Categorization via Unsupervised Learning of Graph Embeddings
    Toumpa A.
    Cohn A.G.
    [J]. Journal of Artificial Intelligence Research, 2023, (77): : 1 - 38
  • [4] Object-agnostic Affordance Categorization via Unsupervised Learning of Graph Embeddings
    Toumpa, Alexia
    Cohn, Anthony G.
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2023, 77 : 1 - 38
  • [5] Unsupervised Learning of Foreground Object Segmentation
    Croitoru, Ioana
    Bogolin, Simion-Vlad
    Leordeanu, Marius
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (09) : 1279 - 1302
  • [6] Unsupervised Learning of Foreground Object Segmentation
    Ioana Croitoru
    Simion-Vlad Bogolin
    Marius Leordeanu
    [J]. International Journal of Computer Vision, 2019, 127 : 1279 - 1302
  • [7] Unsupervised learning of object deformation models
    Kokkinos, Iasonas
    Yuille, Alan
    [J]. 2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 282 - 289
  • [8] Object Detection by Common Fate Hough Transform
    Wang, Zhipeng
    Cui, Jinshi
    Zha, Hongbin
    Kegesawa, Masataka
    Ikeuchi, Katsushi
    [J]. 2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 613 - 617
  • [9] Unsupervised Learning for Salient Object Detection via Minimization of Bilinear Factor Matrix Norm
    Li, Min
    Zhang, Yao
    Xiao, Mingqing
    Zhang, Weiqiang
    Sun, Xiaoli
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (03) : 1354 - 1366
  • [10] Unsupervised Object Class Discovery via Saliency-Guided Multiple Class Learning
    Zhu, Jun-Yan
    Wu, Jiajun
    Xu, Yan
    Chang, Eric
    Tu, Zhuowen
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (04) : 862 - 875