Unsupervised learning of object landmarks by factorized spatial embeddings

被引:72
|
作者
Thewlis, James [1 ]
Bilen, Hakan [1 ,2 ]
Vedaldi, Andrea [1 ]
机构
[1] Univ Oxford, Oxford, England
[2] Univ Edinburgh, Edinburgh, Midlothian, Scotland
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1109/ICCV.2017.348
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning automatically the structure of object categories remains an important open problem in computer vision. In this paper, we propose a novel unsupervised approach that can discover and learn landmarks in object categories, thus characterizing their structure. Our approach is based on factorizing image deformations, as induced by a viewpoint change or an object deformation, by learning a deep neural network that detects landmarks consistently with such visual effects. Furthermore, we show that the learned landmarks establish meaningful correspondences between different object instances in a category without having to impose this requirement explicitly. We assess the method qualitatively on a variety of object types, natural and man-made. We also show that our unsupervised landmarks are highly predictive of manually-annotated landmarks in face benchmark datasets, and can be used to regress these with a high degree of accuracy.
引用
收藏
页码:3229 / 3238
页数:10
相关论文
共 50 条
  • [1] Unsupervised Learning of Object Landmarks through Conditional Image Generation
    Jakab, Tomas
    Gupta, Ankush
    Bilen, Hakan
    Vedaldi, Andrea
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [2] Unsupervised Learning of Object Landmarks via Self-Training Correspondence
    Mallis, Dimitrios
    Sanchez, Enrique
    Bell, Matt
    Tzimiropoulos, Georgios
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [3] Unsupervised Discovery of Object Landmarks as Structural Representations
    Zhang, Yuting
    Guo, Yijie
    Jin, Yixin
    Luo, Yijun
    He, Zhiyuan
    Lee, Honglak
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2694 - 2703
  • [4] Object-agnostic Affordance Categorization via Unsupervised Learning of Graph Embeddings
    Toumpa A.
    Cohn A.G.
    [J]. Journal of Artificial Intelligence Research, 2023, (77): : 1 - 38
  • [5] Object-agnostic Affordance Categorization via Unsupervised Learning of Graph Embeddings
    Toumpa, Alexia
    Cohn, Anthony G.
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2023, 77 : 1 - 38
  • [6] Unsupervised Learning of Object-Centric Embeddings for Cell Instance Segmentation in Microscopy Images
    Wolf, Steffen
    Lalit, Manan
    McDole, Katie
    Funke, Jan
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21206 - 21215
  • [7] Factorized Diffusion Autoencoder for Unsupervised Disentangled Representation Learning
    Wu, Ancong
    Zheng, Wei-Shi
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5930 - 5939
  • [8] Unsupervised Learning of Disentangled Location Embeddings
    Ouyang, Kun
    Liang, Yuxuan
    Liu, Ye
    Rosenblum, David S.
    Yang, Wenzhuo
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [9] Unsupervised Learning of Landmarks by Descriptor Vector Exchange
    Thewlis, James
    Albanie, Samuel
    Bilen, Hakan
    Vedaldi, Andrea
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6370 - 6380
  • [10] Learning Invariant Object and Spatial View Representations in the Brain Using Slow Unsupervised Learning
    Rolls, Edmund T.
    [J]. FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2021, 15