Language-Mediated, Object-Centric Representation Learning

被引:0
|
作者
Wang, Ruocheng [1 ]
Mao, Jiayuan [2 ]
Gershman, Samuel J. [3 ]
Wu, Jiajun
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] MIT CSAIL, Cambridge, MA USA
[3] Harvard Univ, Cambridge, MA 02138 USA
关键词
INDIVIDUATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present Language-mediated, Object-centric Representation Learning (LORL), a paradigm for learning disentangled, object-centric scene representations from vision and language. LORL builds upon recent advances in unsupervised object discovery and segmentation, notably MONet and Slot Attention. While these algorithms learn an object-centric representation just by reconstructing the input image, LORL enables them to further learn to associate the learned representations to concepts, i.e., words for object categories, properties, and spatial relationships, from language input. These object-centric concepts derived from language facilitate the learning of object-centric representations. LORL can be integrated with various unsupervised object discovery algorithms that are language-agnostic. Experiments show that the integration of LORL consistently improves the performance of unsupervised object discovery methods on two datasets via the help of language. We also show that concepts learned by LORL, in conjunction with object discovery methods, aid downstream tasks such as referring expression comprehension.
引用
下载
收藏
页码:2033 / 2046
页数:14
相关论文
共 50 条
  • [21] Learning Dexterous Grasping with Object-Centric Visual Affordances
    Mandikal, Priyanka
    Grauman, Kristen
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 6169 - 6176
  • [22] Object-Centric Slot Diffusion
    Jiang, Jindong
    Deng, Fei
    Singh, Gautam
    Ahn, Sungjin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [23] Learning and Sequencing of Object-Centric Manipulation Skills for Industrial Tasks
    Rozo, Leonel
    Guo, Meng
    Kupcsik, Andras G.
    Todescato, Marco
    Schillinger, Philipp
    Giftthaler, Markus
    Ochs, Matthias
    Spies, Markus
    Waniek, Nicolai
    Kesper, Patrick
    Buerger, Mathias
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 9072 - 9079
  • [24] Floating Waste Discovery by Request via Object-Centric Learning
    Fu, Bingfei
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (01): : 1407 - 1424
  • [25] Data-efficient learning of object-centric grasp preferences
    Fleytoux, Yoann
    Ma, Anji
    Ivaldi, Serena
    Mouret, Jean-Baptiste
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 6337 - 6343
  • [26] Learning object-centric complementary features for zero-shot learning
    Liu, Jie
    Song, Kechen
    He, Yu
    Dong, Hongwen
    Yan, Yunhui
    Meng, Qinggang
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 89
  • [27] Simple Unsupervised Object-Centric Learning for Complex and Naturalistic Videos
    Singh, Gautam
    Wu, Yi-Fu
    Ahn, Sungjin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [28] Unsupervised Object-Centric Learning From Multiple Unspecified Viewpoints
    Yuan, Jinyang
    Chen, Tonglin
    Shen, Zhimeng
    Li, Bin
    Xue, Xiangyang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (05) : 3897 - 3909
  • [29] Object-centric Learning with Cyclic Walks between Parts and Whole
    Wang, Ziyu
    Shou, Mike Zheng
    Zhang, Mengmi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [30] Dynamics Learning with Object-Centric Interaction Networks for Robot Manipulation
    Wang, Jiayu
    Hu, Chuxiong
    Wang, Yunan
    Zhu, Yu
    IEEE Access, 2021, 9 : 68277 - 68288