Multi-Object Representation Learning via Feature Connectivity and Object-Centric Regularization

被引:0
|
作者
Foo, Alex [1 ]
Hsu, Wynne [1 ]
Lee, Mong Li [1 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore, Singapore
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discovering object-centric representations from images has the potential to greatly improve the robustness, sample efficiency and interpretability of machine learning algorithms. Current works on multi-object images typically follow a generative approach that optimizes for input reconstruction and fail to scale to real-world datasets despite significant increases in model capacity. We address this limitation by proposing a novel method that leverages feature connectivity to cluster neighboring pixels likely to belong to the same object. We further design two object-centric regularization terms to refine object representations in the latent space, enabling our approach to scale to complex real-world images. Experimental results on simulated, real-world, complex texture and common object images demonstrate a substantial improvement in the quality of discovered objects compared to state-of-the-art methods, as well as the sample efficiency and generalizability of our approach. We also show that the discovered object-centric representations can accurately predict key object properties in downstream tasks, highlighting the potential of our method to advance the field of multi-object representation learning.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Multi-Object Manipulation via Object-Centric Neural Scattering Functions
    Tian, Stephen
    Cai, Yancheng
    Yu, Hong-Xing
    Zakharov, Sergey
    Liu, Katherine
    Gaidon, Adrien
    Li, Yunzhu
    Wu, Jiajun
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9021 - 9031
  • [2] Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views
    Nanbo, Li
    Eastwood, Cian
    Fisher, Robert B.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [3] Object-Centric Representation Learning for Video Scene Understanding
    Zhou, Yi
    Zhang, Hui
    Park, Seung-In
    Yoo, ByungIn
    Qi, Xiaojuan
    [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 46 (12) : 8410 - 8423
  • [4] Language-Mediated, Object-Centric Representation Learning
    Wang, Ruocheng
    Mao, Jiayuan
    Gershman, Samuel J.
    Wu, Jiajun
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 2033 - 2046
  • [5] Object-Centric Representation Learning from Unlabeled Videos
    Gao, Ruohan
    Jayaraman, Dinesh
    Grauman, Kristen
    [J]. COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 248 - 263
  • [6] Object-Centric Representation Learning for Video Question Answering
    Long Hoang Dang
    Thao Minh Le
    Vuong Le
    Truyen Tran
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [7] OCVOS: OBJECT-CENTRIC REPRESENTATION FOR VIDEO OBJECT SEGMENTATION
    Jo, Junho
    Wee, Dongyoon
    Cho, Nam Ik
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1655 - 1659
  • [8] Semantic Tracklets: An Object-Centric Representation for Visual Multi-Agent Reinforcement Learning
    Liu, Iou-Jen
    Ren, Zhongzheng
    Yeh, Raymond A.
    Schwing, Alexander G.
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 5603 - 5610
  • [9] Is an Object-Centric Video Representation Beneficial for Transfer?
    Zhang, Chuhan
    Gupta, Ankush
    Zisserman, Andrew
    [J]. COMPUTER VISION - ACCV 2022, PT IV, 2023, 13844 : 379 - 397
  • [10] Floating Waste Discovery by Request via Object-Centric Learning
    Fu, Bingfei
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (01): : 1407 - 1424