Generative 3D Part Assembly via Dynamic Graph Learning

被引:0
|
作者
Huang, Jialei [3 ]
Zhan, Guanqi [3 ]
Fan, Qingnan
Mo, Kaichun [1 ]
Shao, Lin [1 ]
Chen, Baoquan [2 ]
Guibas, Leonidas [1 ]
Dong, Hao [2 ]
机构
[1] Stanford Univ, Stanford, CA USA
[2] Peking Univ, Peng Cheng Lab, CFCS CS Dept, AIIT, Beijing, Peoples R China
[3] Peking Univ, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Autonomous part assembly is a challenging yet crucial task in 3D computer vision and robotics. Analogous to buying an IKEA furniture, given a set of 3D parts that can assemble a single shape, an intelligent agent needs to perceive the 3D part geometry, reason to propose pose estimations for the input parts, and finally call robotic planning and control routines for actuation. In this paper, we focus on the pose estimation subproblem from the vision side involving geometric and relational reasoning over the input part geometry. Essentially, the task of generative 3D part assembly is to predict a 6-DoF part pose, including a rigid rotation and translation, for each input part that assembles a single 3D shape as the final output. To tackle this problem, we propose an assembly-oriented dynamic graph learning framework that leverages an iterative graph neural network as a backbone. It explicitly conducts sequential part assembly refinements in a coarse-to-fine manner, exploits a pair of part relation reasoning module and part aggregation module for dynamically adjusting both part features and their relations in the part graph. We conduct extensive experiments and quantitative comparisons to three strong baseline methods, demonstrating the effectiveness of the proposed approach.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] SinGRAF: Learning a 3D Generative Radiance Field for a Single Scene
    Son, Minjung
    Park, Jeong Joon
    Guihas, Leonidas
    Wetzstein, Gordon
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8507 - 8517
  • [32] Fine Detailed Texture Learning for 3D Meshes With Generative Models
    Dundar, Aysegul
    Gao, Jun
    Tao, Andrew
    Catanzaro, Bryan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 14563 - 14574
  • [33] SFGAN: Unsupervised Generative Adversarial Learning of 3D Scene Flow from the 3D Scene Self
    Wang, Guangming
    Jiang, Chaokang
    Shen, Zehang
    Miao, Yanzi
    Wang, Hesheng
    ADVANCED INTELLIGENT SYSTEMS, 2022, 4 (04)
  • [34] 3D based generative PROTAC linker design with reinforcement learning
    Li, Baiqing
    Ran, Ting
    Chen, Hongming
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)
  • [35] 3D Human Gesture Matching Via Graph Cut
    Guo, Tianchu
    Wu, Xiaoyu
    2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), VOLS 1-3, 2013, : 675 - 679
  • [36] Explicit 3D reconstruction from images with dynamic graph learning and rendering-guided diffusion
    Wu, Di
    Zhou, Linli
    Li, Jincheng
    Xiong, Jianqiao
    Song, Liangtu
    NEUROCOMPUTING, 2024, 601
  • [37] Learning part-in-whole relation of 3D shapes for part-based 3D model retrieval
    Furuya, Takahiko
    Ohbuchi, Ryutarou
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 166 : 102 - 114
  • [38] Improving 3D Human Pose Estimation via 3D Part Affinity Fields
    Liu, Ding
    Zhao, Zixu
    Wang, Xinchao
    Hu, Yuxiao
    Zhang, Lei
    Huang, Thomas S.
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1004 - 1013
  • [39] Topological and geometrical joint learning for 3D graph data
    Han, Li
    Lan, Pengyan
    Shi, Xue
    Wang, Xiaomin
    He, Jinhai
    Li, Genyu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (10) : 15457 - 15474
  • [40] Topological and geometrical joint learning for 3D graph data
    Li Han
    Pengyan Lan
    Xue Shi
    Xiaomin Wang
    Jinhai He
    Genyu Li
    Multimedia Tools and Applications, 2023, 82 : 15457 - 15474