Semantics-Guided Latent Space Exploration for Shape Generation

被引:8
|
作者
Jahan, Tansin [1 ]
Guan, Yanran [1 ]
van Kaick, Oliver [1 ]
机构
[1] Carleton Univ, Sch Comp Sci, Ottawa, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
CCS Concepts; circle Computing methodologies -> Shape modeling;
D O I
10.1111/cgf.142619
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We introduce an approach to incorporate user guidance into shape generation approaches based on deep networks. Generative networks such as autoencoders and generative adversarial networks are trained to encode shapes into latent vectors, effectively learning a latent shape space that can be sampled for generating new shapes. Our main idea is to enable users to explore the shape space with the use of high-level semantic keywords. Specifically, the user inputs a set of keywords that describe the general attributes of the shape to be generated, e.g., "four legs" for a chair. Then, our method maps the keywords to a subspace of the latent space, where the subspace captures the shapes possessing the specified attributes. The user then explores only this subspace to search for shapes that satisfy the design goal, in a process similar to using a parametric shape model. Our exploratory approach allows users to model shapes at a high level without the need for advanced artistic skills, in contrast to existing methods that allow to guide the generation with sketching or partial modeling of a shape. Our technical contribution to enable this exploration-based approach is the introduction of a label regression neural network coupled with shape encoder/decoder networks. The label regression network takes the user-provided keywords and maps them to distributions in the latent space. We show that our method allows users to explore the shape space and generate a variety of shapes with selected high-level attributes.
引用
收藏
页码:115 / 126
页数:12
相关论文
共 50 条
  • [41] A benchmark dataset and semantics-guided detection network for spatial–temporal human actions in urban driving scenes
    Zhong, Fujin
    Wu, Yini
    Yu, Hong
    Wang, Guoyin
    Lu, Zhantao
    Pattern Recognition, 2025, 158
  • [42] Semi-supervised class-conditional image synthesis with Semantics-guided Adaptive Feature Transforms
    Huo, Xiaoyang
    Zhang, Yunfei
    Wu, Si
    PATTERN RECOGNITION, 2024, 146
  • [43] SIEGE: A Semantics-Guided Safety Enhancement Framework for AI-Enabled Cyber-Physical Systems
    Song, Jiayang
    Xie, Xuan
    Ma, Lei
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (08) : 4058 - 4080
  • [44] 360ST-Mapping: An Online Semantics-Guided Topological Mapping Module for Omnidirectional Visual SLAM
    Liu, Hongji
    Huang, Huajian
    Yeung, Sai-Kit
    Liu, Ming
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 802 - 807
  • [45] SEMANTICS-GUIDED MULTI-LEVEL RGB-D FEATURE FUSION FOR INDOOR SEMANTIC SEGMENTATION
    Li, Yabei
    Zhang, Junge
    Cheng, Yanhua
    Huang, Kaiqi
    Tan, Tieniu
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1262 - 1266
  • [46] Guided Test Case Generation Through AI Enabled Output Space Exploration
    Budnik, Christof
    Gario, Marco
    Markov, Georgi
    Wang, Zhu
    2018 IEEE/ACM 13TH INTERNATIONAL WORKSHOP ON AUTOMATION OF SOFTWARE TEST (AST), 2018, : 53 - 56
  • [47] SwipeGANSpace: Swipe-to-Compare Image Generation via Efficient Latent Space Exploration
    Nakashima, Yuto
    Yang, Mingzhe
    Baba, Yukino
    PROCEEDINGS OF 2024 29TH ANNUAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2024, 2024, : 675 - 685
  • [48] Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures
    Metzer, Gal
    Richardson, Elad
    Patashnik, Or
    Giryes, Raja
    Cohen-Or, Daniel
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12663 - 12673
  • [49] Learning geometry-aware joint latent space for simultaneous multimodal shape generation
    Komarichev, Artem
    Hua, Jing
    Zhong, Zichun
    COMPUTER AIDED GEOMETRIC DESIGN, 2022, 93
  • [50] Alignment-Free RGBT Salient Object Detection: Semantics-Guided Asymmetric Correlation Network and a Unified Benchmark
    Wang K.
    Lin D.
    Li C.
    Tu Z.
    Luo B.
    IEEE Transactions on Multimedia, 2024, 26 : 1 - 16