Semantics-Guided Latent Space Exploration for Shape Generation

被引：8

作者：

Jahan, Tansin ^{[1
]}

Guan, Yanran ^{[1
]}

van Kaick, Oliver ^{[1
]}

机构：

[1] Carleton Univ, Sch Comp Sci, Ottawa, ON, Canada

来源：

COMPUTER GRAPHICS FORUM | 2021年 / 40卷 / 02期

基金：

加拿大自然科学与工程研究理事会;

关键词：

CCS Concepts; circle Computing methodologies -> Shape modeling;

D O I：

10.1111/cgf.142619

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

We introduce an approach to incorporate user guidance into shape generation approaches based on deep networks. Generative networks such as autoencoders and generative adversarial networks are trained to encode shapes into latent vectors, effectively learning a latent shape space that can be sampled for generating new shapes. Our main idea is to enable users to explore the shape space with the use of high-level semantic keywords. Specifically, the user inputs a set of keywords that describe the general attributes of the shape to be generated, e.g., "four legs" for a chair. Then, our method maps the keywords to a subspace of the latent space, where the subspace captures the shapes possessing the specified attributes. The user then explores only this subspace to search for shapes that satisfy the design goal, in a process similar to using a parametric shape model. Our exploratory approach allows users to model shapes at a high level without the need for advanced artistic skills, in contrast to existing methods that allow to guide the generation with sketching or partial modeling of a shape. Our technical contribution to enable this exploration-based approach is the introduction of a label regression neural network coupled with shape encoder/decoder networks. The label regression network takes the user-provided keywords and maps them to distributions in the latent space. We show that our method allows users to explore the shape space and generate a variety of shapes with selected high-level attributes.

引用

页码：115 / 126

页数：12

共 50 条

[41] A benchmark dataset and semantics-guided detection network for spatial–temporal human actions in urban driving scenes
Zhong, Fujin
Wu, Yini
Yu, Hong
Wang, Guoyin
Lu, Zhantao
Pattern Recognition, 2025, 158
[42] Semi-supervised class-conditional image synthesis with Semantics-guided Adaptive Feature Transforms
Huo, Xiaoyang
Zhang, Yunfei
Wu, Si
PATTERN RECOGNITION, 2024, 146
[43] SIEGE: A Semantics-Guided Safety Enhancement Framework for AI-Enabled Cyber-Physical Systems
Song, Jiayang
Xie, Xuan
Ma, Lei
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (08) : 4058 - 4080
[44] 360ST-Mapping: An Online Semantics-Guided Topological Mapping Module for Omnidirectional Visual SLAM
Liu, Hongji
Huang, Huajian
Yeung, Sai-Kit
Liu, Ming
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 802 - 807
[45] SEMANTICS-GUIDED MULTI-LEVEL RGB-D FEATURE FUSION FOR INDOOR SEMANTIC SEGMENTATION
Li, Yabei
Zhang, Junge
Cheng, Yanhua
Huang, Kaiqi
Tan, Tieniu
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1262 - 1266
[46] Guided Test Case Generation Through AI Enabled Output Space Exploration
Budnik, Christof
Gario, Marco
Markov, Georgi
Wang, Zhu
2018 IEEE/ACM 13TH INTERNATIONAL WORKSHOP ON AUTOMATION OF SOFTWARE TEST (AST), 2018, : 53 - 56
[47] SwipeGANSpace: Swipe-to-Compare Image Generation via Efficient Latent Space Exploration
Nakashima, Yuto
Yang, Mingzhe
Baba, Yukino
PROCEEDINGS OF 2024 29TH ANNUAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2024, 2024, : 675 - 685
[48] Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures
Metzer, Gal
Richardson, Elad
Patashnik, Or
Giryes, Raja
Cohen-Or, Daniel
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12663 - 12673
[49] Learning geometry-aware joint latent space for simultaneous multimodal shape generation
Komarichev, Artem
Hua, Jing
Zhong, Zichun
COMPUTER AIDED GEOMETRIC DESIGN, 2022, 93
[50] Alignment-Free RGBT Salient Object Detection: Semantics-Guided Asymmetric Correlation Network and a Unified Benchmark
Wang K.
Lin D.
Li C.
Tu Z.
Luo B.
IEEE Transactions on Multimedia, 2024, 26 : 1 - 16

← 1 2 3 4 5 →