GSEditPro: 3D Gaussian Splatting Editing with Attention-based Progressive Localization

Citations: 0
Authors
Sun, Y. [1]
Tian, R. [1]
Han, X. [1]
Liu, X. [2]
Zhang, Y. [1]
Xu, K. [2]
Affiliations
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Natl Univ Def Technol, Changsha, Peoples R China
Keywords
Semantics;
DOI
10.1111/cgf.15215
Chinese Library Classification (CLC)
TP31 [Computer Software]
Discipline Classification Codes
081202; 0835
Abstract
With the emergence of large-scale Text-to-Image (T2I) models and implicit 3D representations like Neural Radiance Fields (NeRF), many text-driven generative editing methods based on NeRF have appeared. However, the implicit encoding of geometric and textural information makes it difficult to locate and control objects accurately during editing. Recently, significant advances have been made in editing methods for 3D Gaussian Splatting, a real-time rendering technology that relies on an explicit representation. However, these methods still suffer from inaccurate localization and limited control over edits. To tackle these challenges, we propose GSEditPro, a novel 3D scene editing framework that allows users to perform a variety of creative and precise edits using text prompts only. Leveraging the explicit nature of the 3D Gaussian distribution, we introduce an attention-based progressive localization module that adds semantic labels to each Gaussian during rendering. This enables precise localization of editing areas by classifying Gaussians based on their relevance to the editing prompts, as derived from the cross-attention layers of the T2I model. Furthermore, we present an innovative editing optimization method based on 3D Gaussian Splatting, which obtains stable and refined editing results through the guidance of Score Distillation Sampling and pseudo ground truth. We demonstrate the efficacy of our method through extensive experiments.
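The localization idea in the abstract (labeling each Gaussian by its relevance to the prompt, using attention maps) can be illustrated with a toy sketch. This is not the authors' implementation: the data layout (`View`), the function `label_gaussians`, the weighted-mean relevance score, and the fixed `threshold` are all assumptions made for illustration. The sketch supposes that, for every rendered pixel, the blending weight of each contributing Gaussian is known, and that a per-pixel attention value for the edit prompt is available.

```python
from typing import List, Tuple

# One view: a flattened attention map plus, for every pixel, the Gaussians
# blended into that pixel and their blending weights (hypothetical layout).
View = Tuple[List[float], List[List[Tuple[int, float]]]]


def label_gaussians(views: List[View], num_gaussians: int,
                    threshold: float = 0.5) -> List[bool]:
    """Mark Gaussians whose weighted mean attention exceeds `threshold`."""
    attn_sum = [0.0] * num_gaussians    # accumulated weight * attention
    weight_sum = [0.0] * num_gaussians  # accumulated blending weight

    for attention, contributions in views:
        for pixel_attn, pixel_contribs in zip(attention, contributions):
            for gid, weight in pixel_contribs:
                attn_sum[gid] += weight * pixel_attn
                weight_sum[gid] += weight

    labels = []
    for a, w in zip(attn_sum, weight_sum):
        # Gaussians never seen in any view get relevance 0 and stay unlabeled.
        relevance = a / w if w > 0 else 0.0
        labels.append(relevance > threshold)
    return labels


# Two toy views of a scene with three Gaussians (made-up numbers).
views = [
    ([0.9, 0.1], [[(0, 0.8), (1, 0.2)], [(1, 0.7)]]),
    ([0.8, 0.2], [[(0, 1.0)], [(1, 0.5), (2, 0.5)]]),
]
labels = label_gaussians(views, num_gaussians=3)
```

In this toy scene only Gaussian 0 sits mostly under high-attention pixels, so it alone would be selected for editing. Accumulating over several views, rather than thresholding a single attention map, is one plausible reading of "progressive" localization; the paper itself should be consulted for the actual procedure.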
Pages: 12