INSTANT3D: FAST TEXT-TO-3D WITH SPARSE-VIEW GENERATION AND LARGE RECONSTRUCTION MODEL

Cited by: 0
Authors
Li, Jiahao [1, 2]
Tan, Hao [1]
Zhang, Kai [1]
Xu, Zexiang [1]
Luan, Fujun [1]
Xu, Yinghao [1, 3]
Hong, Yicong [1, 4]
Sunkavalli, Kalyan [1]
Shakhnarovich, Greg [2]
Bi, Sai [1]
Affiliations
[1] Adobe Research
[2] TTIC, United States
[3] Stanford University, United States
[4] Australian National University, Australia
Source
arXiv | 2023
Keywords
Diffusion - HTTP - Image reconstruction - Three dimensional computer graphics
(Compilation and indexing terms; Copyright 2024 Elsevier Inc.)
DOI
Not available
Related papers
50 in total
  • [31] PI3D: Efficient Text-to-3D Generation with Pseudo-Image Diffusion
    Liu, Ying-Tian
    Guo, Yuan-Chen
    Luo, Guan
    Sun, Heyi
    Yin, Wei
    Zhang, Song-Hai
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 19915 - 19924
  • [32] GradeADreamer: Enhanced Text-to-3D Generation Using Gaussian Splatting and Multi-View Diffusion
    Ukarapol, Trapoom
    Pruvost, Kevin
    arXiv,
  • [33] An FPGA Accelerator for 3D Cone-beam Sparse-view Computed Tomography Reconstruction
    Gu, Yuhan
    Wu, Qing
    Yuan, Zhechen
    Zhang, Xiangyu
    Su, Wenyan
    Zhang, Yuyao
    Lou, Xin
    2024 IEEE 6TH INTERNATIONAL CONFERENCE ON AI CIRCUITS AND SYSTEMS, AICAS 2024, 2024, : 577 - 581
  • [34] Sparse-view planar 3D reconstruction method based on hierarchical token pooling Transformer
    Zhang, Jiahui
    Yang, Jinfu
    Fu, Fuji
    Ma, Jiaqi
    APPLIED SOFT COMPUTING, 2025, 174
  • [35] Direct2.5: Diverse Text-to-3D Generation via Multi-view 2.5D Diffusion
    Lu, Yuanxun
    Zhang, Jingyang
    Li, Shiwei
    Fang, Tian
    McKinnon, David
    Tsin, Yanghai
    Quan, Long
    Cao, Xun
    Yao, Yao
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 8744 - 8753
  • [36] Hyper-3DG: Text-to-3D Gaussian Generation via Hypergraph
    Donglin Di
    Jiahui Yang
    Chaofan Luo
    Zhou Xue
    Wei Chen
    Xun Yang
    Yue Gao
    International Journal of Computer Vision, 2025, 133 (5) : 2886 - 2909
  • [37] Learning 3D Gaussians for Extremely Sparse-View Cone-Beam CT Reconstruction
    Lin, Yiqun
    Wang, Hualiang
    Chen, Jixiang
    Li, Xiaomeng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VII, 2024, 15007 : 425 - 435
  • [38] DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior
    Huang, Tianyu
    Zeng, Yihan
    Zhang, Zhilu
    Xu, Wan
    Xu, Hang
    Xu, Songcen
    Lau, Rynson W.H.
    Zuo, Wangmeng
    arXiv, 2023
  • [39] ATT3D: Amortized Text-to-3D Object Synthesis
    Lorraine, Jonathan
    Xie, Kevin
    Zeng, Xiaohui
    Lin, Chen-Hsuan
    Takikawa, Towaki
    Sharp, Nicholas
    Lin, Tsung-Yi
    Liu, Ming-Yu
    Fidler, Sanja
    Lucas, James
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17900 - 17910
  • [40] DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation
    Yang, Haibo
    Chen, Yang
    Pan, Yingwei
    Yao, Ting
    Chen, Zhineng
    Wu, Zuxuan
    Jiang, Yu-Gang
    Mei, Tao
    COMPUTER VISION - ECCV 2024, PT LIX, 2025, 15117 : 162 - 178