Diffusion for Natural Image Matting

被引:0
|
作者
Hu, Yihan [1 ,2 ,5 ]
Lin, Yiheng [1 ,2 ]
Wang, Wei [1 ,2 ]
Zhao, Yao [1 ,2 ,3 ]
Wei, Yunchao [1 ,2 ,3 ]
Shi, Humphrey [4 ,5 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
[2] Minist Educ, Visual Intelligence X Int Joint Lab, Beijing, Peoples R China
[3] Pengcheng Lab, Shenzhen, Peoples R China
[4] Georgia Inst Technol, Atlanta, GA 30332 USA
[5] Picsart AI Res PAIR, Atlanta, GA USA
来源
关键词
Image matting; Diffusion process; Iterative refinement;
D O I
10.1007/978-3-031-72998-0_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing natural image matting algorithms inevitably have flaws in their predictions on difficult cases, and their one-step prediction manner cannot further correct these errors. In this paper, we investigate a multi-step iterative approach for the first time to tackle the challenging natural image matting task, and achieve excellent performance by introducing a pixel-level denoising diffusion method (DiffMatte) for the alpha matte refinement. To improve iteration efficiency, we design a lightweight diffusion decoder as the only iterative component to directly denoise the alpha matte, saving the huge computational overhead of repeatedly encoding matting features. We also propose an ameliorated self-aligned strategy to consolidate the performance gains brought about by the iterative diffusion process. This allows the model to adapt to various types of errors by aligning the noisy samples used in training and inference, mitigating performance degradation caused by sampling drift. Extensive experimental results demonstrate that DiffMatte not only reaches the state-of-the-art level on the mainstream Composition-1k test set, surpassing the previous best methods by 8% and 15% in the SAD metric and MSE metric respectively, but also show stronger generalization ability in other benchmarks. The code will be open-sourced for the following research and applications. Code is available at https://github.com/YihanHu-2022/DiffMatte.
引用
收藏
页码:181 / 199
页数:19
相关论文
共 50 条
  • [1] Natural image and video matting
    Abhilash, R.
    ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL IV, PROCEEDINGS, 2007, : 471 - 477
  • [2] Deep Automatic Natural Image Matting
    Li, Jizhizi
    Zhang, Jing
    Tao, Dacheng
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 800 - 806
  • [3] Natural image matting based on surrogate model
    Liang, Yihui
    Gou, Hongshan
    Feng, Fujian
    Liu, Guisong
    Huang, Han
    APPLIED SOFT COMPUTING, 2023, 143
  • [4] Natural Image Matting with Total Variation Regularisation
    Tierney, Stephen
    Gao, Junbin
    2012 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING TECHNIQUES AND APPLICATIONS (DICTA), 2012,
  • [5] Natural Image Matting with Attended Global Context
    Yi-Yi Zhang
    Li Niu
    Yasushi Makihara
    Jian-Fu Zhang
    Wei-Jie Zhao
    Yasushi Yagi
    Li-Qing Zhang
    Journal of Computer Science and Technology, 2023, 38 : 659 - 673
  • [6] New Appearance Models for Natural Image Matting
    Singaraju, Dheeraj
    Rother, Carsten
    Rhemann, Christoph
    CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 659 - +
  • [7] Color subspace exploring for natural image matting
    Kong, Yating
    Li, Jide
    Hu, Liangpeng
    Li, Xiaoqiang
    IET IMAGE PROCESSING, 2024, 18 (09) : 2244 - 2256
  • [8] Tensorial Evolutionary Optimization for Natural Image Matting
    Lei, Si-Chao
    Gong, Yue-Jiao
    Xiao, Xiao-Lin
    Zhou, Yi-Cong
    Zhang, Jun
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
  • [9] Local learning approach for natural image matting
    Peng, Hong-Jing
    Chen, Song-Can
    Zhang, Dao-Qiang
    Ruan Jian Xue Bao/Journal of Software, 2009, 20 (04): : 834 - 844
  • [10] Natural Image Matting Using HSI Framework
    Khandelwal, Vineet
    Gupta, Abhinav
    Kashyap, Manish
    Gandhi, Hitesh
    Dhawan, Aishwar
    2009 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 2, 2009, : 141 - 144