Diffusion for Natural Image Matting

被引:0
|
作者
Hu, Yihan [1 ,2 ,5 ]
Lin, Yiheng [1 ,2 ]
Wang, Wei [1 ,2 ]
Zhao, Yao [1 ,2 ,3 ]
Wei, Yunchao [1 ,2 ,3 ]
Shi, Humphrey [4 ,5 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
[2] Minist Educ, Visual Intelligence X Int Joint Lab, Beijing, Peoples R China
[3] Pengcheng Lab, Shenzhen, Peoples R China
[4] Georgia Inst Technol, Atlanta, GA 30332 USA
[5] Picsart AI Res PAIR, Atlanta, GA USA
来源
关键词
Image matting; Diffusion process; Iterative refinement;
D O I
10.1007/978-3-031-72998-0_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing natural image matting algorithms inevitably have flaws in their predictions on difficult cases, and their one-step prediction manner cannot further correct these errors. In this paper, we investigate a multi-step iterative approach for the first time to tackle the challenging natural image matting task, and achieve excellent performance by introducing a pixel-level denoising diffusion method (DiffMatte) for the alpha matte refinement. To improve iteration efficiency, we design a lightweight diffusion decoder as the only iterative component to directly denoise the alpha matte, saving the huge computational overhead of repeatedly encoding matting features. We also propose an ameliorated self-aligned strategy to consolidate the performance gains brought about by the iterative diffusion process. This allows the model to adapt to various types of errors by aligning the noisy samples used in training and inference, mitigating performance degradation caused by sampling drift. Extensive experimental results demonstrate that DiffMatte not only reaches the state-of-the-art level on the mainstream Composition-1k test set, surpassing the previous best methods by 8% and 15% in the SAD metric and MSE metric respectively, but also show stronger generalization ability in other benchmarks. The code will be open-sourced for the following research and applications. Code is available at https://github.com/YihanHu-2022/DiffMatte.
引用
收藏
页码:181 / 199
页数:19
相关论文
共 50 条
  • [21] Cross Depth Image Filter-based Natural Image Matting
    Li, Yujie
    Lu, Huimin
    Zhang, Lifeng
    Serikawa, Seiichi
    2013 14TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2013), 2013, : 601 - 604
  • [22] Automatic framework for high-efficient natural image matting
    He, Fazhi
    Wu, Yue
    Zhang, Dengyi
    Huang, Zhiyong
    Wei, Lingyun
    Xiao, Chunxia
    MIPPR 2007: MULTISPECTRAL IMAGE PROCESSING, 2007, 6787
  • [23] A fast approach for natural image matting using structure information
    Ning, Qianhui
    Wang, Weiqiang
    Zhu, Caifeng
    Qing, Laiyun
    Huang, Qingming
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 1399 - 1402
  • [24] Long-Range Feature Propagating for Natural Image Matting
    Liu, Qinglin
    Xie, Haozhe
    Zhang, Shengping
    Zhong, Bineng
    Ji, Rongrong
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 526 - 534
  • [25] Effective Local-Global Transformer for Natural Image Matting
    Hu, Liangpeng
    Kong, Yating
    Li, Jide
    Li, Xiaoqiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 3888 - 3898
  • [26] Natural Image Matting Using Deep Convolutional Neural Networks
    Cho, Donghyeon
    Tai, Yu-Wing
    Kweon, Inso
    COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 : 626 - 643
  • [27] Natural image matting with non-negative matrix factorization
    Wang, K
    Zheng, NN
    Liu, WX
    2005 International Conference on Image Processing (ICIP), Vols 1-5, 2005, : 1285 - 1288
  • [28] A Survey on Natural Image Matting With Closed-Form Solutions
    Li, Xiaoqiang
    Li, Jide
    Lu, Hong
    IEEE ACCESS, 2019, 7 : 136658 - 136675
  • [29] NATURAL IMAGE MATTING FOR MULTIPLE WIDE-BASELINE VIEWS
    Sarim, Muhammad
    Hilton, Adrian
    Guillemaut, Jean-Yves
    Takai, Takeshi
    Kim, Hansung
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 2233 - 2236
  • [30] NATURAL IMAGE MATTING WITH SHIFTED WINDOW SELF-ATTENTION
    Wang, Zhikun
    Liu, Yang
    Li, Zonglin
    Wang, Chenyang
    Zhang, Shengping
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2911 - 2915