Diffusion for Natural Image Matting

被引:0
|
作者
Hu, Yihan [1 ,2 ,5 ]
Lin, Yiheng [1 ,2 ]
Wang, Wei [1 ,2 ]
Zhao, Yao [1 ,2 ,3 ]
Wei, Yunchao [1 ,2 ,3 ]
Shi, Humphrey [4 ,5 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
[2] Minist Educ, Visual Intelligence X Int Joint Lab, Beijing, Peoples R China
[3] Pengcheng Lab, Shenzhen, Peoples R China
[4] Georgia Inst Technol, Atlanta, GA 30332 USA
[5] Picsart AI Res PAIR, Atlanta, GA USA
来源
关键词
Image matting; Diffusion process; Iterative refinement;
D O I
10.1007/978-3-031-72998-0_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing natural image matting algorithms inevitably have flaws in their predictions on difficult cases, and their one-step prediction manner cannot further correct these errors. In this paper, we investigate a multi-step iterative approach for the first time to tackle the challenging natural image matting task, and achieve excellent performance by introducing a pixel-level denoising diffusion method (DiffMatte) for the alpha matte refinement. To improve iteration efficiency, we design a lightweight diffusion decoder as the only iterative component to directly denoise the alpha matte, saving the huge computational overhead of repeatedly encoding matting features. We also propose an ameliorated self-aligned strategy to consolidate the performance gains brought about by the iterative diffusion process. This allows the model to adapt to various types of errors by aligning the noisy samples used in training and inference, mitigating performance degradation caused by sampling drift. Extensive experimental results demonstrate that DiffMatte not only reaches the state-of-the-art level on the mainstream Composition-1k test set, surpassing the previous best methods by 8% and 15% in the SAD metric and MSE metric respectively, but also show stronger generalization ability in other benchmarks. The code will be open-sourced for the following research and applications. Code is available at https://github.com/YihanHu-2022/DiffMatte.
引用
收藏
页码:181 / 199
页数:19
相关论文
共 50 条
  • [41] Natural shadow matting
    Wu, Tai-Pang
    Tang, Chi-Keung
    Brown, Michael S.
    Shum, Heung-Yeung
    ACM TRANSACTIONS ON GRAPHICS, 2007, 26 (02):
  • [42] A Markov random field model-based approach to natural image matting
    Lin, Sheng-You
    Shi, Jiao-Ying
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2007, 22 (01) : 161 - 167
  • [43] Trimap-guided feature mining and fusion network for natural image matting
    Jiang, Weihao
    Yu, Dongdong
    Xie, Zhaozhi
    Li, Yaoyi
    Yuan, Zehuan
    Lu, Hongtao
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 230
  • [44] Designing Effective Inter-Pixel Information Flow for Natural Image Matting
    Aksoy, Yagiz
    Aydin, Tunc Ozan
    Pollefeys, Marc
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 228 - 236
  • [45] Unsupervised and reliable image matting based on modified spectral matting
    Hu, Wu-Chih
    Jhu, Jia-Jie
    Lin, Cheng-Pin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2012, 23 (04) : 665 - 676
  • [46] A Markov Random Field Model-Based Approach to Natural Image Matting
    Sheng-You Lin
    Jiao-Ying Shi
    Journal of Computer Science and Technology, 2007, 22 : 161 - 167
  • [47] A Survey on Image Matting Techniques
    Boda, Jagruti
    Pandya, Dhatri
    PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 765 - 770
  • [48] Easy matting - A stroke based approach for continuous image matting
    Guan, Yu
    Chen, Wei
    Liang, Xiao
    Ding, Zi'ang
    Peng, Qunsheng
    COMPUTER GRAPHICS FORUM, 2006, 25 (03) : 567 - 576
  • [49] Natural Matting for Degraded Pictures
    Prabhu, Sahana M.
    Rajagopalan, A. N.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (12) : 3647 - 3653
  • [50] Image and Video Matting: A Survey
    Wang, Jue
    Cohen, Michael F.
    FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2007, 3 (02): : 97 - 180