Diffusion for Natural Image Matting

被引:0
|
作者
Hu, Yihan [1 ,2 ,5 ]
Lin, Yiheng [1 ,2 ]
Wang, Wei [1 ,2 ]
Zhao, Yao [1 ,2 ,3 ]
Wei, Yunchao [1 ,2 ,3 ]
Shi, Humphrey [4 ,5 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
[2] Minist Educ, Visual Intelligence X Int Joint Lab, Beijing, Peoples R China
[3] Pengcheng Lab, Shenzhen, Peoples R China
[4] Georgia Inst Technol, Atlanta, GA 30332 USA
[5] Picsart AI Res PAIR, Atlanta, GA USA
来源
关键词
Image matting; Diffusion process; Iterative refinement;
D O I
10.1007/978-3-031-72998-0_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing natural image matting algorithms inevitably have flaws in their predictions on difficult cases, and their one-step prediction manner cannot further correct these errors. In this paper, we investigate a multi-step iterative approach for the first time to tackle the challenging natural image matting task, and achieve excellent performance by introducing a pixel-level denoising diffusion method (DiffMatte) for the alpha matte refinement. To improve iteration efficiency, we design a lightweight diffusion decoder as the only iterative component to directly denoise the alpha matte, saving the huge computational overhead of repeatedly encoding matting features. We also propose an ameliorated self-aligned strategy to consolidate the performance gains brought about by the iterative diffusion process. This allows the model to adapt to various types of errors by aligning the noisy samples used in training and inference, mitigating performance degradation caused by sampling drift. Extensive experimental results demonstrate that DiffMatte not only reaches the state-of-the-art level on the mainstream Composition-1k test set, surpassing the previous best methods by 8% and 15% in the SAD metric and MSE metric respectively, but also show stronger generalization ability in other benchmarks. The code will be open-sourced for the following research and applications. Code is available at https://github.com/YihanHu-2022/DiffMatte.
引用
收藏
页码:181 / 199
页数:19
相关论文
共 50 条
  • [31] Targeting Accurate Object Extraction From an Image: A Comprehensive Study of Natural Image Matting
    Zhu, Qingsong
    Shao, Ling
    Li, Xuelong
    Wang, Lei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (02) : 185 - 207
  • [32] NATURAL IMAGE MATTING VIA ADAPTIVE LOCAL AND NONLOCAL SAMPLE CLUSTERING
    Yang, Haiyan
    Au, Oscar C.
    Yuan, Yuan
    Sun, Wenxiu
    Ling, Yonggen
    Pang, Jiahao
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 3322 - 3326
  • [33] Natural Image Matting with Low-Level Feature Attention Guidance
    Jiang, Hang
    Wu, Song
    He, Dehong
    Xiao, Guoqiang
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2022, PT III, 2022, 13370 : 550 - 561
  • [34] Deep Image Matting
    Xu, Ning
    Price, Brian
    Cohen, Scott
    Huang, Thomas
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 311 - 320
  • [35] Referring Image Matting
    Li, Jizhizi
    Zhang, Jing
    Tao, Dacheng
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22448 - 22457
  • [36] Disentangled Image Matting
    Cai, Shaofan
    Zhang, Xiaoshuai
    Fan, Haoqiang
    Huang, Haibin
    Liu, Jiangyu
    Liu, Jiaming
    Liu, Jiaying
    Wang, Jue
    Sun, Jian
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8818 - 8827
  • [37] Sampling Propagation Attention With Trimap Generation Network for Natural Image Matting
    Zhou, Yuhongze
    Zhou, Liguang
    Lam, Tin Lun
    Xu, Yangsheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 5828 - 5843
  • [38] Ant Colony Alpha Matte: A New Approach for Natural Image Matting
    Soleimani, Vahid
    Vincheh, Farnoosh Heidari
    2013 8TH IRANIAN CONFERENCE ON MACHINE VISION & IMAGE PROCESSING (MVIP 2013), 2013, : 169 - 174
  • [39] Matte anything: Interactive natural image matting with segment anything model
    Yao, Jingfeng
    Wang, Xinggang
    Ye, Lang
    Liu, Wenyu
    IMAGE AND VISION COMPUTING, 2024, 147
  • [40] Semantic Image Matting
    Sun, Yanan
    Tang, Chi-Keung
    Tai, Yu-Wing
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11115 - 11124