FormerUnify: Transformer-Based Unified Fusion for Efficient Image Matting

被引:0
|
作者
Wang, Jiaquan [1 ]
机构
[1] Shanghai Univ, Shanghai 200444, Peoples R China
关键词
Image matting; Transformer; Feature pyramid; Unified fusion;
D O I
10.1007/978-981-97-8685-5_29
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, deep learning-based methods in the field of image matting have incorporated additional modules and complex network structures to capture more comprehensive image information, thereby achieving higher accuracy. However, these innovations inevitably result in a decrement of inference speed and higher computational resource consumption. In this paper, we propose a Transformer-based unified fusion network for image matting, denoted as FormerUnify. Compared to existing methods, it is able to achieve a more optimal balance between accuracy and efficiency. FormerUnify is built upon the classic encoder-decoder framework, with its centerpiece being the Unified Fusion Decoder. This decoder is composed of three essential layers: unify layer, fusion layer, and upsampling prediction head, all of which work in concert to unify and fuse the rich multi-scale features extracted by the encoder effectively. Furthermore, we couple the Unified Fusion Decoder with an advanced Transformer-based encoder, and optimize their integration to enhance their compatibility and performance. Experimental evaluations on two synthetic datasets (Composition-1K and Distinctions-646) and an real-world dataset (AIM-500) affirm that FormerUnify achieves rapid inference speed without compromising its superior accuracy.
引用
收藏
页码:412 / 425
页数:14
相关论文
共 50 条
  • [1] ProxyMatting: Transformer-based image matting via region proxy
    Li, Jide
    Yang, Kequan
    Wu, Yuanchen
    Ye, Xichen
    Yang, Hanqi
    Li, Xiaoqiang
    KNOWLEDGE-BASED SYSTEMS, 2025, 310
  • [2] MatteFormer: Transformer-Based Image Matting via Prior-Tokens
    Park, GyuTae
    Son, SungJoon
    Yoo, JaeYoung
    Kim, SeHo
    Kwak, Nojun
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11686 - 11696
  • [3] From Composited to Real-World: Transformer-Based Natural Image Matting
    Wang, Yanfeng
    Tang, Lv
    Zhong, Yijie
    Li, Bo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2097 - 2111
  • [4] TIPFNet: a transformer-based infrared polarization image fusion network
    Li, Kunyuan
    Qi, Meibin
    Zhuang, Shuo
    Yang, Yanfang
    Gao, Jun
    OPTICS LETTERS, 2022, 47 (16) : 4255 - 4258
  • [5] An efficient swin transformer-based method for underwater image enhancement
    Rong Wang
    Yonghui Zhang
    Jian Zhang
    Multimedia Tools and Applications, 2023, 82 : 18691 - 18708
  • [6] An efficient swin transformer-based method for underwater image enhancement
    Wang, Rong
    Zhang, Yonghui
    Zhang, Jian
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (12) : 18691 - 18708
  • [7] TransHash: Transformer-based Hamming Hashing for Efficient Image Retrieval
    Chen, Yongbiao
    Zhang, Sheng
    Liu, Fangxin
    Chang, Zhigang
    Ye, Mang
    Qi, Zhengwei
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 127 - 136
  • [8] Transformer-based Image Compression
    Lu, Ming
    Guo, Peiyao
    Shi, Huiqing
    Cao, Chuntong
    Ma, Zhan
    DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 469 - 469
  • [9] WFormer: A Transformer-Based Soft Fusion Model for Robust Image Watermarking
    Luo, Ting
    Wu, Jun
    He, Zhouyan
    Xu, Haiyong
    Jiang, Gangyi
    Chang, Chin-Chen
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, : 1 - 18
  • [10] Transformer-Based End-to-End Anatomical and Functional Image Fusion
    Zhang, Jing
    Liu, Aiping
    Wang, Dan
    Liu, Yu
    Wang, Z. Jane
    Chen, Xun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71