FormerUnify: Transformer-Based Unified Fusion for Efficient Image Matting

Authors
Wang, Jiaquan [1]
Affiliations
[1] Shanghai Univ, Shanghai 200444, Peoples R China
Keywords
Image matting; Transformer; Feature pyramid; Unified fusion
DOI
10.1007/978-981-97-8685-5_29
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Recent deep learning-based image matting methods have incorporated additional modules and complex network structures to capture more comprehensive image information, thereby achieving higher accuracy. However, these additions inevitably reduce inference speed and increase computational resource consumption. In this paper, we propose FormerUnify, a Transformer-based unified fusion network for image matting. Compared to existing methods, it achieves a better balance between accuracy and efficiency. FormerUnify is built upon the classic encoder-decoder framework, with the Unified Fusion Decoder as its centerpiece. This decoder is composed of three layers: a unify layer, a fusion layer, and an upsampling prediction head, which work in concert to unify and fuse the rich multi-scale features extracted by the encoder. Furthermore, we couple the Unified Fusion Decoder with an advanced Transformer-based encoder and optimize their integration to improve compatibility and performance. Experimental evaluations on two synthetic datasets (Composition-1K and Distinctions-646) and a real-world dataset (AIM-500) confirm that FormerUnify achieves fast inference without compromising its accuracy.
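The abstract names three decoder stages (unify, fusion, upsampling prediction head) without giving their internals. The toy sketch below only illustrates that data flow on single-channel feature maps in plain Python. Everything in it is an assumption made for illustration and is not taken from the paper: the function names, the nearest-neighbour resizing, the per-scale scalar weights standing in for learned projections, summation as the fusion rule, and the sigmoid output.

```python
import math

def resize_nearest(fmap, out_h, out_w):
    """Nearest-neighbour resize of a 2D list to (out_h, out_w)."""
    in_h, in_w = len(fmap), len(fmap[0])
    return [[fmap[y * in_h // out_h][x * in_w // out_w] for x in range(out_w)]
            for y in range(out_h)]

def unify(features, weights, out_h, out_w):
    """Hypothetical unify step: bring every scale to one resolution and
    apply a per-scale scalar weight (stand-in for a learned projection)."""
    return [[[w * v for v in row] for row in resize_nearest(f, out_h, out_w)]
            for f, w in zip(features, weights)]

def fuse(unified):
    """Hypothetical fusion step: element-wise sum of the unified maps."""
    h, w = len(unified[0]), len(unified[0][0])
    return [[sum(m[y][x] for m in unified) for x in range(w)] for y in range(h)]

def predict_alpha(fused, out_h, out_w):
    """Hypothetical head: upsample to the output size, then squash each
    value into [0, 1] with a sigmoid to get an alpha matte."""
    up = resize_nearest(fused, out_h, out_w)
    return [[1.0 / (1.0 + math.exp(-v)) for v in row] for row in up]

def decode(features, weights, out_h, out_w):
    """Run unify -> fuse -> predict, using the finest scale as the
    common resolution (an assumption of this sketch)."""
    target_h, target_w = len(features[0]), len(features[0][0])
    unified = unify(features, weights, target_h, target_w)
    return predict_alpha(fuse(unified), out_h, out_w)

# Toy three-level pyramid, finest scale first: 4x4, 2x2, 1x1.
features = [
    [[1.0, -1.0, 0.0, 2.0] for _ in range(4)],
    [[0.5, -0.5], [0.5, -0.5]],
    [[0.0]],
]
alpha = decode(features, weights=[1.0, 0.5, 0.25], out_h=8, out_w=8)
```

Here `alpha` is an 8x8 grid of values in [0, 1]. A real decoder would operate on multi-channel tensors with learned layers; the sketch only shows the order in which the three stages consume the encoder's multi-scale outputs.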
Pages: 412-425
Page count: 14