Structure-Aware Transformer for Shadow Detection

被引:0
|
作者
Sun, Wanlu [1 ]
Xiang, Liyun [2 ]
Zhao, Wei [3 ]
机构
[1] City Univ Macau, Fac Data Sci, Taipa, Macau, Peoples R China
[2] Zhejiang Yuexiu Univ, Sch Int Business, Shaoxing, Peoples R China
[3] Huzhou Coll, Sch Intelligent Mfg, Huzhou, Peoples R China
基金
国家重点研发计划;
关键词
computer vision; image segmentation; REMOVAL;
D O I
10.1049/ipr2.70031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Shadow detection helps reduce ambiguity in object detection and tracking. However, existing shadow detection methods tend to misidentify complex shadows and their similar patterns, such as soft shadow regions and shadow-like regions, since they treat all cases equally, leading to an incomplete structure of the detected shadow regions. To alleviate this issue, we propose a structure-aware transformer network (STNet) for robust shadow detection. Specifically, we first develop a transformer-based shadow detection network to learn significant contextual information interactions. To this end, a context-aware enhancement (CaE) block is also introduced into the backbone to expand the receptive field, thus enhancing semantic interaction. Then, we design an edge-guided multi-task learning framework to produce intermediate and main predictions with a rich structure. By fusing these two complementary predictions, we can obtain an edge-preserving refined shadow map. Finally, we introduce an auxiliary semantic-aware learning to overcome the interference from complex scenes, which facilitates the model to perceive shadow and non-shadow regions using a semantic affinity loss. By doing these, we can predict high-quality shadow maps in different scenarios. Experimental results demonstrate that our method reduces the balance error rate (BER) by 4.53%, 2.54%, and 3.49% compared to state-of-the-art (SOTA) methods on the benchmark datasets SBU, ISTD, and UCF, respectively.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Parallel structure-aware halftoning
    Huisi Wu
    Tien-Tsin Wong
    Pheng-Ann Heng
    Multimedia Tools and Applications, 2013, 67 : 529 - 547
  • [32] Local-to-Global Structure-Aware Transformer for Question Answering over Structured Knowledge
    Wang, Yingyao
    Wang, Han
    Duan, Chaoqun
    Zhao, Tiejun
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (10) : 1705 - 1714
  • [33] Structure-Aware Positional Transformer for Visible-Infrared Person Re-Identification
    Chen, Cuiqun
    Ye, Mang
    Qi, Meibin
    Wu, Jingjing
    Jiang, Jianguo
    Lin, Chia-Wen
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 2352 - 2364
  • [34] A Tree-Based Structure-Aware Transformer Decoder for Image-To-Markup Generation
    Zhong, Shuhan
    Song, Sizhe
    Li, Guanyao
    Chan, S-H Gary
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5751 - 5760
  • [35] An improved efficient model for structure-aware lane detection of unmanned vehicles
    Lv, Zezheng
    Huang, Xiaoci
    Liang, Yaozhong
    Cao, Wenguan
    Chong, Yuxiang
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2021, 235 (09) : 2496 - 2508
  • [36] Structure-Aware Visualization of Text Corpora
    Singh, Jaspreet
    Zerr, Sergej
    Siersdorfer, Stefan
    CHIIR'17: PROCEEDINGS OF THE 2017 CONFERENCE HUMAN INFORMATION INTERACTION AND RETRIEVAL, 2017, : 107 - 116
  • [37] Structure-aware QR Code abstraction
    Qiao, Siyuan
    Fang, Xiaoxin
    Sheng, Bin
    Wu, Wen
    Wu, Enhua
    VISUAL COMPUTER, 2015, 31 (6-8): : 1123 - 1133
  • [38] Structure-Aware Parameter-Free Group Query via Heterogeneous Information Network Transformer
    Chen, Hsi-Wen
    Shuai, Hong-Han
    Yang, De-Nian
    Lee, Wang-Chien
    Shi, Chuan
    Yu, Philip S.
    Chen, Ming-Syan
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 2075 - 2080
  • [39] STNMDA: A Novel Model for Predicting Potential Microbe-Drug Associations with Structure-Aware Transformer
    Fan, Liu
    Yang, Xiaoyu
    Wang, Lei
    Zhu, Xianyou
    CURRENT BIOINFORMATICS, 2024, 19 (10) : 919 - 932
  • [40] Structure-Aware Transfer of Facial Blendshapes
    Mousas, Christos
    Anagnostopoulos, Christos-Nikolaos
    PROCEEDINGS SCCG: 2015 31ST SPRING CONFERENCE ON COMPUTER GRAPHICS, 2015, : 55 - 62