Dense Prediction with Attentive Feature Aggregation

Cited by: 2
Authors
Yang, Yung-Hsu [1]
Huang, Thomas E. [2]
Sun, Min [1]
Bulo, Samuel Rota [3]
Kontschieder, Peter [3]
Yu, Fisher [2]
Affiliations
[1] Natl Tsing Hua Univ, Hsinchu, Taiwan
[2] ETH, Zurich, Switzerland
[3] Facebook Real Labs, Zurich, Switzerland
DOI
10.1109/WACV56688.2023.00018
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Aggregating information from features across different layers is essential for dense prediction models. Despite its limited expressiveness, vanilla feature concatenation dominates the choice of aggregation operations. In this paper, we introduce Attentive Feature Aggregation (AFA) to fuse different network layers with more expressive non-linear operations. AFA exploits both spatial and channel attention to compute weighted averages of the layer activations. Inspired by neural volume rendering, we further extend AFA with Scale-Space Rendering (SSR) to perform a late fusion of multi-scale predictions. AFA is applicable to a wide range of existing network designs. Our experiments show consistent and significant improvements on challenging semantic segmentation benchmarks, including Cityscapes and BDD100K, at negligible computational and parameter overhead. In particular, AFA improves the performance of the Deep Layer Aggregation (DLA) model by nearly 6% mIoU on Cityscapes. Our experimental analyses show that AFA learns to progressively refine segmentation maps and improve boundary details, leading to new state-of-the-art results on boundary detection benchmarks on NYUDv2 and BSDS500.
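The abstract's description of attention-weighted averaging can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `attentive_fuse`, the nested-list tensor layout, and the choice to combine the spatial and channel gates by multiplication are all assumptions made for illustration. In a real network the attention logits would be predicted by small learned layers; here they are passed in directly.

```python
import math


def sigmoid(x):
    """Logistic function mapping a logit to a gate in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def attentive_fuse(shallow, deep, spatial_logits, channel_logits):
    """Fuse two feature maps with a per-pixel, per-channel weighted average.

    Hypothetical layout (an assumption, not taken from the paper):
      shallow, deep:   nested lists shaped [H][W][C]
      spatial_logits:  nested list shaped [H][W], one logit per pixel
      channel_logits:  list of length C, one logit per channel
    """
    fused = []
    for i, row in enumerate(shallow):
        fused_row = []
        for j, shallow_vec in enumerate(row):
            deep_vec = deep[i][j]
            spatial_gate = sigmoid(spatial_logits[i][j])
            pixel = []
            for c, shallow_val in enumerate(shallow_vec):
                # Assumed combination rule: product of the two gates.
                a = spatial_gate * sigmoid(channel_logits[c])
                # Weighted average of the two layer activations.
                pixel.append(a * shallow_val + (1.0 - a) * deep_vec[c])
            fused_row.append(pixel)
        fused.append(fused_row)
    return fused


# Usage: a 1x1 map with two channels and neutral (zero) logits.
shallow = [[[4.0, 8.0]]]
deep = [[[0.0, 0.0]]]
print(attentive_fuse(shallow, deep, [[0.0]], [0.0, 0.0]))
```

With zero logits both gates equal 0.5, so each output is a 0.25/0.75 blend of the shallow and deep activations. Unlike concatenation, the blend weights vary per pixel and per channel, which is the kind of input-dependent, non-linear fusion the abstract refers to.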
Pages: 97 - 106
Page count: 10
Related papers
50 in total
  • [41] Pyramid Feature Aggregation for Hierarchical Quality Prediction of Stitched Panoramic Images
    Zhou, Yu
    Gong, Weikang
    Sun, Yanjing
    Li, Leida
    Wu, Jinjian
    Gao, Xinbo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4177 - 4186
  • [42] Design of a Dense Layered Network Model for Epileptic Seizures Prediction with Feature Representation
    Parveen, Summia
    Kumar, S. A. Siva
    MohanRaj, P.
    Jabakumar, Kingsly
    Ganesh, R. Senthil
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (10) : 218 - 223
  • [43] RaFPN: Relation-Aware Feature Pyramid Network for Dense Image Prediction
    Zhou, Zhuangzhuang
    Zhu, Yingying
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 7787 - 7800
  • [44] Learning Private Neural Language Modeling with Attentive Aggregation
    Ji, Shaoxiong
    Pan, Shirui
    Long, Guodong
    Li, Xue
    Jiang, Jing
    Huang, Zi
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019
  • [45] Attentive Excitation and Aggregation for Bilingual Referring Image Segmentation
    Zhou, Qianli
    Hui, Tianrui
    Wang, Rong
    Hu, Haimiao
    Liu, Si
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (02)
  • [46] Hand gesture recognition based on attentive feature fusion
    Yu, Bin
    Luo, Zhiming
    Wu, Huangbin
    Li, Shaozi
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (22)
  • [47] Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning
    Wang, Jingwen
    Jiang, Wenhao
    Ma, Lin
    Liu, Wei
    Xu, Yong
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7190 - 7198
  • [48] Deep convolution feature aggregation: an application to diabetic retinopathy severity level prediction
    Jyostna Devi Bodapati
    Nagur Shareef Shaik
    Veeranjaneyulu Naralasetti
    Signal, Image and Video Processing, 2021, 15 : 923 - 930
  • [49] Deep convolution feature aggregation: an application to diabetic retinopathy severity level prediction
    Bodapati, Jyostna Devi
    Shaik, Nagur Shareef
    Naralasetti, Veeranjaneyulu
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (05) : 923 - 930
  • [50] Efficient Language-Driven Action Localization by Feature Aggregation and Prediction Adjustment
    Shang, Zirui
    Yang, Shuo
    Wu, Xinxiao
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), : 555 - 568