End-to-End Instance Segmentation with Recurrent Attention

Cited by: 161
Authors
Ren, Mengye [1]
Zemel, Richard S. [1, 2]
Affiliations
[1] Univ Toronto, Toronto, ON, Canada
[2] Canadian Inst Adv Res, Toronto, ON, Canada
DOI
10.1109/CVPR.2017.39
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
While convolutional neural networks have gained impressive success recently in solving structured prediction problems such as semantic segmentation, it remains a challenge to differentiate individual object instances in the scene. Instance segmentation is very important in a variety of applications, such as autonomous driving, image captioning, and visual question answering. Techniques that combine large graphical models with low-level vision have been proposed to address this problem; however, we propose an end-to-end recurrent neural network (RNN) architecture with an attention mechanism to model a human-like counting process, and produce detailed instance segmentations. The network is jointly trained to sequentially produce regions of interest as well as a dominant object segmentation within each region. The proposed model achieves competitive results on the CVPPP [27], KITTI [12], and Cityscapes [8] datasets.
Pages: 293-301
Page count: 9
Related papers
50 in total
  • [21] End-to-End Segmentation of Brain White Matter Hyperintensities Combining Attention and Inception Modules
    Zhao X.
    Wang X.
    Wang H.
    Guangxue Xuebao/Acta Optica Sinica, 2021, 41 (09)
  • [22] An End-to-End Geometric Characterization-aware Semantic Instance Segmentation Network for ALS Point Clouds
    Wang, Jinhong
    Yao, Wei
    MID-TERM SYMPOSIUM THE ROLE OF PHOTOGRAMMETRY FOR A SUSTAINABLE WORLD, VOL. 48-2, 2024, : 435 - 442
  • [23] CAB U-Net: An end-to-end category attention boosting algorithm for segmentation
    Ding, Xiaofeng
    Peng, Yaxin
    Shen, Chaomin
    Zeng, Tieyong
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2020, 84
  • [24] End-to-end attention convolutional recurrent network for online handwritten Chinese text recognition
    Qu, Xiwen
    Wu, Zhihong
    Huang, Jun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (23) : 62541 - 62558
  • [25] End-to-end Language Identification using Attention-based Recurrent Neural Networks
    Geng, Wang
    Wang, Wenfu
    Zhao, Yuanyuan
    Cai, Xinyuan
    Xu, Bo
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2944 - 2948
  • [26] RPAN: An End-to-End Recurrent Pose-Attention Network for Action Recognition in Videos
    Du, Wenbin
    Wang, Yali
    Qiao, Yu
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3745 - 3754
  • [27] The End-to-End Segmentation on Automotive Radar Imagery
    Xiao, Yang
    Daniel, Liam
    Gashinova, Marina
    2021 18TH EUROPEAN RADAR CONFERENCE (EURAD), 2021, : 265 - 268
  • [28] End-to-End Supervised Lung Lobe Segmentation
    Ferreira, Filipe T.
    Sousa, Patrick
    Galdran, Adrian
    Sousa, Marta R.
    Campilho, Aurelio
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [29] End-to-End Ultrametric Learning for Hierarchical Segmentation
    Lapertot, Raphael
    Chierchia, Giovanni
    Perret, Benjamin
    DISCRETE GEOMETRY AND MATHEMATICAL MORPHOLOGY, DGMM 2024, 2024, 14605 : 286 - 297
  • [30] SUPPORTIVE ATTENTION IN END-TO-END MEMORY NETWORKS
    Chien, Jen-Tzung
    Lin, Ting-An
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,