An interactive instance segmentation system with multi-resolution convolutional neural networks

被引:0
|
作者
Sung, Po-Wei [1 ]
Yang, Wei-Jong [1 ]
Yang, Jar-Ferr [1 ]
Chan, Din-Yuan [2 ]
机构
[1] Natl Cheng Kung Univ, Dept Elect Engn, Inst Comp & Commun Engn, Tainan, Taiwan
[2] Natl Chiayi Univ, Dept Comp Sci & Informat Engn, Chiayi, Taiwan
关键词
Adaptive thresholds - Convolutional neural network - Heatmaps - Learn+ - Multiple features - Multiple resolutions - Network backbones - Neural network model - Segmentation system - Sensitive features;
D O I
10.1049/cvi2.12016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a fast interactive instance segmentation (IIS) system is proposed and it is composed of an effective heatmap generator, a multi-resolution network (MRNet), and an adaptive threshold refiner to promptly and precisely predict the masks of the objects. The proposed heatmap generator after interaction clicks can help the MRNet to successfully learn the sensitive features for better prediction. Based on convolutional neural network models, the proposed MRNet backbone produces multiple features across multiple resolutions and can intrinsically predict the sharp contour of the object. After the probabilistic prediction achieved by the MRNet, the Otsu's threshold refiner is proposed to further remove some uncertain pixels in the predicted mask. Experimental results demonstrate that the proposed IIS system can promptly predict sharp masks of the targeted objects with mIoU of 89.1% in PASCAL VOC 2012 [1] validation set. Compared to other existing interactive methods, the proposed system can effectively predict the segmentation mask with higher accuracy and less interaction efforts.
引用
收藏
页码:99 / 109
页数:11
相关论文
共 50 条
  • [1] Multi-resolution convolutional neural networks for inverse problems
    Wang, Feng
    Eljarrat, Alberto
    Mueller, Johannes
    Henninen, Trond R.
    Erni, Rolf
    Koch, Christoph T.
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [2] Multi-resolution convolutional neural networks for inverse problems
    Feng Wang
    Alberto Eljarrat
    Johannes Müller
    Trond R. Henninen
    Rolf Erni
    Christoph T. Koch
    [J]. Scientific Reports, 10
  • [3] Multi-Resolution for Disparity Estimation with Convolutional Neural Networks
    Jammal, Samer
    Tillo, Tammam
    Xiao, Jimin
    [J]. 2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 1756 - 1761
  • [4] Multi-Resolution Convolutional Recurrent Networks
    Chien, Jen-Tzung
    Huang, Yu-Min
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 2043 - 2048
  • [5] 3D Convolutional Neural Networks for Brain Tumor Segmentation: A Comparison of Multi-resolution Architectures
    Casamitjana, Adria
    Puch, Santi
    Aduriz, Asier
    Vilaplana, Veronica
    [J]. BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, 2016, 2016, 10154 : 150 - 161
  • [6] HookNet: Multi-resolution convolutional neural networks for semantic segmentation in histopathology whole-slide images
    van Rijthoven, Mart
    Balkenhol, Maschenka
    Silina, Karina
    van der Laak, Jeroen
    Ciompi, Francesco
    [J]. MEDICAL IMAGE ANALYSIS, 2021, 68
  • [7] Multi-resolution convolutional neural networks for fully automated segmentation of acutely injured lungs in multiple species
    Gerard, Sarah E.
    Herrmann, Jacob
    Kaczka, David W.
    Musch, Guido
    Fernandez-Bustamante, Ana
    Reinhardt, Joseph M.
    [J]. MEDICAL IMAGE ANALYSIS, 2020, 60
  • [8] Multi-Resolution Convolutional Residual Neural Networks for Monaural Speech Dereverberation
    Zhao, Lei
    Zhu, Wenbo
    Li, Shengqiang
    Luo, Hong
    Zhang, Xiao-Lei
    Rahardja, Susanto
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 2338 - 2351
  • [9] Multi-Resolution Fully Convolutional Neural Networks for Monaural Audio Source Separation
    Grais, Emad M.
    Wierstorf, Hagen
    Ward, Dominic
    Plumbley, Mark D.
    [J]. LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 340 - 350
  • [10] Multi-resolution Deep Learning Convolutional Networks for Improvements in OCT Retinal Layer Segmentation
    Whitney, Jon
    Sevgi, Duriye Damla
    Srivastava, Sunil K.
    Ehlers, Justis P.
    [J]. INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2021, 62 (08)