An interactive instance segmentation system with multi-resolution convolutional neural networks

被引:0
|
作者
Sung, Po-Wei [1 ]
Yang, Wei-Jong [1 ]
Yang, Jar-Ferr [1 ]
Chan, Din-Yuan [2 ]
机构
[1] Natl Cheng Kung Univ, Dept Elect Engn, Inst Comp & Commun Engn, Tainan, Taiwan
[2] Natl Chiayi Univ, Dept Comp Sci & Informat Engn, Chiayi, Taiwan
关键词
Adaptive thresholds - Convolutional neural network - Heatmaps - Learn+ - Multiple features - Multiple resolutions - Network backbones - Neural network model - Segmentation system - Sensitive features;
D O I
10.1049/cvi2.12016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a fast interactive instance segmentation (IIS) system is proposed and it is composed of an effective heatmap generator, a multi-resolution network (MRNet), and an adaptive threshold refiner to promptly and precisely predict the masks of the objects. The proposed heatmap generator after interaction clicks can help the MRNet to successfully learn the sensitive features for better prediction. Based on convolutional neural network models, the proposed MRNet backbone produces multiple features across multiple resolutions and can intrinsically predict the sharp contour of the object. After the probabilistic prediction achieved by the MRNet, the Otsu's threshold refiner is proposed to further remove some uncertain pixels in the predicted mask. Experimental results demonstrate that the proposed IIS system can promptly predict sharp masks of the targeted objects with mIoU of 89.1% in PASCAL VOC 2012 [1] validation set. Compared to other existing interactive methods, the proposed system can effectively predict the segmentation mask with higher accuracy and less interaction efforts.
引用
收藏
页码:99 / 109
页数:11
相关论文
共 50 条
  • [31] Deep Prior-Based Audio Inpainting Using Multi-Resolution Harmonic Convolutional Neural Networks
    Miotello, Federico
    Pezzoli, Mirco
    Comanducci, Luca
    Antonacci, Fabio
    Sarti, Augusto
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 113 - 123
  • [32] i-m-Top: An Interactive Multi-Resolution Tabletop System Accommodating to Multi-Resolution Human Vision
    Hu, Ting-Ting
    Chia, Yi-Wei
    Chan, Li-Wei
    Hung, Yi-Ping
    Hsu, Jane
    [J]. THIRD ANNUAL IEEE INTERNATIONAL WORKSHOP ON HORIZONTAL INTERACTIVE HUMAN-COMPUTER SYSTEMS, PROCEEDINGS: TABLETOPS AND INTERACTIVE SURFACES, 2008, : 189 - 192
  • [33] Multi-Resolution Fusion Convolutional Neural Network for Screw Locking Series
    Liu, Tianyu
    Zhou, Daoxiang
    Li, Ming
    Li, Xinyu
    [J]. Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2020, 54 (03): : 161 - 168
  • [34] Human Pose Estimation via Multi-resolution Convolutional Neural Network
    Zhu, Aichun
    Jin, Jing
    Wang, Tian
    Zhu, Qiurong
    [J]. PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 700 - 705
  • [35] Deep Prior-Based Audio Inpainting Using Multi-Resolution Harmonic Convolutional Neural Networks
    Miotello, Federico
    Pezzoli, Mirco
    Comanducci, Luca
    Antonacci, Fabio
    Sarti, Augusto
    [J]. IEEE/ACM Transactions on Audio Speech and Language Processing, 2024, 32 : 113 - 123
  • [36] Deep Convolutional Neural Networks for Multi-Instance Multi-Task Learning
    Zeng, Tao
    Ji, Shuiwang
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 579 - 588
  • [37] Application of convolutional neural networks for glands instance segmentation in the images of colon epithelial neoplasms
    Mikhailov, I.
    Khvostikov, A.
    Krylov, A.
    Oleynikova, N.
    Malkov, P.
    Kharlova, O.
    Danilova, N.
    [J]. VIRCHOWS ARCHIV, 2019, 475 : S124 - S124
  • [38] An Integration Convolutional Neural Network for Nuclei Instance Segmentation
    Qu, Aiping
    Cheng, Zhiming
    He, Xiaofeng
    Li, Yue
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1104 - 1109
  • [39] A dyadic multi-resolution deep convolutional neural wavelet network for image classification
    Ejbali, Ridha
    Zaied, Mourad
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (05) : 6149 - 6163
  • [40] A dyadic multi-resolution deep convolutional neural wavelet network for image classification
    Ridha Ejbali
    Mourad Zaied
    [J]. Multimedia Tools and Applications, 2018, 77 : 6149 - 6163