Review of Image Semantic Segmentation Based on Deep Learning

被引:0
|
作者
Tian X. [1 ]
Wang L. [1 ]
Ding Q. [1 ]
机构
[1] School of Information Science and Technology, Beijing Forestry University, Beijing
来源
Ruan Jian Xue Bao/Journal of Software | 2019年 / 30卷 / 02期
关键词
Deep learning; Fully supervised learning; Image semantic segmentation; Pixel classification; Weakly supervised learning;
D O I
10.13328/j.cnki.jos.005659
中图分类号
学科分类号
摘要
Recent years, applying Deep Learning (DL) into Image Semantic Segmentation (ISS) has been widely used due to its state-of-the-art performances and high-quality results. This paper systematically reviews the contribution of DL to the field of ISS. Different methods of ISS based on DL (ISSbDL) are summarized. These methods are divided into ISS based on the Regional Classification (ISSbRC) and ISS based on the Pixel Classification (ISSbPC) according to the image segmentation characteristics and segmentation granularity. Then, the methods of ISSbPC are surveyed from two points of view: ISS based on Fully Supervised Learning (ISSbFSL) and ISS based on Weakly Supervised Learning (ISSbWSL). The representative algorithms of each method are introduced and analyzed, as well as the basic workflow, framework, advantages and disadvantages of these methods are detailedly analyzed and compared. In addition, the related experiments of ISS are analyzed and summarized, and the common data sets and performance evaluation indexes in ISS experiments are introduced. Finally, possible research directions and trends are given and analyzed. © Copyright 2019, Institute of Software, the Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:440 / 468
页数:28
相关论文
共 127 条
  • [1] Csurka G., Perronnin F., An efficient approach to semantic segmentation, Int'l Journal of Computer Vision, 95, 2, pp. 198-212, (2011)
  • [2] Hinton G.E., Salakhutdinov R.R., Reducing the dimensionality of data with neural networks, Science, 313, 5786, pp. 504-507, (2006)
  • [3] Jiang F., Gu Q., Hao H.Z., Li N., Guo Y.W., Chen D.X., Survey on content-based image segmentation methods, Ruan Jian Xue Bao/Journal of Software, 28, 1, pp. 160-183, (2017)
  • [4] Wei Y.C., Zhao Y., A review on image semantic segmentation based on DCNN, Journal of Beijing Jiaotong University, 40, 4, pp. 82-91, (2016)
  • [5] Lecun Y., Bottou L., Bengio Y., Haffner P., Gradient-based learning applied to document recognition, Proc. of the IEEE, 86, 11, pp. 2278-2324, (1998)
  • [6] Pearlmutter B.A., Gradient calculations for dynamic recurrent neural networks: A survey, IEEE Trans. on Neural Networks, 6, 5, pp. 1212-1228, (1995)
  • [7] Goodfellow I.J., Pougetabadie J., Mirza M., Xu B., Wardefarley D., Ozair S., Courville A., Bengio Y., Generative adversarial networks, Advances in Neural Information Processing Systems, pp. 2672-2680, (2014)
  • [8] Hochreiter S., Schmidhuber J., Long short-term memory, Neural Computation, 9, 8, pp. 1735-1780, (1997)
  • [9] Cho K., Merrienboer B.V., Gulcehre C., Bahdanau D., Bougares F., Schwenk H., Bengio Y., Learning phrase representations using RNN encoder-decoder for statistical machine translation, (2014)
  • [10] Couprie C., Farabet C., Najman L., Lecun Y., Indoor semantic segmentation using depth information, (2013)