Retrieval of remote sensing images based on semisupervised deep learning

被引:0
|
作者
Zhang H. [1 ]
Liu X. [1 ,2 ]
Yang S. [1 ,2 ]
Li Y. [1 ]
机构
[1] Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences, Beijing
[2] University of Chinese Academy of Sciences, Beijing
来源
Li, Yu (liyu@radi.ac.cn) | 1600年 / Science Press卷 / 21期
基金
中国国家自然科学基金;
关键词
Convolutional neural networks; Deep learning; Remote sensing image retrieval; Softmax classifier; Sparse auto encoder;
D O I
10.11834/jrs.20176105
中图分类号
学科分类号
摘要
Massive data and diversity characteristics exert higher demands on the retrieval of remote sensing images. The feature extraction algorithm is the most outstanding factor that influences the performance of retrieval methods. Traditional feature extraction methods cause the problem of semantic gap, which occurs when the low-level feature does not perfectly reflect or match the purpose of retrieval. BOVW and multilayer cluster analysis have been proposed to solve semantic gap. However, the usability of these methods is limited given their dependence on classification or cluster algorithms and artificiality. A semi supervised deep-learning method was proposed in this paper. This method combines Sparse Auto encoder (SA) and the principle of Convolutional Neural Networks (CNNs). The method involves four steps: first, the remote sensing images are pretreated with the ZCA whitening method. Second, the feature dictionary is extracted using SA, an algorithm that deals with nonannotated data. Subsequently, the feature dictionary is utilized in image convolution following the principle of CNNs, which imitates the neural net of organisms and decreases the number of features. Average pooling is conducted after image convolution. Both convolution and pooling are implemented to reduce model complexity, thus calculating distance faster. Third, the soft max classifier categorizes remote sensing images into five classes. Lastly, the remote sensing image retrieval is sorted based on the Euclidean distance between the query image and database in the same category as the query image. Experimental results based on high-resolution remote sensing images demonstrate that the proposed method is effective and is more accurate than the methods that are based on color and texture features. Image classification before sorting speeds up retrieval by 27.6%. In addition, the semi-supervised deep learning algorithm is stable when the number of returned images increases. Given that the number of neurons in the hidden SA layer and the region size of pooling primarily affect retrieval results, this study conducted several optimization experiments on these two parameters. Moreover, the algorithm in this research performed well when the number of images in the data set and the retrieval accuracy on a larger data set increased, which are meaningful for the retrieval of massive remote sensing images. The semisupervised deep learning algorithm decreases the time of image annotation, which is an exhausting job. Unlike traditional methods that extracts the features of color, texture, and shape, the method in this study directly extracts the feature dictionary from image elements with good accuracy. Moreover, the efficiency of the proposed method is guaranteed by convolution and pooling, which reduce the feature dimension. Furthermore, our experiment proves that this algorithm performs well in retrieving high-resolution remote sensing images. © 2017, Science Press. All right reserved.
引用
收藏
页码:406 / 414
页数:8
相关论文
共 27 条
  • [1] Bengio Y., Lamblin P., Popovici D., Larochelle H., Greedy layer-wise training of deep networks, Proceedings of the 19th International Conference on Neural Information Processing Systems, pp. 153-160, (2006)
  • [2] Bishop C.M., Pattern Recognition and Machine Learning, (2007)
  • [3] Bourlard H., Kamp Y., Auto-association by multilayer perceptrons and singular value decomposition, Biological Cybernetics, 59, 4-5, pp. 291-294, (1988)
  • [4] Coates A., Lee H., Ng A.Y., An analysis of single-layer networks in unsupervised feature learning, Proceedings of the 14th International Conference on Artificial Intelligence and Statistics, pp. 215-223, (2011)
  • [5] Duan B.B., Han L.X., Xie J., Fuzzy C-means clustering algorithm based on stacked sparse autoencoders, Computer Engineering and Applications, 51, 4, pp. 154-157, (2015)
  • [6] Haralick R.M., Shangmugam K., Texture features for image classification, IEEE Transaction on Systems, SMC-3, 6, pp. 768-780, (1973)
  • [7] Hinton G.E., Salakhutdinov R.R., Reducing the dimensionality of data with neural networks, Science, 313, 5786, pp. 504-507, (2006)
  • [8] Hu F., Xia G.S., Hu J.W., Zhang L.P., Transferring deep convolutional neural networks for the scene classification of high-resolution remote sensing imagery, Remote Sensing, 7, 11, pp. 14680-14707, (2015)
  • [9] Hu F., Xia G.S., Hu J.W., Zhang L.P., Transferring deep convolutional neural networks for the scene classification of high-resolution remote sensing imagery, Remote Sensing, 7, 11, pp. 14680-14707, (2015)
  • [10] Huang J., Sun Y., Xu H.R., The application of Sparse coding on image retrieval, Digital Technology and Application, 11, (2013)