Unseen Land Cover Classification from High-Resolution Orthophotos Using Integration of Zero-Shot Learning and Convolutional Neural Networks

Cited by: 29

Authors:

Pradhan, Biswajeet [1,2]

Al-Najjar, Husam A. H. [1]

Sameen, Maher Ibrahim [1]

Tsang, Ivor [3]

Alamri, Abdullah M. [4]

Affiliations:

[1] Univ Technol Sydney, Fac Engn & IT, Ctr Adv Modelling & Geospatial Informat Syst CAMG, Sydney, NSW 2007, Australia

[2] Sejong Univ, Dept Energy & Mineral Resources Engn, 209 Neungdong Ro, Seoul 05006, South Korea

[3] Univ Technol Sydney, Fac Engn & IT, Ctr Artificial Intelligence, Sydney, NSW 2007, Australia

[4] King Saud Univ, Coll Sci, Dept Geol & Geophys, POB 2455, Riyadh 11451, Saudi Arabia

Source:

REMOTE SENSING | 2020 / Vol. 12 / Issue 10

Keywords:

land cover classification; deep-learning; CNN; Zero-Shot Learning; remote sensing; orthophotos; GRAINED OBJECT RECOGNITION;

DOI:

10.3390/rs12101676

Chinese Library Classification (CLC) number:

X [Environmental Science, Safety Science];

Discipline classification code:

08 ; 0830 ;

Abstract:

Zero-shot learning (ZSL) is an approach for classifying objects unseen during the training phase and has been shown to be useful for real-world applications, especially when sufficient training data are lacking. Only a limited amount of work has been carried out on ZSL, especially in the field of remote sensing. This research investigates the use of a convolutional neural network (CNN) as a feature extraction and classification method for land cover mapping using high-resolution orthophotos. In the feature extraction phase, we used a CNN model with a single convolutional layer to extract discriminative features. In the second phase, we used class attributes learned from the Word2Vec model (pre-trained on Google News) to train a second CNN model that performed class signature prediction, using both the features extracted by the first CNN and the class attributes during training and only the features during prediction. We trained and tested our models on datasets collected over two subareas in the Cameron Highlands (training dataset, first test dataset) and Ipoh (second test dataset) in Malaysia. Several experiments were conducted on the feature extraction and classification models regarding the main parameters, such as the network's layers and depth, the number of filters, and the impact of Gaussian noise. The best models were selected using various accuracy metrics such as top-k categorical accuracy (k = 1, 2, 3), recall, precision, and F1-score. The best model for feature extraction achieved 0.953 F1-score, 0.941 precision, and 0.882 recall for the training dataset; 0.904 F1-score, 0.869 precision, and 0.949 recall for the first test dataset; and 0.898 F1-score, 0.870 precision, and 0.838 recall for the second test dataset. The best model for classification achieved an average of 0.778 top-one, 0.890 top-two, and 0.942 top-three accuracy, 0.798 F1-score, 0.766 recall, and 0.838 precision for the first test dataset and 0.737 top-one, 0.906 top-two, and 0.924 top-three accuracy, 0.729 F1-score, 0.676 recall, and 0.790 precision for the second test dataset. The results demonstrated that the proposed ZSL approach is a promising tool for land cover mapping based on high-resolution photos.
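The top-k categorical accuracy mentioned in the abstract can be illustrated with a minimal sketch. It assumes, hypothetically, that a predicted class signature is matched to candidate classes by cosine similarity; the abstract does not state the matching rule, so this is an illustration and not the authors' implementation. The three-dimensional signatures, class names, and function names below are made up for the example and stand in for real Word2Vec embeddings.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def rank_classes(predicted_signature, class_signatures):
    """Return class names ordered from most to least similar to the prediction."""
    return sorted(
        class_signatures,
        key=lambda name: cosine(predicted_signature, class_signatures[name]),
        reverse=True,
    )

def top_k_accuracy(predictions, true_labels, class_signatures, k):
    """Fraction of samples whose true label is among the k best-ranked classes."""
    hits = sum(
        1
        for signature, label in zip(predictions, true_labels)
        if label in rank_classes(signature, class_signatures)[:k]
    )
    return hits / len(true_labels)

# Hypothetical toy signatures (assumption: real ones would be Word2Vec vectors).
CLASS_SIGNATURES = {
    "water": [1.0, 0.0, 0.0],
    "forest": [0.0, 1.0, 0.0],
    "road": [0.0, 0.0, 1.0],
}

# Hypothetical outputs of a signature-prediction model for three image patches.
PREDICTED = [
    [0.9, 0.1, 0.0],  # ranked: water first
    [0.2, 0.7, 0.1],  # ranked: forest first
    [0.5, 0.1, 0.4],  # ranked: water, then road, then forest
]
TRUE_LABELS = ["water", "forest", "road"]

if __name__ == "__main__":
    for k in (1, 2, 3):
        acc = top_k_accuracy(PREDICTED, TRUE_LABELS, CLASS_SIGNATURES, k)
        print(f"top-{k} accuracy: {acc:.3f}")
```

In this toy data the third patch is ranked "water" ahead of its true class "road", so it counts as a miss at k = 1 but as a hit at k = 2, which is why top-k accuracy can only stay equal or rise as k grows.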


Pages: 26
