A Deep Learning Network for Individual Tree Segmentation in UAV Images with a Coupled CSPNet and Attention Mechanism

Cited by: 10
Authors
Lv, Lujin [1, 2, 3]
Li, Xuejian [1, 2, 3]
Mao, Fangjie [1, 2, 3]
Zhou, Lv [4]
Xuan, Jie [1, 2, 3]
Zhao, Yinyin [1, 2, 3]
Yu, Jiacong [1, 2, 3]
Song, Meixuan [1, 2, 3]
Huang, Lei [1, 2, 3]
Du, Huaqiang [1, 2, 3]
Affiliations
[1] Zhejiang A&F Univ, State Key Lab Subtrop Silviculture, Hangzhou 311300, Peoples R China
[2] Zhejiang A&F Univ, Key Lab Carbon Cycling Forest Ecosyst & Carbon Seq, Hangzhou 311300, Peoples R China
[3] Zhejiang A&F Univ, Sch Environm & Resources Sci, Hangzhou 311300, Peoples R China
[4] Beijing Forestry Univ, Res Ctr Forest Management Engn State Forestry & Gr, Beijing 100083, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
individual tree detection; Mask R-CNN; urban forest; deep learning; UAV; attention mechanism; CONVOLUTIONAL NEURAL-NETWORKS; CROWN DELINEATION; EXTRACTION; URBAN; CNN;
DOI
10.3390/rs15184420
Chinese Library Classification
X [Environmental Science, Safety Science];
Discipline classification code
08; 0830;
Abstract
Accurate individual tree detection using unmanned aerial vehicles (UAVs) is a critical technique for smart forest management and serves as the foundation for evaluating ecological functions. However, existing object detection and segmentation methods lose accuracy when detecting and segmenting individual trees in complex urban forest landscapes, and they produce poor-quality segmentation masks. This study proposes a novel Mask-CSP-attention-coupled network (MCAN) based on the Mask R-CNN algorithm. MCAN uses the Cross Stage Partial Net (CSPNet) framework with the Sigmoid Linear Unit (SiLU) activation function in the backbone network to form a new Cross Stage Partial Residual Net (CSPResNet), and it applies a convolutional block attention module (CBAM) to the feature pyramid network (FPN) for feature fusion and multiscale segmentation. Together, these changes improve the model's feature extraction, its detection of fine detail, and its individual tree detection accuracy. Aerial photography of the study area was conducted by UAVs, and the acquired images were used to produce a dataset for training and validation. The method was compared with the Mask Region-based Convolutional Neural Network (Mask R-CNN), Faster Region-based Convolutional Neural Network (Faster R-CNN), and You Only Look Once v5 (YOLOv5) on the test set. In addition, four scenes (dense forest distribution, building-forest intersection, street trees, and active plaza vegetation) were set up, and the improved segmentation network was used to perform individual tree segmentation on these scenes to test the large-scale segmentation ability of the model. MCAN's average precision (AP) for individual tree identification is 92.40%, which is 3.7%, 3.84%, and 12.53% better than that of Mask R-CNN, Faster R-CNN, and YOLOv5, respectively. Its segmentation AP is 97.70%, an increase of 8.9% over Mask R-CNN. In multi-scene segmentation, the network's precision across the four scenes ranges from 92.33% to 95.55%, showing that the proposed network performs high-precision segmentation in many contexts.
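The two building blocks named in the abstract, the SiLU activation and the channel-attention half of CBAM, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the record gives no layer sizes or weights, so the function names, the tiny 2-channel feature map, and the matrices `w1` and `w2` are all hypothetical, and CBAM's spatial-attention step is omitted for brevity.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def silu(x):
    """SiLU activation: x * sigmoid(x)."""
    return x * sigmoid(x)


def shared_mlp(v, w1, w2):
    """Two-layer MLP with ReLU, shared by the avg- and max-pooled descriptors."""
    hidden = [max(0.0, sum(w * x for w, x in zip(row, v))) for row in w1]
    return [sum(w * h for w, h in zip(row, hidden)) for row in w2]


def channel_attention(feature, w1, w2):
    """CBAM-style channel attention on a C x H x W feature map (nested lists).

    Returns the per-channel weights and the reweighted feature map.
    """
    flat = [[p for row in ch for p in row] for ch in feature]
    avg = [sum(ch) / len(ch) for ch in flat]   # global average pooling
    mx = [max(ch) for ch in flat]              # global max pooling
    a = shared_mlp(avg, w1, w2)
    m = shared_mlp(mx, w1, w2)
    weights = [sigmoid(x + y) for x, y in zip(a, m)]
    scaled = [[[p * wt for p in row] for row in ch]
              for ch, wt in zip(feature, weights)]
    return weights, scaled


# Hypothetical toy input: 2 channels of 2 x 2, reduced to 1 hidden unit.
feature = [[[1.0, 2.0], [3.0, 4.0]],
           [[0.0, 0.0], [0.0, 2.0]]]
w1 = [[0.5, 0.5]]        # illustrative weights, not from the paper
w2 = [[1.0], [-1.0]]
weights, scaled = channel_attention(feature, w1, w2)
```

With these toy weights the first channel is emphasized and the second is suppressed; in a trained network the weights would be learned, and the result would then pass through CBAM's spatial attention before feature fusion.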
Pages: 19
Related papers
50 in total
  • [11] Deep Learning Approach for Multi-class Semantic Segmentation of UAV Images
    Chouhan, Avinash
    Chutia, Dibyajyoti
    Aggarwal, Shiv Prasad
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2023, 32 (07)
  • [12] Residual Network for Deep Reinforcement Learning with Attention Mechanism
    Zhu, Hanhua
    Kaneko, Tomoyuki
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2021, 37 (03) : 517 - 533
  • [13] Tree Species Classification from UAV Canopy Images with Deep Learning Models
    Huang, Yunmei
    Ou, Botong
    Meng, Kexin
    Yang, Baijian
    Carpenter, Joshua
    Jung, Jinha
    Fei, Songlin
    REMOTE SENSING, 2024, 16 (20)
  • [14] Litchi Fruit Instance Segmentation from UAV Sensed Images Using Spatial Attention-Based Deep Learning Model
    Chakraborty, Debarun
    Deka, Bhabesh
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2023, 2023, 14301 : 862 - 870
  • [15] Generalization of a deep learning network for beamforming and segmentation of ultrasound images
    Seoni, Silvia
    Matrone, Giulia
    Casali, Nicola
    Spairani, Edoardo
    Meiburger, Kristen M.
    INTERNATIONAL ULTRASONICS SYMPOSIUM (IEEE IUS 2021), 2021,
  • [16] A Novel Unsupervised Segmentation Method of Canopy Images from UAV Based on Hybrid Attention Mechanism
    Li, Jiaqi
    Wu, Yin
    Zhang, Haojia
    Wang, Hancong
    ELECTRONICS, 2023, 12 (22)
  • [17] Bidirectional attention network for real-time segmentation of forest fires based on UAV images
    Ji, Zhuangwei
    Zhong, Xincheng
    International Journal of Information and Communication Technology, 2024, 25 (06) : 38 - 51
  • [18] A deep-learning model for semantic segmentation of meshes from UAV oblique images
    Tang, Rongkui
    Xia, Mengjiao
    Yang, Yetao
    Zhang, Chen
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (13) : 4774 - 4792
  • [19] Deep Learning with Skip Connection Attention for Choroid Layer Segmentation in OCT Images
    Mao, Xiaoqian
    Zhao, Yitian
    Chen, Bang
    Ma, Yuhui
    Gu, Zaiwang
    Gu, Shenshen
    Yang, Jianlong
    Cheng, Jun
    Liu, Jiang
    42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 1641 - 1645
  • [20] Performance analysis of deep learning models for tree species identification from UAV images
    Vaghela Himali Pradipkumar
    Alagu Raja Ramasamy Alagumalai
    Arabian Journal of Geosciences, 2023, 16 (11)