A Deep Learning Network for Individual Tree Segmentation in UAV Images with a Coupled CSPNet and Attention Mechanism

被引:10
|
作者
Lv, Lujin [1 ,2 ,3 ]
Li, Xuejian [1 ,2 ,3 ]
Mao, Fangjie [1 ,2 ,3 ]
Zhou, Lv [4 ]
Xuan, Jie [1 ,2 ,3 ]
Zhao, Yinyin [1 ,2 ,3 ]
Yu, Jiacong [1 ,2 ,3 ]
Song, Meixuan [1 ,2 ,3 ]
Huang, Lei [1 ,2 ,3 ]
Du, Huaqiang [1 ,2 ,3 ]
机构
[1] Zhejiang A&F Univ, State Key Lab Subtrop Silviculture, Hangzhou 311300, Peoples R China
[2] Zhejiang A&F Univ, Key Lab Carbon Cycling Forest Ecosyst & Carbon Seq, Hangzhou 311300, Peoples R China
[3] Zhejiang A&F Univ, Sch Environm & Resources Sci, Hangzhou 311300, Peoples R China
[4] Beijing Forestry Univ, Res Ctr Forest Management Engn State Forestry & Gr, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
individual tree detection; Mask R-CNN; urban forest; deep learning; UAV; attention mechanism; CONVOLUTIONAL NEURAL-NETWORKS; CROWN DELINEATION; EXTRACTION; URBAN; CNN;
D O I
10.3390/rs15184420
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate individual tree detection by unmanned aerial vehicles (UAVs) is a critical technique for smart forest management and serves as the foundation for evaluating ecological functions. Existing object detection and segmentation methods, on the other hand, have reduced accuracy when detecting and segmenting individual trees in complicated urban forest landscapes, as well as poor mask segmentation quality. This study proposes a novel Mask-CSP-attention-coupled network (MCAN) based on the Mask R-CNN algorithm. MCAN uses the Cross Stage Partial Net (CSPNet) framework with the Sigmoid Linear Unit (SiLU) activation function in the backbone network to form a new Cross Stage Partial Residual Net (CSPResNet) and employs a convolutional block attention module (CBAM) mechanism to the feature pyramid network (FPN) for feature fusion and multiscale segmentation to further improve the feature extraction ability of the model, enhance its detail information detection ability, and improve its individual tree detection accuracy. In this study, aerial photography of the study area was conducted by UAVs, and the acquired images were used to produce a dataset for training and validation. The method was compared with the Mask Region-based Convolutional Neural Network (Mask R-CNN), Faster Region-based Convolutional Neural Network (Faster R-CNN), and You Only Look Once v5 (YOLOv5) on the test set. In addition, four scenes-namely, a dense forest distribution, building forest intersection, street trees, and active plaza vegetation-were set up, and the improved segmentation network was used to perform individual tree segmentation on these scenes to test the large-scale segmentation ability of the model. MCAN's average precision (AP) value for individual tree identification is 92.40%, which is 3.7%, 3.84%, and 12.53% better than that of Mask R-CNN, Faster R-CNN, and YOLOv5, respectively. In comparison to Mask R-CNN, the segmentation AP value is 97.70%, an increase of 8.9%. The segmentation network's precision for the four scenes in multi-scene segmentation ranges from 95.55% to 92.33%, showing that the proposed network performs high-precision segmentation in many contexts.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] High-Resolution Aerial Images Semantic Segmentation Using Deep Fully Convolutional Network With Channel Attention Mechanism
    Luo, Haifeng
    Chen, Chongcheng
    Fang, Lina
    Zhu, Xi
    Lu, Lijing
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2019, 12 (09) : 3492 - 3507
  • [22] MAD-UNet: A deep U-shaped network combined with an attention mechanism for pancreas segmentation in CT images
    Li, Weisheng
    Qin, Sheng
    Li, Feiyan
    Wang, Linhong
    MEDICAL PHYSICS, 2021, 48 (01) : 329 - 341
  • [23] DEEP LEARNING FOR SEMANTIC SEGMENTATION OF UAV VIDEOS
    Wang, Yiwen
    Lyn, Ye
    Cao, Yanpeng
    Yang, Michael Ying
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 2459 - 2462
  • [24] Brain tumor segmentation based on deep learning and an attention mechanism using MRI multi-modalities brain images
    Ranjbarzadeh, Ramin
    Kasgari, Abbas Bagherian
    Ghoushchi, Saeid Jafarzadeh
    Anari, Shokofeh
    Naseri, Maryam
    Bendechache, Malika
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [25] Brain tumor segmentation based on deep learning and an attention mechanism using MRI multi-modalities brain images
    Ramin Ranjbarzadeh
    Abbas Bagherian Kasgari
    Saeid Jafarzadeh Ghoushchi
    Shokofeh Anari
    Maryam Naseri
    Malika Bendechache
    Scientific Reports, 11
  • [26] Deep learning network detects individual carbon nanotubes in SEM images
    不详
    AMERICAN CERAMIC SOCIETY BULLETIN, 2023, 102 (05): : 21 - 21
  • [27] UAVSeg: Dual-Encoder Cross-Scale Attention Network for UAV Images' Semantic Segmentation
    Wang, Zhen
    You, Zhu-Hong
    Xu, Nan
    Zhang, Chuanlei
    Huang, De-Shuang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [28] Echocardiographic segmentation based on semi-supervised deep learning with attention mechanism
    Liang, Jiajun
    Pan, Huijuan
    Xiang, Zhuo
    Qin, Jing
    Qiu, Yali
    Guo, Libao
    Wang, Tianfu
    Jiang, Wei
    Lei, Baiying
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 36953 - 36973
  • [29] A Dual-Branch Deep Learning Framework at the Grid Scale for Individual Tree Segmentation
    Ding, Ze
    Zhang, Huaiqing
    Wang, Ruisheng
    Zhang, Li
    Jiang, Hanxiao
    Yun, Ting
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [30] Echocardiographic segmentation based on semi-supervised deep learning with attention mechanism
    Jiajun Liang
    Huijuan Pan
    Zhuo Xiang
    Jing Qin
    Yali Qiu
    Libao Guo
    Tianfu Wang
    Wei Jiang
    Baiying Lei
    Multimedia Tools and Applications, 2024, 83 : 36953 - 36973