A Deep Learning Network for Individual Tree Segmentation in UAV Images with a Coupled CSPNet and Attention Mechanism

被引:10
|
作者
Lv, Lujin [1 ,2 ,3 ]
Li, Xuejian [1 ,2 ,3 ]
Mao, Fangjie [1 ,2 ,3 ]
Zhou, Lv [4 ]
Xuan, Jie [1 ,2 ,3 ]
Zhao, Yinyin [1 ,2 ,3 ]
Yu, Jiacong [1 ,2 ,3 ]
Song, Meixuan [1 ,2 ,3 ]
Huang, Lei [1 ,2 ,3 ]
Du, Huaqiang [1 ,2 ,3 ]
机构
[1] Zhejiang A&F Univ, State Key Lab Subtrop Silviculture, Hangzhou 311300, Peoples R China
[2] Zhejiang A&F Univ, Key Lab Carbon Cycling Forest Ecosyst & Carbon Seq, Hangzhou 311300, Peoples R China
[3] Zhejiang A&F Univ, Sch Environm & Resources Sci, Hangzhou 311300, Peoples R China
[4] Beijing Forestry Univ, Res Ctr Forest Management Engn State Forestry & Gr, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
individual tree detection; Mask R-CNN; urban forest; deep learning; UAV; attention mechanism; CONVOLUTIONAL NEURAL-NETWORKS; CROWN DELINEATION; EXTRACTION; URBAN; CNN;
D O I
10.3390/rs15184420
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate individual tree detection by unmanned aerial vehicles (UAVs) is a critical technique for smart forest management and serves as the foundation for evaluating ecological functions. Existing object detection and segmentation methods, on the other hand, have reduced accuracy when detecting and segmenting individual trees in complicated urban forest landscapes, as well as poor mask segmentation quality. This study proposes a novel Mask-CSP-attention-coupled network (MCAN) based on the Mask R-CNN algorithm. MCAN uses the Cross Stage Partial Net (CSPNet) framework with the Sigmoid Linear Unit (SiLU) activation function in the backbone network to form a new Cross Stage Partial Residual Net (CSPResNet) and employs a convolutional block attention module (CBAM) mechanism to the feature pyramid network (FPN) for feature fusion and multiscale segmentation to further improve the feature extraction ability of the model, enhance its detail information detection ability, and improve its individual tree detection accuracy. In this study, aerial photography of the study area was conducted by UAVs, and the acquired images were used to produce a dataset for training and validation. The method was compared with the Mask Region-based Convolutional Neural Network (Mask R-CNN), Faster Region-based Convolutional Neural Network (Faster R-CNN), and You Only Look Once v5 (YOLOv5) on the test set. In addition, four scenes-namely, a dense forest distribution, building forest intersection, street trees, and active plaza vegetation-were set up, and the improved segmentation network was used to perform individual tree segmentation on these scenes to test the large-scale segmentation ability of the model. MCAN's average precision (AP) value for individual tree identification is 92.40%, which is 3.7%, 3.84%, and 12.53% better than that of Mask R-CNN, Faster R-CNN, and YOLOv5, respectively. In comparison to Mask R-CNN, the segmentation AP value is 97.70%, an increase of 8.9%. The segmentation network's precision for the four scenes in multi-scene segmentation ranges from 95.55% to 92.33%, showing that the proposed network performs high-precision segmentation in many contexts.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Individual tree detection and species classification of Amazonian palms using UAV images and deep learning
    Ferreira, Matheus Pinheiro
    Alves de Almeida, Danilo Roberti
    Papa, Daniel de Almeida
    Silva Minervino, Juliano Baldez
    Pessoa Veras, Hudson Franklin
    Formighieri, Arthur
    Nascimento Santos, Caio Alexandre
    Dantas Ferreira, Marcio Aurelio
    Figueiredo, Evandro Orfano
    Linhares Ferreira, Evandro Jose
    FOREST ECOLOGY AND MANAGEMENT, 2020, 475
  • [2] FO-Net: An advanced deep learning network for individual tree identification using UAV high-resolution images
    Zeng, Jian
    Shen, Xin
    Zhou, Kai
    Cao, Lin
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2025, 220 : 323 - 338
  • [3] A deep learning semantic segmentation network with attention mechanism for concrete crack detection
    Hang, Jiaqi
    Wu, Yingjie
    Li, Yancheng
    Lai, Tao
    Zhang, Jinge
    Li, Yang
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2023, 22 (05): : 3006 - 3026
  • [4] CEREBROVASCULAR NETWORK SEGMENTATION OF MRA IMAGES WITH DEEP LEARNING
    Sanches, Pedro
    Meyer, Cyril
    Vigon, Vincent
    Naegel, Benoit
    2019 IEEE 16TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2019), 2019, : 768 - 771
  • [5] Individual Tree Crown Segmentation Directly from UAV-Borne LiDAR Data Using the PointNet of Deep Learning
    Chen, Xinxin
    Jiang, Kang
    Zhu, Yushi
    Wang, Xiangjun
    Yun, Ting
    FORESTS, 2021, 12 (02): : 1 - 22
  • [6] Fruit Tree Canopy Segmentation by Unmanned Aerial Vehicle Photogrammetry Coupled on Convolutional Neural Network and Attention Mechanism
    He H.
    Zhou F.
    Chen M.
    Chen T.
    Guan Y.
    Zeng H.
    Wei Y.
    Journal of Geo-Information Science, 2023, 25 (12) : 2387 - 2401
  • [7] Individual Sick Fir Tree (Abies mariesii) Identification in Insect Infested Forests by Means of UAV Images and Deep Learning
    Nguyen, Ha Trang
    Lopez Caceres, Maximo Larry
    Moritake, Koma
    Kentsch, Sarah
    Shu, Hase
    Diez, Yago
    REMOTE SENSING, 2021, 13 (02) : 1 - 24
  • [8] A hybrid attention deep learning network for refined segmentation of cracks from shield tunnel lining images
    Zhao, Shuai
    Zhang, Guokai
    Zhang, Dongming
    Tan, Daoyuan
    Huang, Hongwei
    JOURNAL OF ROCK MECHANICS AND GEOTECHNICAL ENGINEERING, 2023, 15 (12) : 3105 - 3117
  • [9] Semantic segmentation network for mangrove tree species based on UAV remote sensing images
    Wang, Xin
    Zhang, Yu
    Ca, Jingye
    Qin, Qin
    Feng, Yi
    Yan, Jingke
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [10] Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images
    Yan, Qiang
    Cheng, Guojian
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION TECHNOLOGY AND COMPUTER ENGINEERING, EITCE 2023, 2023, : 1579 - 1584