Multi-task Visual Perception Method in Dragon Orchards Based on OrchardYOLOP

被引:0
|
作者
Zhao, Wenfeng [1 ]
Huang, Yuanjue [1 ]
Zhong, Minyue [1 ]
Li, Zhenyuan [1 ]
Luo, Zitao [1 ]
Huang, Jiajun [1 ]
机构
[1] College of Electronic Engineering, College of Artificial Intelligence, South China Agricultural University, Guangzhou,510642, China
关键词
Autonomous driving - Complex terrains - Dragon orchard - Lighting environment - Multi tasks - Objects detection - Semantic segmentation - Unstructured environments - Visual perception - YOLOP;
D O I
10.6041/j.issn.1000-1298.2024.11.018
中图分类号
学科分类号
摘要
In the face of challenges such as complex terrains, fluctuating lighting, and unstructured environments, modern orchard robots require the efficient processing of a vast array of environmental information. Traditional algorithms that sequentially execute multiple single tasks are limited by computational power which are unable to meet these demands. Aiming to address the requirements for real-time performance and accuracy in multi-tasking autonomous driving robots within dragon fruit orchard environments. Building upon the YOLOP, focus attention convolution module was introduced, C2F and SPPF modules were employed, and the loss function for segmentation tasks was optimized, culminating in the OrchardYOLOP. Experiments demonstrated that OrchardYOLOP achieved a precision of 84. 1 % in target detection tasks, an mloU of 89. 7% in drivable area segmentation tasks, and an mloU increased to 90. 8% in fruit tree region segmentation tasks, with an inference speed of 33. 33 frames per second and a parameter count of only 9. 67 X 10 . Compared with the YOLOP algorithm, not only did it meet the realtime requirements in terms of speed, but also it significantly improved accuracy, addressing key issues in multi-task visual perception in dragon fruit orchards and providing an effective solution for multi-task autonomous driving visual perception in unstructured environments. © 2024 Chinese Society of Agricultural Machinery. All rights reserved.
引用
收藏
页码:160 / 170
相关论文
共 50 条
  • [31] Multi-task Compositional Network for Visual Relationship Detection
    Zhan, Yibing
    Yu, Jun
    Yu, Ting
    Tao, Dacheng
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (8-9) : 2146 - 2165
  • [32] Classification of rapid serial visual presentation based EEG with multi-task learning
    Xie P.
    Hu J.
    Jiang G.
    Wang P.
    Men Y.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2023, 44 (11): : 215 - 223
  • [33] Language AdaptiveWeight Generation for Multi-task Visual Grounding
    Su, Wei
    Miao, Peihan
    Dou, Huanzhang
    Wang, Gaoang
    Qiao, Liang
    Li, Zheyang
    Li, Xi
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10857 - 10866
  • [34] Multi-task Self-Supervised Visual Learning
    Doersch, Carl
    Zisserman, Andrew
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2070 - 2079
  • [35] DENSETRACKER: A MULTI-TASK DENSE NETWORK FOR VISUAL TRACKING
    Zhao, Fei
    Tang, Ming
    Wu, Yi
    Wang, Jinqiao
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 607 - 612
  • [36] A novel multi-satellite and multi-task scheduling method based on task network graph aggregation
    Fan, Huilong
    Yang, Zhan
    Zhang, Xi
    Wu, Shimin
    Long, Jun
    Liu, Limin
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 205
  • [37] Multi-Task Rank Learning for Visual Saliency Estimation
    Li, Jia
    Tian, Yonghong
    Huang, Tiejun
    Gao, Wen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (05) : 623 - 636
  • [38] Multi-task Compositional Network for Visual Relationship Detection
    Yibing Zhan
    Jun Yu
    Ting Yu
    Dacheng Tao
    International Journal of Computer Vision, 2020, 128 : 2146 - 2165
  • [39] Nuclear mass based on the multi-task learning neural network method
    Ming, Xing-Chen
    Zhang, Hong-Fei
    Xu, Rui-Rui
    Sun, Xiao-Dong
    Tian, Yuan
    Ge, Zhi-Gang
    NUCLEAR SCIENCE AND TECHNIQUES, 2022, 33 (05)
  • [40] Multi-Task Chinese Speech Recognition Method Based on the Squeezeformer Model
    Guo, Ying
    Wang, Li
    IAENG International Journal of Computer Science, 2025, 52 (01) : 23 - 31