Multi-task Visual Perception Method in Dragon Orchards Based on OrchardYOLOP

Cited by: 0

Authors:

Zhao, Wenfeng ^{[1]}

Huang, Yuanjue ^{[1]}

Zhong, Minyue ^{[1]}

Li, Zhenyuan ^{[1]}

Luo, Zitao ^{[1]}

Huang, Jiajun ^{[1]}

Affiliation:

[1] College of Electronic Engineering, College of Artificial Intelligence, South China Agricultural University, Guangzhou, 510642, China

Source:

Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery | 2024 / Vol. 55 / No. 11

Keywords:

Autonomous driving - Complex terrains - Dragon orchard - Lighting environment - Multi tasks - Objects detection - Semantic segmentation - Unstructured environments - Visual perception - YOLOP

DOI:

10.6041/j.issn.1000-1298.2024.11.018

Abstract:

Faced with complex terrain, fluctuating lighting, and unstructured environments, modern orchard robots must efficiently process a large volume of environmental information. Traditional approaches that run multiple single-task algorithms sequentially are limited by the available computing power and cannot meet these demands. To address the real-time and accuracy requirements of multi-task autonomous driving robots in dragon fruit orchards, OrchardYOLOP was developed on the basis of YOLOP: a focus attention convolution module was introduced, C2F and SPPF modules were employed, and the loss function for the segmentation tasks was optimized. Experiments showed that OrchardYOLOP achieved a precision of 84.1% in target detection, an mIoU of 89.7% in drivable area segmentation, and an mIoU that increased to 90.8% in fruit tree region segmentation, with an inference speed of 33.33 frames per second and a parameter count of only 9.67 × 10^{…}. Compared with YOLOP, it met the real-time speed requirement while significantly improving accuracy, addressing key issues in multi-task visual perception in dragon fruit orchards and providing an effective solution for multi-task autonomous driving visual perception in unstructured environments. © 2024 Chinese Society of Agricultural Machinery. All rights reserved.
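
The two segmentation results in the abstract are reported as mIoU (mean intersection over union). The record does not say how the authors computed it, so the following is only a minimal sketch of the common definition. It assumes per-class IoU averaged over the classes that appear in either mask. The function name `mean_iou` and the toy labels are hypothetical, chosen for illustration.

```python
def mean_iou(pred, target, num_classes):
    """Average per-class IoU over classes present in pred or target.

    pred and target are flattened label masks of equal length.
    This is an illustrative definition, not the paper's evaluation code.
    """
    ious = []
    for c in range(num_classes):
        # Pixels where both masks agree on class c
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        # Pixels where either mask assigns class c
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union:  # skip classes absent from both masks
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy flattened masks: 0 = background, 1 = drivable area (hypothetical labels)
pred = [0, 0, 1, 1, 1, 0]
target = [0, 1, 1, 1, 0, 0]
print(mean_iou(pred, target, num_classes=2))  # 0.5
```

In this toy case each class has 2 agreeing pixels out of a union of 4, so both per-class IoUs are 0.5 and so is their mean. Published evaluations differ on details such as averaging per image or over the whole dataset, so a reported figure depends on the protocol used.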

Pages: 160-170
