Semantic segmentation-based observation pose estimation method for tomato harvesting robots

Cited by: 0

Authors:

Dong, Lizhong [1]

Zhu, Licheng [1]

Zhao, Bo [1]

Wang, Ruixue [1]

Ni, Jipeng [1]

Liu, Suchun [1]

Chen, Kaikang [1]

Cui, Xuezhi [1]

Zhou, Liming [1]

Affiliation:

[1] Chinese Acad Agr Mechanizat Sci Grp Co Ltd, State Key Lab Agr Equipment Technol, Beijing 100083, Peoples R China

Source:

COMPUTERS AND ELECTRONICS IN AGRICULTURE | 2025 / Volume 230

Keywords:

Machine vision; Deep learning; Semantic segmentation; Harvesting robot

DOI:

10.1016/j.compag.2025.109895

Chinese Library Classification (CLC) number:

S [Agricultural Sciences]

Discipline classification code:

09

Abstract:

Accurate identification and localization of peduncle cutting points are crucial for the automated harvesting of tomatoes. Due to the slender nature of tomato peduncles, occlusions from surrounding fruits, stems, and other obstacles often occur, which can adversely affect the accuracy of harvesting point detection. An optimal observation viewpoint of the tomato clusters can significantly enhance the visibility of peduncles within the camera frame. This study presents a pose estimation method for tomato cluster observation based on semantic segmentation, aimed at improving peduncle recognition accuracy from the end-effector camera's perspective. A lightweight semantic segmentation network, Dual-Resolution Network with Convolutional Attention (DRCANet), is developed to efficiently identify tomatoes and stems in harvesting scenes. The DRCANet adopts a dual-branch structure that incorporates the Convolutional Attention (CA) Block in the low-resolution semantic branch to enable more efficient semantic feature extraction. Further optimization of model performance is achieved by integrating a Multi-Scale Convolution with Channel Excitation Module (MSCEM), the adaptive-weighted-fusion module (AWF), and shallow feature fusion. The proposed DRCANet predicts masks for both tomatoes and stems in the images. By combining these predicted masks with depth information, the spatial point cloud data of tomatoes and stems are extracted. The spatial relationship between each tomato cluster and its corresponding stem is then analyzed, leading to the final observation pose estimation for each tomato cluster. Experimental results demonstrate that the proposed DRCANet achieves mIoU and mPA values of 82.83 % and 91.37 %, respectively, with an average inference time of 11.42 ms. The proposed observation pose estimation method achieves an accuracy of 77.84 % with an average processing time of 68.25 ms. 
This study validates the effectiveness of optimizing the observation perspective in improving the recognition accuracy of tomato peduncle picking points, offering a novel approach to enhancing the harvesting success rate of tomato harvesting robots.
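The abstract states that predicted masks are combined with depth information to extract point clouds of tomatoes and stems, but it does not say how. One common way to do this is to back-project the masked depth pixels through a pinhole camera model. The sketch below assumes that model; the function name `mask_depth_to_points` and the intrinsics `fx`, `fy`, `cx`, `cy` are hypothetical and are not taken from the paper.

```python
import numpy as np


def mask_depth_to_points(mask, depth, fx, fy, cx, cy):
    """Back-project masked depth pixels to 3D points in the camera frame.

    Assumes a pinhole camera (an assumption, not stated in the abstract):
    fx, fy are focal lengths in pixels and (cx, cy) is the principal point.
    """
    mask = np.asarray(mask, dtype=bool)
    depth = np.asarray(depth, dtype=float)

    # Keep only pixels inside the mask that have a usable depth reading.
    valid = mask & np.isfinite(depth) & (depth > 0)
    v, u = np.nonzero(valid)  # v = row index, u = column index

    z = depth[v, u]
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    return np.stack([x, y, z], axis=1)  # shape (N, 3)


# Toy 2x2 example: three masked pixels, one of which has no depth reading.
example_mask = np.array([[True, False],
                         [True, True]])
example_depth = np.array([[1.0, 1.0],
                          [0.0, 2.0]])
example_points = mask_depth_to_points(
    example_mask, example_depth, fx=1.0, fy=1.0, cx=0.0, cy=0.0
)
```

Calling the function once with the tomato mask and once with the stem mask would give two point sets whose spatial relationship can then be analyzed, as the abstract describes. Pixels with zero or non-finite depth are discarded because depth cameras typically report missing readings that way.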

Pages: 17

