Semantic segmentation-based observation pose estimation method for tomato harvesting robots

被引：0

作者：

Dong, Lizhong ^{[1
]}

Zhu, Licheng ^{[1
]}

Zhao, Bo ^{[1
]}

Wang, Ruixue ^{[1
]}

Ni, Jipeng ^{[1
]}

Liu, Suchun ^{[1
]}

Chen, Kaikang ^{[1
]}

Cui, Xuezhi ^{[1
]}

Zhou, Liming ^{[1
]}

机构：

[1] Chinese Acad Agr Mechanizat Sci Grp Co Ltd, State Key Lab Agr Equipment Technol, Beijing 100083, Peoples R China

来源：

COMPUTERS AND ELECTRONICS IN AGRICULTURE | 2025年 / 230卷

关键词：

Machine vision; Deep learning; Semantic segmentation; Harvesting robot;

D O I：

10.1016/j.compag.2025.109895

中图分类号：

S [农业科学];

学科分类号：

09 ;

摘要：

Accurate identification and localization of peduncle cutting points are crucial for the automated harvesting of tomatoes. Due to the slender nature of tomato peduncles, occlusions from surrounding fruits, stems, and other obstacles often occur, which can adversely affect the accuracy of harvesting point detection. An optimal observation viewpoint of the tomato clusters can significantly enhance the visibility of peduncles within the camera frame. This study presents a pose estimation method for tomato cluster observation based on semantic segmentation, aimed at improving peduncle recognition accuracy from the end-effector camera's perspective. A lightweight semantic segmentation network, Dual-Resolution Network with Convolutional Attention (DRCANet), is developed to efficiently identify tomatoes and stems in harvesting scenes. The DRCANet adopts a dual-branch structure that incorporates the Convolutional Attention (CA) Block in the low-resolution semantic branch to enable more efficient semantic feature extraction. Further optimization of model performance is achieved by integrating a Multi-Scale Convolution with Channel Excitation Module (MSCEM), the adaptive-weighted-fusion module (AWF), and shallow feature fusion. The proposed DRCANet predicts masks for both tomatoes and stems in the images. By combining these predicted masks with depth information, the spatial point cloud data of tomatoes and stems are extracted. The spatial relationship between each tomato cluster and its corresponding stem is then analyzed, leading to the final observation pose estimation for each tomato cluster. Experimental results demonstrate that the proposed DRCANet achieves mIoU and mPA values of 82.83 % and 91.37 %, respectively, with an average inference time of 11.42 ms. The proposed observation pose estimation method achieves an accuracy of 77.84 % with an average processing time of 68.25 ms. This study validates the effectiveness of optimizing the observation perspective in improving the recognition accuracy of tomato peduncle picking points, offering a novel approach to enhancing the harvesting success rate of tomato harvesting robots.

引用

页数：17

共 50 条

[21] Deep semantic segmentation-based multiple description coding
Xue Li
Lili Meng
Yanyan Tan
Jia Zhang
Wenbo Wan
Huaxiang Zhang
Multimedia Tools and Applications, 2021, 80 : 10323 - 10337
[22] Semantic Segmentation-based Label Position Defect Detection
Yue, Hongwei
Jin, Yingying
2024 9TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, ICSIP, 2024, : 602 - 606
[23] Visual Object Recognition and Pose Estimation Based on a Deep Semantic Segmentation Network
Lin, Chien-Ming
Tsai, Chi-Yi
Lai, Yu-Cheng
Li, Shin-An
Wong, Ching-Chang
IEEE SENSORS JOURNAL, 2018, 18 (22) : 9370 - 9381
[24] Fruit pose recognition and directional orderly grasping strategies for tomato harvesting robots
Rong, Jiacheng
Wang, Pengbo
Wang, Tianjian
Hu, Ling
Yuan, Ting
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 202
[25] An Image Segmentation-Based Thresholding Method
Pai, Pei-Yan
Chang, Chin-Chen
Chan, Yung-Kuan
Tsai, Meng-Hsiun
Guo, Shu-Wei
JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2012, 56 (03)
[26] Semantic Image Segmentation-Based Safe Landing Point Determination Method for VTOL UAVs
Kim S.-H.
Journal of Institute of Control, Robotics and Systems, 2022, 28 (10) : 897 - 902
[27] Segmentation-based multi-class semantic object detection
Vieux, Remi
Benois-Pineau, Jenny
Domenger, Jean-Philippe
Braquelaire, Achille
MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 60 (02) : 305 - 326
[28] Real-Time Semantic Segmentation-Based Stereo Reconstruction
Miclea, Vlad-Cristian
Nedevschi, Sergiu
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (04) : 1514 - 1524
[29] DSSLIC: DEEP SEMANTIC SEGMENTATION-BASED LAYERED IMAGE COMPRESSION
Akbari, Mohammad
Liang, Jie
Han, Jingning
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2042 - 2046
[30] Segmentation-based multi-class semantic object detection
Remi Vieux
Jenny Benois-Pineau
Jean-Philippe Domenger
Achille Braquelaire
Multimedia Tools and Applications, 2012, 60 : 305 - 326

← 1 2 3 4 5 →