Semantic segmentation-based observation pose estimation method for tomato harvesting robots

被引:0
|
作者
Dong, Lizhong [1 ]
Zhu, Licheng [1 ]
Zhao, Bo [1 ]
Wang, Ruixue [1 ]
Ni, Jipeng [1 ]
Liu, Suchun [1 ]
Chen, Kaikang [1 ]
Cui, Xuezhi [1 ]
Zhou, Liming [1 ]
机构
[1] Chinese Acad Agr Mechanizat Sci Grp Co Ltd, State Key Lab Agr Equipment Technol, Beijing 100083, Peoples R China
关键词
Machine vision; Deep learning; Semantic segmentation; Harvesting robot;
D O I
10.1016/j.compag.2025.109895
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Accurate identification and localization of peduncle cutting points are crucial for the automated harvesting of tomatoes. Due to the slender nature of tomato peduncles, occlusions from surrounding fruits, stems, and other obstacles often occur, which can adversely affect the accuracy of harvesting point detection. An optimal observation viewpoint of the tomato clusters can significantly enhance the visibility of peduncles within the camera frame. This study presents a pose estimation method for tomato cluster observation based on semantic segmentation, aimed at improving peduncle recognition accuracy from the end-effector camera's perspective. A lightweight semantic segmentation network, Dual-Resolution Network with Convolutional Attention (DRCANet), is developed to efficiently identify tomatoes and stems in harvesting scenes. The DRCANet adopts a dual-branch structure that incorporates the Convolutional Attention (CA) Block in the low-resolution semantic branch to enable more efficient semantic feature extraction. Further optimization of model performance is achieved by integrating a Multi-Scale Convolution with Channel Excitation Module (MSCEM), the adaptive-weighted-fusion module (AWF), and shallow feature fusion. The proposed DRCANet predicts masks for both tomatoes and stems in the images. By combining these predicted masks with depth information, the spatial point cloud data of tomatoes and stems are extracted. The spatial relationship between each tomato cluster and its corresponding stem is then analyzed, leading to the final observation pose estimation for each tomato cluster. Experimental results demonstrate that the proposed DRCANet achieves mIoU and mPA values of 82.83 % and 91.37 %, respectively, with an average inference time of 11.42 ms. The proposed observation pose estimation method achieves an accuracy of 77.84 % with an average processing time of 68.25 ms. This study validates the effectiveness of optimizing the observation perspective in improving the recognition accuracy of tomato peduncle picking points, offering a novel approach to enhancing the harvesting success rate of tomato harvesting robots.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Image expansion using segmentation-based method
    Murad Agha, Abdul Karim
    Ward, Rabab
    Zahir, Saif
    IEEE Pacific RIM Conference on Communications, Computers, and Signal Processing - Proceedings, 1999, : 95 - 98
  • [32] Segmentation visual pose estimation method based on neural radiation field
    Hong, Yong
    Luo, Shupei
    Chen, Xin
    Li, Deren
    Wang, Mi
    Zhongguo Guanxing Jishu Xuebao/Journal of Chinese Inertial Technology, 2024, 32 (10): : 975 - 984
  • [33] A segmentation-based method for metal artifact reduction
    Yu, Hengyong
    Zeng, Kai
    Bharkhada, Deepak K.
    Wang, Ge
    Madsen, Mark T.
    Saba, Osama
    Policeni, Bruno
    Howard, Matthew A.
    Smoker, Wendy R. K.
    ACADEMIC RADIOLOGY, 2007, 14 (04) : 495 - 504
  • [34] A new pose estimation method based on inertial and visual sensors for autonomous robots
    Xu, De
    Li, You Fu
    2007 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS, VOLS 1-5, 2007, : 405 - 410
  • [35] Segmentation-based quantification of Tuta absoluta's damage on tomato plants
    Loyani, Loyani
    SMART AGRICULTURAL TECHNOLOGY, 2024, 7
  • [36] A Tomato Recognition Method for Harvesting with Robots using Point Clouds
    Yoshida, Takeshi
    Fukao, Takanori
    Hasegawa, Takaomi
    2019 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2019, : 456 - 461
  • [37] RGB-D-Based Pose Estimation of Workpieces with Semantic Segmentation and Point Cloud Registration
    Xu, Hui
    Chen, Guodong
    Wang, Zhenhua
    Sun, Lining
    Su, Fan
    SENSORS, 2019, 19 (08)
  • [38] Joint Multi-Person Pose Estimation and Semantic Part Segmentation
    Xia, Fangting
    Wang, Peng
    Chen, Xianjie
    Yuille, Alan
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6080 - 6089
  • [39] Semantic Segmentation-based Algorithm for Urban Road Waterlogging Disaster Detection
    Li, Wei
    Zhu, Huasheng
    Feng, Xiangsheng
    Li, Fen
    2021 THE 5TH INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, ICVIP 2021, 2021, : 104 - 110
  • [40] Pose estimation based on human detection and segmentation
    Chen Qiang
    Zheng EnLiang
    Liu YunCai
    SCIENCE IN CHINA SERIES F-INFORMATION SCIENCES, 2009, 52 (02): : 244 - 251