Semantic segmentation-based observation pose estimation method for tomato harvesting robots

被引:0
|
作者
Dong, Lizhong [1 ]
Zhu, Licheng [1 ]
Zhao, Bo [1 ]
Wang, Ruixue [1 ]
Ni, Jipeng [1 ]
Liu, Suchun [1 ]
Chen, Kaikang [1 ]
Cui, Xuezhi [1 ]
Zhou, Liming [1 ]
机构
[1] Chinese Acad Agr Mechanizat Sci Grp Co Ltd, State Key Lab Agr Equipment Technol, Beijing 100083, Peoples R China
关键词
Machine vision; Deep learning; Semantic segmentation; Harvesting robot;
D O I
10.1016/j.compag.2025.109895
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Accurate identification and localization of peduncle cutting points are crucial for the automated harvesting of tomatoes. Due to the slender nature of tomato peduncles, occlusions from surrounding fruits, stems, and other obstacles often occur, which can adversely affect the accuracy of harvesting point detection. An optimal observation viewpoint of the tomato clusters can significantly enhance the visibility of peduncles within the camera frame. This study presents a pose estimation method for tomato cluster observation based on semantic segmentation, aimed at improving peduncle recognition accuracy from the end-effector camera's perspective. A lightweight semantic segmentation network, Dual-Resolution Network with Convolutional Attention (DRCANet), is developed to efficiently identify tomatoes and stems in harvesting scenes. The DRCANet adopts a dual-branch structure that incorporates the Convolutional Attention (CA) Block in the low-resolution semantic branch to enable more efficient semantic feature extraction. Further optimization of model performance is achieved by integrating a Multi-Scale Convolution with Channel Excitation Module (MSCEM), the adaptive-weighted-fusion module (AWF), and shallow feature fusion. The proposed DRCANet predicts masks for both tomatoes and stems in the images. By combining these predicted masks with depth information, the spatial point cloud data of tomatoes and stems are extracted. The spatial relationship between each tomato cluster and its corresponding stem is then analyzed, leading to the final observation pose estimation for each tomato cluster. Experimental results demonstrate that the proposed DRCANet achieves mIoU and mPA values of 82.83 % and 91.37 %, respectively, with an average inference time of 11.42 ms. The proposed observation pose estimation method achieves an accuracy of 77.84 % with an average processing time of 68.25 ms. This study validates the effectiveness of optimizing the observation perspective in improving the recognition accuracy of tomato peduncle picking points, offering a novel approach to enhancing the harvesting success rate of tomato harvesting robots.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Deep semantic segmentation-based multiple description coding
    Xue Li
    Lili Meng
    Yanyan Tan
    Jia Zhang
    Wenbo Wan
    Huaxiang Zhang
    Multimedia Tools and Applications, 2021, 80 : 10323 - 10337
  • [22] Semantic Segmentation-based Label Position Defect Detection
    Yue, Hongwei
    Jin, Yingying
    2024 9TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, ICSIP, 2024, : 602 - 606
  • [23] Visual Object Recognition and Pose Estimation Based on a Deep Semantic Segmentation Network
    Lin, Chien-Ming
    Tsai, Chi-Yi
    Lai, Yu-Cheng
    Li, Shin-An
    Wong, Ching-Chang
    IEEE SENSORS JOURNAL, 2018, 18 (22) : 9370 - 9381
  • [24] Fruit pose recognition and directional orderly grasping strategies for tomato harvesting robots
    Rong, Jiacheng
    Wang, Pengbo
    Wang, Tianjian
    Hu, Ling
    Yuan, Ting
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 202
  • [25] An Image Segmentation-Based Thresholding Method
    Pai, Pei-Yan
    Chang, Chin-Chen
    Chan, Yung-Kuan
    Tsai, Meng-Hsiun
    Guo, Shu-Wei
    JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2012, 56 (03)
  • [26] Semantic Image Segmentation-Based Safe Landing Point Determination Method for VTOL UAVs
    Kim S.-H.
    Journal of Institute of Control, Robotics and Systems, 2022, 28 (10) : 897 - 902
  • [27] Segmentation-based multi-class semantic object detection
    Vieux, Remi
    Benois-Pineau, Jenny
    Domenger, Jean-Philippe
    Braquelaire, Achille
    MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 60 (02) : 305 - 326
  • [28] Real-Time Semantic Segmentation-Based Stereo Reconstruction
    Miclea, Vlad-Cristian
    Nedevschi, Sergiu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (04) : 1514 - 1524
  • [29] DSSLIC: DEEP SEMANTIC SEGMENTATION-BASED LAYERED IMAGE COMPRESSION
    Akbari, Mohammad
    Liang, Jie
    Han, Jingning
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2042 - 2046
  • [30] Segmentation-based multi-class semantic object detection
    Remi Vieux
    Jenny Benois-Pineau
    Jean-Philippe Domenger
    Achille Braquelaire
    Multimedia Tools and Applications, 2012, 60 : 305 - 326