Accurate detection and depth estimation of table grapes and peduncles for robot harvesting, combining monocular depth estimation and CNN methods

被引:15
|
作者
Coll-Ribes, Gabriel [1 ]
Torres-Rodriguez, Ivan J. [1 ]
Grau, Antoni [2 ]
Guerra, Edmundo [2 ]
Sanfeliu, Alberto [1 ,2 ]
机构
[1] UPC, CSIC, Inst Robot & Informat Ind, Llorens & Artiga 4-6, Barcelona 08028, Spain
[2] Univ Politecn Cataluna, Pau Gargallo 5, Barcelona 08028, Spain
基金
欧盟地平线“2020”;
关键词
Image segmentation; Monocular depth; Grape bunch and peduncle detection; Grape bunch and peduncle depth estimation; Robot harvesting;
D O I
10.1016/j.compag.2023.108362
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Precision agriculture is a growing field in the agricultural industry and it holds great potential in fruit and vegetable harvesting. In this work, we present a robust accurate method for the detection and localization of the peduncle of table grapes, with direct implementation in automatic grape harvesting with robots. The bunch and peduncle detection methods presented in this work rely on a combination of instance segmentation and monocular depth estimation using Convolutional Neural Networks (CNN). Regarding depth estimation, we propose a combination of different depth techniques that allow precise localization of the peduncle using traditional stereo cameras, even with the particular complexity of grape peduncles. The methods proposed in this work have been tested on the WGISD (Embrapa Wine Grape Instance Segmentation) dataset, improving the results of state-of-the-art techniques. Furthermore, within the context of the EU project CANOPIES, the methods have also been tested on a dataset of 1,326 RGB-D images of table grapes, recorded at the Corsira Agricultural Cooperative Society (Aprilia, Italy), using a Realsense D435i camera located at the arm of a CANOPIES two-manipulator robot developed in the project. The detection results on the WGISD dataset show that the use of RGB-D information (mAP = 0.949) leads to superior performance compared to the use of RGB data alone (mAP = 0.891). This trend is also evident in the CANOPIES Grape Bunch and Peduncle dataset, where the mAP for RGB-D images (mAP = 0.767) outperforms that of RGB data (mAP = 0.725). Regarding depth estimation, our method achieves a mean squared error of 2.66 cm within a distance of 1 m in the CANOPIES dataset.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] EVALUATING MONOCULAR DEPTH ESTIMATION METHODS
    Padkan, N.
    Trybala, P.
    Battisti, R.
    Remondino, F.
    Bergeret, C.
    2ND GEOBENCH WORKSHOP ON EVALUATION AND BENCHMARKING OF SENSORS, SYSTEMS AND GEOSPATIAL DATA IN PHOTOGRAMMETRY AND REMOTE SENSING, VOL. 48-1, 2023, : 137 - 144
  • [2] Dual CNN Models for Unsupervised Monocular Depth Estimation
    Repala, Vamshi Krishna
    Dubey, Shiv Ram
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 209 - 217
  • [3] CNN Approach for Monocular Depth Estimation: Ear Case Study
    Magherini, Roberto
    Servi, Michaela
    Mussi, Elisa
    Furferi, Rocco
    Buonamici, Francesco
    Volpe, Yary
    DESIGN TOOLS AND METHODS IN INDUSTRIAL ENGINEERING II, ADM 2021, 2022, : 220 - 228
  • [4] Simultaneous Attack on CNN-Based Monocular Depth Estimation and Optical Flow Estimation
    Yamanaka, Koichiro
    Takahashi, Keita
    Fujii, Toshiaki
    Matsumoto, Ryuraroh
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (05): : 785 - 788
  • [5] ACED: ACCURATE AND EDGE-CONSISTENT MONOCULAR DEPTH ESTIMATION
    Swami, Kunal
    Bondada, Prasanna Vishnu
    Bajpai, Pankaj kKumar
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1376 - 1380
  • [6] Out-of-Distribution Detection for Monocular Depth Estimation
    Hornauer, Julia
    Holzbock, Adrian
    Belagiannis, Vasileios
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1911 - 1921
  • [7] Nested DWT-Based CNN Architecture for Monocular Depth Estimation
    Paul, Sandip
    Mishra, Deepak
    Marimuthu, Senthil Kumar
    SENSORS, 2023, 23 (06)
  • [8] YOLO MDE: Object Detection with Monocular Depth Estimation
    Yu, Jongsub
    Choi, Hyukdoo
    ELECTRONICS, 2022, 11 (01)
  • [9] Towards Good Practice for CNN-Based Monocular Depth Estimation
    Fang, Zhicheng
    Chen, Xiaoran
    Chen, Yuhua
    Van Gool, Luc
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1080 - 1089
  • [10] Single-Stage Refinement CNN for Depth Estimation in Monocular Images
    Valdez Rodriguez, Jose E.
    Calvo, Hiram
    Felipe Riveron, Edgardo M.
    COMPUTACION Y SISTEMAS, 2020, 24 (02): : 439 - 451