共 7 条
Looking behind occlusions: A study on amodal segmentation for robust on-tree apple fruit size estimation
被引:28
|作者:
Gene-Mola, Jordi
[1
,2
]
Ferrer-Ferrer, Mar
[1
]
Gregorio, Eduard
[1
]
Blok, Pieter M.
[3
]
Hemming, Jochen
[1
,3
]
Morros, Josep-Ramon
[4
]
Rosell-Polo, Joan R.
[1
]
Vilaplana, Veronica
[4
]
Ruiz-Hidalgo, Javier
[4
]
机构:
[1] Univ Lleida UdL, Dept Agr & Forest Sci & Engn, Res Grp AgroICT & Precis Agr GRAP, Agrotecnio,CERCA Ctr, Lleida, Catalonia, Spain
[2] Inst AgriFood Res & Technol IRTA, Efficient Use Water Agr Program, Fruitcentre, Parc Cient & Tecnol Agroalimentari Gardeny PCiTAL, Lleida 25003, Spain
[3] Wageningen Univ & Res, NL-6700 AA Wageningen, Netherlands
[4] Univ Politecn Cataluna, Dept Signal Theory & Commun, Barcelona, Catalonia, Spain
关键词:
Fruit detection;
Fruit measurement;
Yield estimation;
Fruit visibility;
Deep learning;
Precision agriculture;
ORCHARD;
VISION;
GROWTH;
D O I:
10.1016/j.compag.2023.107854
中图分类号:
S [农业科学];
学科分类号:
09 ;
摘要:
The detection and sizing of fruits with computer vision methods is of interest because it provides relevant in-formation to improve the management of orchard farming. However, the presence of partially occluded fruits limits the performance of existing methods, making reliable fruit sizing a challenging task. While previous fruit segmentation works limit segmentation to the visible region of fruits (known as modal segmentation), in this work we propose an amodal segmentation algorithm to predict the complete shape, which includes its visible and occluded regions. To do so, an end-to-end convolutional neural network (CNN) for simultaneous modal and amodal instance segmentation was implemented. The predicted amodal masks were used to estimate the fruit diameters in pixels. Modal masks were used to identify the visible region and measure the distance between the apples and the camera using the depth image. Finally, the fruit diameters in millimetres (mm) were computed by applying the pinhole camera model. The method was developed with a Fuji apple dataset consisting of 3925 RGB-D images acquired at different growth stages with a total of 15,335 annotated apples, and was subsequently tested in a case study to measure the diameter of Elstar apples at different growth stages. Fruit detection results showed an F1-score of 0.86 and the fruit diameter results reported a mean absolute error (MAE) of 4.5 mm and R2 = 0.80 irrespective of fruit visibility. Besides the diameter estimation, modal and amodal masks were used to automatically determine the percentage of visibility of measured apples. This feature was used as a confidence value, improving the diameter estimation to MAE = 2.93 mm and R2 = 0.91 when limiting the size estimation to fruits detected with a visibility higher than 60%. The main advantages of the present methodology are its robustness for measuring partially occluded fruits and the capability to determine the visibility percentage. The main limitation is that depth images were generated by means of photogrammetry methods, which limits the efficiency of data acquisition. To overcome this limitation, future works should consider the use of commercial RGB-D sensors. The code and the dataset used to evaluate the method have been made publicly available at htt ps://github.com/GRAP-UdL-AT/Amodal_Fruit_Sizing.
引用
收藏
页数:13
相关论文