Deep-learning-based in-field citrus fruit detection and tracking

被引:47
|
作者
Zhang, Wenli [1 ]
Wang, Jiaqi [1 ]
Liu, Yuxin [1 ]
Chen, Kaizhen [1 ]
Li, Huibin [2 ]
Duan, Yulin [2 ]
Wu, Wenbin [2 ]
Shi, Yun [2 ]
Guo, Wei [3 ]
机构
[1] Beijing Univ Technol, Informat Dept, Beijing 100022, Peoples R China
[2] Chinese Acad Agr Sci, Inst Agr Resources & Reg Planning, Beijing 100081, Peoples R China
[3] Univ Tokyo, Inst Sustainable Agroecosyst Serv, Int Field Phen Res Lab, Tokyo 1880002, Japan
关键词
D O I
10.1093/hr/uhac003
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Fruit yield estimation is crucial for establishing fruit harvest and marketing strategies. Recently, computer vision and deep learning techniques have been used to estimate citrus fruit yield and have exhibited notable fruit detection ability. However, computer-vision-based citrus fruit counting has two key limitations: inconsistent fruit detection accuracy and double-counting of the same fruit. Using oranges as the experimental material, this paper proposes a deep-learning-based orange counting algorithm using video sequences to help overcome these problems. The algorithm consists of two sub-algorithms, OrangeYolo for fruit detection and OrangeSort for fruit tracking. The OrangeYolo backbone network is partially based on the YOLOv3 algorithm, which has been improved upon to detect small objects (fruits) at multiple scales. The network structure was adjusted to detect small-scale targets while enabling multiscale target detection. A channel attention and spatial attention multiscale fusion module was introduced to fuse the semantic features of the deep network with the shallow textural detail features. OrangeYolo can achieve mean Average Precision (mAP) values of 0.957 in the citrus dataset, higher than the 0.905, 0.911, and 0.917 achieved with the YOLOv3, YOLOv4, and YOLOv5 algorithms. OrangeSort was designed to alleviate the double-counting problem associated with occluded fruits. A specific tracking region counting strategy and tracking algorithm based on motion displacement estimation were established. Six video sequences taken from two fields containing 22 trees were used as the validation dataset. The proposed method showed better performance (Mean Absolute Error (MAE) = 0.081, Standard Deviation (SD) = 0.08) than video-based manual counting and produced more accurate results than the existing standards Sort and DeepSort (MAE = 0.45 and 1.212; SD = 0.4741 and 1.3975).
引用
收藏
页数:15
相关论文
共 50 条
  • [21] A cascaded deep-learning-based model for face mask detection
    Kumar, Akhil
    [J]. DATA TECHNOLOGIES AND APPLICATIONS, 2023, 57 (01) : 84 - 107
  • [22] Deep-Learning-Based Bughole Detection for Concrete Surface Image
    Yao, Gang
    Wei, Fujia
    Yang, Yang
    Sun, Yujia
    [J]. ADVANCES IN CIVIL ENGINEERING, 2019, 2019
  • [23] A cascaded deep-learning-based model for face mask detection
    Kumar, Akhil
    [J]. DATA TECHNOLOGIES AND APPLICATIONS, 2022, : 1 - 24
  • [24] Annotated dataset for deep-learning-based bacterial colony detection
    Makrai, Laszlo
    Fodroczy, Bettina
    Nagy, Sara Agnes
    Czeiszing, Peter
    Csabai, Istvan
    Szita, Geza
    Solymosi, Norbert
    [J]. SCIENTIFIC DATA, 2023, 10 (01)
  • [25] In-field citrus detection and localisation based on RGB-D image analysis
    Lin, Guichao
    Tang, Yunchao
    Zou, Xiangjun
    Li, Jinhui
    Xiong, Juntao
    [J]. BIOSYSTEMS ENGINEERING, 2019, 186 : 34 - 44
  • [26] Annotated dataset for deep-learning-based bacterial colony detection
    László Makrai
    Bettina Fodróczy
    Sára Ágnes Nagy
    Péter Czeiszing
    István Csabai
    Géza Szita
    Norbert Solymosi
    [J]. Scientific Data, 10
  • [27] A Systematic Review on Deep-Learning-Based Phishing Email Detection
    Gray, L. Earl
    Conley, Justin M.
    Bursian, Steven J.
    Kamruzzaman, Abu
    Asif, Rameez
    [J]. ELECTRONICS, 2023, 12 (21)
  • [28] Deep-Learning-Based Network Intrusion Detection for SCADA Systems
    Yang, Huan
    Cheng, Liang
    Chuah, Mooi Choo
    [J]. 2019 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2019,
  • [29] A Deep-learning-based Floor Detection System for the Visually Impaired
    Delahoz, Yueng
    Labrador, Miguel A.
    [J]. 2017 IEEE 15TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 15TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 3RD INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS(DASC/PICOM/DATACOM/CYBERSCI, 2017, : 883 - 888
  • [30] Detection and tracking of human track and field motion targets based on deep learning
    Zhang, Yafei
    Zhang, Man
    Cui, Yongxia
    Zhang, Dongyuan
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (13-14) : 9543 - 9563