Deep-learning-based in-field citrus fruit detection and tracking

被引:47
|
作者
Zhang, Wenli [1 ]
Wang, Jiaqi [1 ]
Liu, Yuxin [1 ]
Chen, Kaizhen [1 ]
Li, Huibin [2 ]
Duan, Yulin [2 ]
Wu, Wenbin [2 ]
Shi, Yun [2 ]
Guo, Wei [3 ]
机构
[1] Beijing Univ Technol, Informat Dept, Beijing 100022, Peoples R China
[2] Chinese Acad Agr Sci, Inst Agr Resources & Reg Planning, Beijing 100081, Peoples R China
[3] Univ Tokyo, Inst Sustainable Agroecosyst Serv, Int Field Phen Res Lab, Tokyo 1880002, Japan
关键词
D O I
10.1093/hr/uhac003
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Fruit yield estimation is crucial for establishing fruit harvest and marketing strategies. Recently, computer vision and deep learning techniques have been used to estimate citrus fruit yield and have exhibited notable fruit detection ability. However, computer-vision-based citrus fruit counting has two key limitations: inconsistent fruit detection accuracy and double-counting of the same fruit. Using oranges as the experimental material, this paper proposes a deep-learning-based orange counting algorithm using video sequences to help overcome these problems. The algorithm consists of two sub-algorithms, OrangeYolo for fruit detection and OrangeSort for fruit tracking. The OrangeYolo backbone network is partially based on the YOLOv3 algorithm, which has been improved upon to detect small objects (fruits) at multiple scales. The network structure was adjusted to detect small-scale targets while enabling multiscale target detection. A channel attention and spatial attention multiscale fusion module was introduced to fuse the semantic features of the deep network with the shallow textural detail features. OrangeYolo can achieve mean Average Precision (mAP) values of 0.957 in the citrus dataset, higher than the 0.905, 0.911, and 0.917 achieved with the YOLOv3, YOLOv4, and YOLOv5 algorithms. OrangeSort was designed to alleviate the double-counting problem associated with occluded fruits. A specific tracking region counting strategy and tracking algorithm based on motion displacement estimation were established. Six video sequences taken from two fields containing 22 trees were used as the validation dataset. The proposed method showed better performance (Mean Absolute Error (MAE) = 0.081, Standard Deviation (SD) = 0.08) than video-based manual counting and produced more accurate results than the existing standards Sort and DeepSort (MAE = 0.45 and 1.212; SD = 0.4741 and 1.3975).
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Deep-learning-based detection of in-field tomatoes using a terrestrial mobile platform
    Apolo-Apolo, O. E.
    Andujar-Sanchez, D.
    Reiser, D.
    Perez-Ruiz, M.
    Martinez-Guanter, J.
    [J]. PRECISION AGRICULTURE'21, 2021, : 703 - 709
  • [2] DAE-Mask: a novel deep-learning-based automatic detection model for in-field wheat diseases
    Mao, Rui
    Zhang, Yuchen
    Wang, Zexi
    Hao, Xingan
    Zhu, Tao
    Gao, Shengchang
    Hu, Xiaoping
    [J]. PRECISION AGRICULTURE, 2024, 25 (02) : 785 - 810
  • [3] DAE-Mask: a novel deep-learning-based automatic detection model for in-field wheat diseases
    Rui Mao
    Yuchen Zhang
    Zexi Wang
    Xingan Hao
    Tao Zhu
    Shengchang Gao
    Xiaoping Hu
    [J]. Precision Agriculture, 2024, 25 : 785 - 810
  • [4] Deep-Learning-Based Precision Visual Tracking
    Peng, Xiaoming
    Xu, Zhiyong
    Ji, Xiang
    Peng, Yufan
    Zhang, Jianlin
    Zuo, Haorui
    Wei, Yuxing
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (06)
  • [5] Exploring Deep Learning for In-Field Fault Detection in Microprocessors
    Dutto, Simone
    Savino, Alessandro
    Di Carlo, Stefano
    [J]. PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 1456 - 1459
  • [6] A Review on Deep-Learning-Based Cyberbullying Detection
    Hasan, Md. Tarek
    Hossain, Md. Al Emran
    Mukta, Md. Saddam Hossain
    Akter, Arifa
    Ahmed, Mohiuddin
    Islam, Salekul
    [J]. FUTURE INTERNET, 2023, 15 (05)
  • [7] Deep-Learning-Based Research on Refractive Detection
    Ding, Shangshang
    Zheng, Tianli
    Yao, Kang
    Zhang, Hetong
    Pei, Ronghao
    Fu, Weiwei
    [J]. Computer Engineering and Applications, 2024, 59 (03) : 193 - 201
  • [8] Deep-learning-based sequential phishing detection
    Ogawa, Yuji
    Kimura, Tomotaka
    Cheng, Jun
    [J]. IEICE COMMUNICATIONS EXPRESS, 2022, 11 (04): : 171 - 175
  • [9] Image-based monitoring of bolt loosening through deep-learning-based integrated detection and tracking
    Pan, Xiao
    Yang, T. Y.
    [J]. COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2022, 37 (10) : 1207 - 1222
  • [10] Deep-Learning-Based Detection of Transmission Line Insulators
    Zhang, Jian
    Xiao, Tian
    Li, Minhang
    Zhou, Yucai
    [J]. ENERGIES, 2023, 16 (14)