Estimating residual bait density using hybrid dilated convolution and attention multi-scale network

被引:0
|
作者
Zhang, Lizhen [1 ]
Li, Yantian [1 ]
Li, Zhijian [1 ]
Meng, Xiongdong [1 ]
Zhang, Yongqi [1 ]
Wu, Di [1 ]
机构
[1] 1. College of Engineering Science and Technology, Shanghai Ocean University, Shanghai 201306, China,2. Shanghai Engineering Research Center of Marine Renewable Energy, Shanghai 201306, China
关键词
Access control - Aquaculture - Channel estimation - Convolutional neural networks - Deep neural networks - Distribution functions - Elastin - Image annotation - Image segmentation - Light reflection - Multilayer neural networks - Scales (weighing instruments);
D O I
10.11975/j.issn.1002-6819.202403053
中图分类号
学科分类号
摘要
Shrimp has been one of the most favorite seafood for years. It is essential to timely estimate the amount of feed left in the bait tray after feeding in shrimp aquaculture. Feeding strategies can also be adjusted to reduce the bait costs in recent years. The traditional detection of residual baits can rely on visual inspection by the shrimp farmers. Neural networks and deep learning have been introduced to detect and count the residual baits at present. However, large-scale neural networks cannot be successfully implemented on mobile devices, due mainly to the low recognition accuracy and large model computation. In this study, the improved model was proposed to estimate the density map of residual baits using a hybrid dilated convolution and attention multi-scale network (HAMNet). The high accuracy and low complexity were achieved in the detection models of residual bait. The HAMNet model was divided into three components: A low-level feature extractor (LLFE), a high-level feature extractor (HLFE), and a density map restorer and generator (DMRG). These components served as the front-end, the middle-end, and the back-end network of the improved model, respectively. Firstly, inspired by the multi-column convolutional neural network (MCNN), the parallel convolution block (PCB) was designed in the front-end network, in order to extract the feature information of residual bait at multiple scales within a single-column architecture; At the same time, the hybrid dilated convolution block (HDCB) was introduced into the mid-end network to expand the receptive field, in order to further learn the multi-scale features. Secondly, a channel attention mechanism (CAM) was embedded into the network to recalibrate the weights of useful feature information using the interdependence among channels, in order to highlight the difference between the target and background. Finally, the learnable transposed convolutional layers were applied in the back-end network to recover the detailed information from the feature maps. The quality of density maps was improved to reduce the counting errors. As such, the high-quality density map was then obtained during downsampling in the front-end network. In addition, the effectiveness of the improved model was validated using residual bait images under bait tray conditions. A comparison was also implemented with the classical networks of density map estimation. Comparative experiments showed that the HAMNet model was achieved in the minimum mean absolute error (MAE) of 2.0, the minimum root mean square error (RMSE) of 2.9, and the least floating point operations (FLOPs) at 6.55 G on the residual bait datasets, with a parameter count of only 0.52MB. The HAMNet model shared the higher counting accuracy and stability with the lower computational complexity. Compared with the baseline MCNN network, the improved model achieved a 44.4% reduction in MAE, a 40.8% reduction in RMSE, and a 13.7% reduction in FLOPs. Compared with the CMTL, SANet, and CSRNet, the optimal balance was obtained in all performance metrics. In summary, the HAMNet model outperformed the rest, in terms of overall performance, thus improving the counting accuracy with the low computational volume. The finding can provide a strong reference to rapidly quantify the residual baits in shrimp aquaculture. Novel ideas were also offered to deploy the detection models of residual bait on the platforms with limited computational power. © 2024 Chinese Society of Agricultural Engineering. All rights reserved.
引用
收藏
页码:137 / 145
相关论文
共 50 条
  • [41] Md-Net: Multi-scale Dilated Convolution Network for CT Images Segmentation
    Xia, Haiying
    Sun, Weifan
    Song, Shuxiang
    Mou, Xiangwei
    [J]. NEURAL PROCESSING LETTERS, 2020, 51 (03) : 2915 - 2927
  • [42] MULTI-SCALE DILATED RESIDUAL CONVOLUTIONAL NEURAL NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Pooja, Kumari
    Nidamanuri, Rama Rao
    Mishra, Deepak
    [J]. 2019 10TH WORKSHOP ON HYPERSPECTRAL IMAGING AND SIGNAL PROCESSING - EVOLUTION IN REMOTE SENSING (WHISPERS), 2019,
  • [43] Lightweight Object Detection Combined with Multi-Scale Dilated-Convolution and Multi-Scale Deconvolution
    Yi, Qingming
    Lü, Renyi
    Shi, Min
    Luo, Aiwen
    [J]. Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2022, 50 (12): : 41 - 48
  • [44] CHANGE DETECTION IN SAR IMAGES BASED ON A MULTI-SCALE ATTENTION CONVOLUTION NETWORK
    Li, Xin
    Gao, Feng
    Dong, Junyu
    Qi, Lin
    [J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3219 - 3222
  • [45] MHANet: Multi-scale hybrid attention network for crowd counting
    Yu, Ying
    Yu, Jiamao
    Qian, Jin
    Zhu, Zhiliang
    Han, Xing
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 9445 - 9455
  • [46] Multi-scale Dilated Convolutional Neural Network Model Based on Attention Mechanism
    Wang J.
    Lai X.
    Lei J.
    Zhang J.
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (06): : 497 - 508
  • [47] HYPERSPECTRAL IMAGE CLASSIFICATION VIA MULTI-SCALE RESIDUAL ATTENTION NETWORK
    Xie, Wen
    Wu, Qinzhe
    Ren, Wen
    Zhang, Yuzhuo
    [J]. IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 7649 - 7652
  • [48] Multi-scale Residual Pyramid Attention Network for Monocular Depth Estimation
    Liu, Jing
    Zhang, Xiaona
    Li, Zhaoxin
    Mao, Tianlu
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5137 - 5144
  • [49] Desert classification based on a multi-scale residual network with an attention mechanism
    Weng, Liguo
    Wang, Lexuan
    Xia, Min
    Shen, Huixiang
    Liu, Jia
    Xu, Yiqing
    [J]. GEOSCIENCES JOURNAL, 2021, 25 (03) : 387 - 399
  • [50] Desert classification based on a multi-scale residual network with an attention mechanism
    Liguo Weng
    Lexuan Wang
    Min Xia
    Huixiang Shen
    Jia Liu
    Yiqing Xu
    [J]. Geosciences Journal, 2021, 25 : 387 - 399