Attention Guided Food Recognition via Multi-Stage Local Feature Fusion

被引:1
|
作者
Deng, Gonghui [1 ]
Wu, Dunzhi [1 ]
Chen, Weizhen [1 ]
机构
[1] Wuhan Polytech Univ, Sch Elect & Elect Engn, Wuhan 430048, Peoples R China
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 80卷 / 02期
关键词
Fine-grained image recognition; food image recognition; attention mechanism; local feature fusion;
D O I
10.32604/cmc.2024.052174
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The task of food image recognition, a nuanced subset of fine-grained image recognition, grapples with substantial intra-class variation and minimal inter-class differences. These challenges are compounded by the irregular and multi-scale nature of food images. Addressing these complexities, our study introduces an advanced model that leverages multiple attention mechanisms and multi-stage local fusion, grounded in the ConvNeXt architecture. Our model employs hybrid attention (HA) mechanisms to pinpoint critical discriminative regions within images, substantially mitigating the influence of background noise. Furthermore, it introduces a multi-stage local fusion (MSLF) module, fostering long-distance dependencies between feature maps at varying stages. This approach facilitates the assimilation of complementary features across scales, significantly bolstering the model's capacity for feature extraction. Furthermore, we constructed a dataset named Roushi60, which consists of 60 different categories of common meat dishes. Empirical evaluation of the ETH Food-101, ChineseFoodNet, and Roushi60 datasets reveals that our model achieves recognition accuracies of 91.12%, 82.86%, and 92.50%, respectively. These figures not only mark an improvement of 1.04%, 3.42%, and 1.36% over the foundational ConvNeXt network but also surpass the performance of most contemporary food image recognition methods. Such advancements underscore the efficacy of our proposed model in navigating the intricate landscape of food image recognition, setting a new benchmark for the field.
引用
收藏
页码:1985 / 2003
页数:19
相关论文
共 50 条
  • [1] Local-aware spatio-temporal attention network with multi-stage feature fusion for human action recognition
    Yaqing Hou
    Hua Yu
    Dongsheng Zhou
    Pengfei Wang
    Hongwei Ge
    Jianxin Zhang
    Qiang Zhang
    Neural Computing and Applications, 2021, 33 : 16439 - 16450
  • [2] Local-aware spatio-temporal attention network with multi-stage feature fusion for human action recognition
    Hou, Yaqing
    Yu, Hua
    Zhou, Dongsheng
    Wang, Pengfei
    Ge, Hongwei
    Zhang, Jianxin
    Zhang, Qiang
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (23): : 16439 - 16450
  • [3] A multi-stage feature fusion defogging network based on the attention mechanism
    Song, Yuqin
    Zhao, Jitao
    Shang, Chunliang
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (04): : 4577 - 4599
  • [4] A multi-stage feature fusion defogging network based on the attention mechanism
    Yuqin Song
    Jitao Zhao
    Chunliang Shang
    The Journal of Supercomputing, 2024, 80 (4) : 4577 - 4599
  • [5] A Multi-Stage Adaptive Feature Fusion Neural Network for Multimodal Gait Recognition
    Zou, Shinan
    Xiong, Jianbo
    Fan, Chao
    Shen, Chuanfu
    Yu, Shiqi
    Tang, Jin
    IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2024, 6 (04): : 539 - 549
  • [6] A Multi-Stage Adaptive Feature Fusion Neural Network for Multimodal Gait Recognition
    Zou, Shinan
    Xiong, Jianbo
    Fan, Chao
    Yu, Shiqi
    Tang, Jin
    2023 IEEE INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS, IJCB, 2023,
  • [7] Multi-Stage Multi-Scale Local Feature Fusion for Infrared Small Target Detection
    Wang, Yahui
    Tian, Yan
    Liu, Jijun
    Xu, Yiping
    REMOTE SENSING, 2023, 15 (18)
  • [8] Multi-Stage Fusion of Local and Global Features Based Classification for traffic sign recognition
    Samira, El Margae
    Sanae, Berraho
    Mounir, Ait Kerroum
    Youssef, Fakhri
    2014 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2014, : 495 - 499
  • [9] MuSeFFF: Multi-stage feature fusion framework for traffic prediction
    Kumar A.
    Sunitha R.
    Intelligent Systems with Applications, 2023, 18
  • [10] Local and global feature attention fusion network for face recognition
    Wang, Yu
    Wei, Wei
    PATTERN RECOGNITION, 2025, 161