Fine grained food image recognition based on swin transformer

被引:5
|
作者
Xiao, Zhiyong [1 ,2 ]
Diao, Guang [1 ,2 ]
Deng, Zhaohong [1 ,2 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Peoples R China
[2] Jiangnan Univ, State Key Lab Food Sci & Resources, Wuxi 214122, Peoples R China
基金
中国国家自然科学基金;
关键词
Fine-grained food image recognition; Deep learning; Swin transformer; Food health; Local feature enhancement;
D O I
10.1016/j.jfoodeng.2024.112134
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Fine-grained food image recognition is an important research direction in the field of computer vision and machine learning. However, fine-grained food image recognition faces huge challenges when dealing with foods that vary greatly in shape but belong to the same category or subcategories of that food. To improve this problem, this paper proposes a deep convolution module for obtaining local enhanced feature representation and combines it with the global feature representation obtained from Swin Transformer for deep residual, to obtain a deeper enhanced feature representation. An end-to-end fine-grained food universal classifier was also proposed, which can more accurately extract effective feature information from enhanced feature representations and achieve accurate recognition. Our approach can accurately handle foods with widely different shapes but belonging to the same category and is expected to help people better manage their diet and improve their health. Our models were trained and verified on the public fine-grained food datasets Foodx-251 and UEC Food-256 respectively, where the accuracy of the method on the validation set is 81.07% and 82.77% respectively. Compared with other state-of-the-art self-supervised methods, the method proposed in this paper exhibits higher accuracy in fine-grained food image recognition tasks.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Swin-Panda: Behavior Recognition for Giant Pandas Based on Local Fine-Grained and Spatiotemporal Displacement Features
    Yi, Xinyu
    Su, Han
    Min, Peng
    He, Mengnan
    Han, Yimin
    Luo, Gai
    Wu, Pengcheng
    Min, Qingyue
    Hou, Rong
    Chen, Peng
    DIVERSITY-BASEL, 2025, 17 (02):
  • [42] Transformer-based descriptors with fine-grained region supervisions for visual place recognition
    Wang, Yuwei
    Qiu, Yuanying
    Cheng, Peitao
    Zhang, Junyu
    KNOWLEDGE-BASED SYSTEMS, 2023, 280
  • [43] An Integrated Transformer with Collaborative Tokens Mining for Fine-Grained Recognition
    Yang, Weiwei
    Yin, Jian
    ELECTRONICS, 2023, 12 (12)
  • [44] RAMS-Trans: Recurrent Attention Multi-scale Transformer for Fine-grained Image Recognition
    Hu, Yunqing
    Jin, Xuan
    Zhang, Yin
    Hong, Haiwen
    Zhang, Jingfeng
    He, Yuan
    Xue, Hui
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4239 - 4248
  • [45] Partial Discharge Pattern Recognition Based on Swin Transformer in Atypical Datasets
    Zhang, Yuxiao
    Zhang, Ben
    Song, Hui
    Tang, Zhong
    Liu, Guanghui
    Jiang, Changming
    Gaodianya Jishu/High Voltage Engineering, 2024, 50 (12): : 5346 - 5356
  • [46] Short wave protocol signals recognition based on Swin-Transformer
    Zhu Z.
    Chen P.
    Wang Z.
    Gong K.
    Wu D.
    Wang Z.
    Tongxin Xuebao/Journal on Communications, 2022, 43 (11): : 127 - 135
  • [47] A Measurement Method of Projectile Explosion Position and Explosion Image Recognition Algorithm Based on PSPNet and Swin Transformer Fusion
    Li, Hanshan
    Zhang, Xiaoqian
    IEEE SENSORS JOURNAL, 2025, 25 (03) : 4715 - 4726
  • [48] Gradient aggregation based fine-grained image retrieval: A unified viewpoint for CNN and Transformer
    Yu, Han
    Lu, Huibin
    Zhao, Min
    Li, Zhuoyi
    Gu, Guanghua
    PATTERN RECOGNITION, 2024, 149
  • [49] Fine-Grained Image Classification Combining Swin and Multi-Scale Feature Fusion
    Xiang, Jianwen
    Chen, Minrong
    Yang, Baibing
    Computer Engineering and Applications, 2023, 59 (20): : 147 - 157
  • [50] SeLT: Sonar Echo Image Recognition for Small Targets using Lightweight Swin Transformer
    Xia, Sijia
    Hou, Mengyang
    Han, Yina
    Xiao, Ziyuan
    Guo, Zihao
    Liu, Qingyu
    Ma, Yuanliang
    OCEANS 2024 - SINGAPORE, 2024,