Fine grained food image recognition based on swin transformer

被引:5
|
作者
Xiao, Zhiyong [1 ,2 ]
Diao, Guang [1 ,2 ]
Deng, Zhaohong [1 ,2 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Peoples R China
[2] Jiangnan Univ, State Key Lab Food Sci & Resources, Wuxi 214122, Peoples R China
基金
中国国家自然科学基金;
关键词
Fine-grained food image recognition; Deep learning; Swin transformer; Food health; Local feature enhancement;
D O I
10.1016/j.jfoodeng.2024.112134
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Fine-grained food image recognition is an important research direction in the field of computer vision and machine learning. However, fine-grained food image recognition faces huge challenges when dealing with foods that vary greatly in shape but belong to the same category or subcategories of that food. To improve this problem, this paper proposes a deep convolution module for obtaining local enhanced feature representation and combines it with the global feature representation obtained from Swin Transformer for deep residual, to obtain a deeper enhanced feature representation. An end-to-end fine-grained food universal classifier was also proposed, which can more accurately extract effective feature information from enhanced feature representations and achieve accurate recognition. Our approach can accurately handle foods with widely different shapes but belonging to the same category and is expected to help people better manage their diet and improve their health. Our models were trained and verified on the public fine-grained food datasets Foodx-251 and UEC Food-256 respectively, where the accuracy of the method on the validation set is 81.07% and 82.77% respectively. Compared with other state-of-the-art self-supervised methods, the method proposed in this paper exhibits higher accuracy in fine-grained food image recognition tasks.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] SwinFG: A fine-grained recognition scheme based on swin transformer
    Ma, Zhipeng
    Wu, Xiaoyu
    Chu, Anzhuo
    Huang, Lei
    Wei, Zhiqiang
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 244
  • [2] Global-local feature learning for fine-grained food classification based on Swin Transformer
    Kim, Jun-Hwa
    Kim, Namho
    Won, Chee Sun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [3] Hybrid Granularities Transformer for Fine-Grained Image Recognition
    Yu, Ying
    Wang, Jinghui
    ENTROPY, 2023, 25 (04)
  • [4] Group-Attention Transformer for Fine-Grained Image Recognition
    Yan, Bo
    Wang, Siwei
    Zhu, En
    Liu, Xinwang
    Chen, Wei
    Communications in Computer and Information Science, 2022, 1587 CCIS : 40 - 54
  • [5] Fine-grained weed recognition using Swin Transformer and two-stage transfer learning
    Wang, Yecheng
    Zhang, Shuangqing
    Dai, Baisheng
    Yang, Sensen
    Song, Haochen
    FRONTIERS IN PLANT SCIENCE, 2023, 14
  • [6] Fine-Grained Ship Classification by Combining CNN and Swin Transformer
    Huang, Liang
    Wang, Fengxiang
    Zhang, Yalun
    Xu, Qingxia
    REMOTE SENSING, 2022, 14 (13)
  • [7] Fine-grained Recognition of Chinese Food Image Based on DenseNet with Attention Mechanism
    Hao, Ran
    Gao, Weidong
    Mi, Jihang
    Zhao, Zhenwei
    TWELFTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2020), 2021, 11720
  • [8] Siamese transformer with hierarchical concept embedding for fine-grained image recognition
    Yilin LYU
    Liping JING
    Jiaqi WANG
    Mingzhe GUO
    Xinyue WANG
    Jian YU
    ScienceChina(InformationSciences), 2023, 66 (03) : 188 - 203
  • [9] Siamese transformer with hierarchical concept embedding for fine-grained image recognition
    Lyu, Yilin
    Jing, Liping
    Wang, Jiaqi
    Guo, Mingzhe
    Wang, Xinyue
    Yu, Jian
    SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (03)
  • [10] Siamese transformer with hierarchical concept embedding for fine-grained image recognition
    Yilin Lyu
    Liping Jing
    Jiaqi Wang
    Mingzhe Guo
    Xinyue Wang
    Jian Yu
    Science China Information Sciences, 2023, 66