Fine grained food image recognition based on swin transformer

Cited by: 5

Authors:

Xiao, Zhiyong [1,2]

Diao, Guang [1,2]

Deng, Zhaohong [1,2]

Affiliations:

[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Peoples R China

[2] Jiangnan Univ, State Key Lab Food Sci & Resources, Wuxi 214122, Peoples R China

Source:

JOURNAL OF FOOD ENGINEERING | 2024 / Vol. 380

Funding:

National Natural Science Foundation of China;

Keywords:

Fine-grained food image recognition; Deep learning; Swin transformer; Food health; Local feature enhancement;

DOI:

10.1016/j.jfoodeng.2024.112134

Chinese Library Classification number:

TQ [Chemical Industry];

Discipline classification code:

0817;

Abstract:

Fine-grained food image recognition is an important research direction in computer vision and machine learning. However, it faces major challenges when foods vary greatly in shape yet belong to the same category or to subcategories of the same food. To address this problem, this paper proposes a deep convolution module that produces a locally enhanced feature representation and combines it, through a deep residual connection, with the global feature representation obtained from Swin Transformer, yielding a deeper enhanced feature representation. An end-to-end universal classifier for fine-grained food images is also proposed, which extracts effective feature information from the enhanced feature representation more accurately and so achieves accurate recognition. The approach can accurately handle foods that differ widely in shape but belong to the same category, and it is expected to help people better manage their diet and improve their health. The models were trained and validated on the public fine-grained food datasets FoodX-251 and UEC Food-256, where the method reaches validation accuracies of 81.07% and 82.77%, respectively. Compared with other state-of-the-art self-supervised methods, the proposed method exhibits higher accuracy in fine-grained food image recognition tasks.
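The abstract describes combining a locally enhanced feature representation with a global one through a residual connection, without giving the layer details. The following is only a minimal sketch of that general idea, not the authors' module: the 3x3 per-channel convolution, the element-wise addition, and the names `conv3x3_same` and `fuse_local_global` are all assumptions introduced for illustration.

```python
# Illustrative sketch only: the abstract does not specify the module's layers.
# Assumptions (hypothetical): a 3x3 per-channel convolution stands in for the
# local enhancement module, and fusion is an element-wise residual addition.

def conv3x3_same(channel, kernel):
    """Apply a 3x3 kernel to one 2-D channel with zero padding (same-size output).

    As in most deep learning frameworks, the kernel is not flipped
    (this is technically cross-correlation).
    """
    h, w = len(channel), len(channel[0])
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for di in (-1, 0, 1):
                for dj in (-1, 0, 1):
                    y, x = i + di, j + dj
                    if 0 <= y < h and 0 <= x < w:
                        acc += channel[y][x] * kernel[di + 1][dj + 1]
            out[i][j] = acc
    return out


def fuse_local_global(global_feats, kernels):
    """Add each channel's locally convolved version back onto the channel itself."""
    fused = []
    for channel, kernel in zip(global_feats, kernels):
        local = conv3x3_same(channel, kernel)
        fused.append([[g + l for g, l in zip(g_row, l_row)]
                      for g_row, l_row in zip(channel, local)])
    return fused


# One 2x2 channel with an identity kernel: the local branch reproduces the
# input, so the fused output is twice the global features.
identity = [[0, 0, 0], [0, 1, 0], [0, 0, 0]]
feats = [[[1.0, 2.0], [3.0, 4.0]]]
fused = fuse_local_global(feats, [identity])
print(fused)
```

In a real model the kernels would be learned and the global features would come from a Swin Transformer stage; the residual form means the local branch only has to learn a correction on top of the global representation.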

Pages: 9
