RESEARCH ON IMAGE RECOGNITION OF ETHNIC MINORITY CLOTHING BASED ON IMPROVED VISION TRANSFORMER

Cited by: 0

Authors:

Wang, Taishen [1]

Wen, Bin [1,2]

Affiliations:

[1] Yunnan Normal Univ, Sch Informat Sci & Technol, Kunming 650500, Peoples R China

[2] Yunnan Normal Univ, Yunnan Key Lab Smart Educ, Kunming 650500, Peoples R China

Source:

MATHEMATICAL FOUNDATIONS OF COMPUTING | 2024 / Vol. 7 / No. 1

Keywords:

Image recognition; ethnic clothing recognition; Vision Transformer; self-attention mechanism

DOI:

10.3934/mfc.2022054

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Subject classification code:

081202

Abstract:

Due to the complex ornamentation and special composition of ethnic minority costumes, the performance of current costume image recognition algorithms is limited. Models based on convolutional neural networks can extract deep semantic features from clothing images and perform better on datasets with more images, but they ignore the large-scale features of images along each image dimension. We therefore propose an improved model based on Vision Transformer that extracts image features along the height and width directions through asymmetric convolution, feeds them into the Transformer encoder for serialization and encoding, and uses the encoder output to produce the recognition result. With accuracy as the evaluation metric on the minority clothing dataset, the proposed method outperforms ResNet34 and scores 1.2% higher than the classic Vision Transformer.
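The asymmetric convolution step described in the abstract can be illustrated with a minimal, dependency-free sketch. This is not the authors' implementation. The function names, the 3-tap kernel, the sequential composition of the two convolutions, and the patch size are all assumptions made for illustration, and the Transformer encoder itself is omitted.

```python
# Illustrative sketch only: names, kernel values, and patch size are assumptions,
# not details taken from the paper.

def conv_along_width(image, kernel):
    """Valid 1 x k convolution (cross-correlation) sliding along each row."""
    k = len(kernel)
    return [
        [sum(row[j + t] * kernel[t] for t in range(k))
         for j in range(len(row) - k + 1)]
        for row in image
    ]


def conv_along_height(image, kernel):
    """Valid k x 1 convolution (cross-correlation) sliding down each column."""
    k = len(kernel)
    rows, cols = len(image), len(image[0])
    return [
        [sum(image[i + t][j] * kernel[t] for t in range(k))
         for j in range(cols)]
        for i in range(rows - k + 1)
    ]


def to_tokens(feature_map, patch):
    """Split a 2D feature map into non-overlapping patch x patch blocks,
    each flattened into one token vector (the sequence an encoder would consume)."""
    rows, cols = len(feature_map), len(feature_map[0])
    tokens = []
    for i in range(0, rows - patch + 1, patch):
        for j in range(0, cols - patch + 1, patch):
            tokens.append([feature_map[i + di][j + dj]
                           for di in range(patch) for dj in range(patch)])
    return tokens


# A toy 6 x 6 single-channel "image".
image = [[float(r * 6 + c) for c in range(6)] for r in range(6)]
kernel = [1.0, 0.0, -1.0]  # hypothetical 3-tap difference kernel

width_features = conv_along_width(image, kernel)    # shape: 6 rows x 4 columns
height_features = conv_along_height(image, kernel)  # shape: 4 rows x 6 columns

# Assumed composition: apply the two 1D convolutions in sequence, then tokenize.
combined = conv_along_height(width_features, kernel)  # shape: 4 x 4
tokens = to_tokens(combined, patch=2)                 # 4 tokens of length 4
```

A 1 x k kernel followed by a k x 1 kernel covers the same k x k area as a square kernel with 2k weights instead of k squared, which is the usual motivation for asymmetric convolution. How the paper actually combines the height-wise and width-wise features is not stated in the abstract.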

Pages: 84-97

Page count: 14
