RESEARCH ON IMAGE RECOGNITION OF ETHNIC MINORITY CLOTHING BASED ON IMPROVED VISION TRANSFORMER

被引：0

作者：

Wang, Taishen ^{[1
]}

Wen, Bin ^{[1
,2
]}

机构：

[1] Yunnan Normal Univ, Sch Informat Sci & Technol, Kunming 650500, Peoples R China

[2] Yunnan Normal Univ, Yunnan Key Lab Smart Educ, Kunming 650500, Peoples R China

来源：

MATHEMATICAL FOUNDATIONS OF COMPUTING | 2024年 / 7卷 / 01期

关键词：

Image recognition; ethnic clothing recognition; Vision Transformer; self-attention mechanism;

D O I：

10.3934/mfc.2022054

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

. Due to the complex ornamentation and special composition of ethnic minority costumes, the performance of current costume image recognition algorithms is limited.Models based on convolutional neural networks can extract deep semantic features from clothing images, and perform better in datasets with more images, but ignore the large-scale features of images along the dimensional direction. Therefore, we propose an improved model based on Vision Transformer, which extracts the features of the image along the height and width directions through asymmetric convolution, and then inputs them into the Transformer encoder for serialization and encoding, and uses its output to get the recognition result. Using the accuracy as the evaluation index on the minority clothing dataset, the results show that the method we proposed performs better than ResNet34, and is 1.2% higher than the classic Vision Transformer.

引用

页码：84 / 97

页数：14

共 50 条

[21] CRViT: Vision transformer advanced by causality and inductive bias for image recognition
Lu, Faming
Jia, Kunhao
Zhang, Xue
Sun, Lin
APPLIED INTELLIGENCE, 2025, 55 (01)
[22] A human activity recognition method based on Vision Transformer
Han, Huiyan
Zeng, Hongwei
Kuang, Liqun
Han, Xie
Xue, Hongxin
SCIENTIFIC REPORTS, 2024, 14 (01):
[23] Facial Expression Recognition Based on Squeeze Vision Transformer
Kim, Sangwon
Nam, Jaeyeal
Ko, Byoung Chul
SENSORS, 2022, 22 (10)
[24] Review of Research on Application of Vision Transformer in Medical Image Analysis
Shi, Lei
Ji, Qingyu
Chen, Qingwei
Zhao, Hengyi
Zhang, Junxing
Computer Engineering and Applications, 2023, 59 (08): : 41 - 55
[25] Research on writer identification based on vision transformer
Li, Zhenjiang
Zhang, Qianxue
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (03) : 5169 - 5179
[26] DeepFake detection algorithm based on improved vision transformer
Heo, Young-Jin
Yeo, Woon-Ha
Kim, Byung-Gyu
APPLIED INTELLIGENCE, 2023, 53 (07) : 7512 - 7527
[27] DeepFake detection algorithm based on improved vision transformer
Young-Jin Heo
Woon-Ha Yeo
Byung-Gyu Kim
Applied Intelligence, 2023, 53 : 7512 - 7527
[28] Intrusion detection: A model based on the improved vision transformer
Yang, Yu-Guang
Fu, Hong-Mei
Gao, Shang
Zhou, Yi-Hua
Shi, Wei-Min
TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2022, 33 (09)
[29] Vehicle Classification Algorithm Based on Improved Vision Transformer
Dong, Xinlong
Shi, Peicheng
Tang, Yueyue
Yang, Li
Yang, Aixi
Liang, Taonian
WORLD ELECTRIC VEHICLE JOURNAL, 2024, 15 (08):
[30] Efficient Image Captioning Based on Vision Transformer Models
Elbedwehy, Samar
Medhat, T.
Hamza, Taher
Alrahmawy, Mohammed F.
CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (01): : 1483 - 1500

← 1 2 3 4 5 →