RESEARCH ON IMAGE RECOGNITION OF ETHNIC MINORITY CLOTHING BASED ON IMPROVED VISION TRANSFORMER

被引:0
|
作者
Wang, Taishen [1 ]
Wen, Bin [1 ,2 ]
机构
[1] Yunnan Normal Univ, Sch Informat Sci & Technol, Kunming 650500, Peoples R China
[2] Yunnan Normal Univ, Yunnan Key Lab Smart Educ, Kunming 650500, Peoples R China
来源
关键词
Image recognition; ethnic clothing recognition; Vision Transformer; self-attention mechanism;
D O I
10.3934/mfc.2022054
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
. Due to the complex ornamentation and special composition of ethnic minority costumes, the performance of current costume image recognition algorithms is limited.Models based on convolutional neural networks can extract deep semantic features from clothing images, and perform better in datasets with more images, but ignore the large-scale features of images along the dimensional direction. Therefore, we propose an improved model based on Vision Transformer, which extracts the features of the image along the height and width directions through asymmetric convolution, and then inputs them into the Transformer encoder for serialization and encoding, and uses its output to get the recognition result. Using the accuracy as the evaluation index on the minority clothing dataset, the results show that the method we proposed performs better than ResNet34, and is 1.2% higher than the classic Vision Transformer.
引用
收藏
页码:84 / 97
页数:14
相关论文
共 50 条
  • [21] CRViT: Vision transformer advanced by causality and inductive bias for image recognition
    Lu, Faming
    Jia, Kunhao
    Zhang, Xue
    Sun, Lin
    APPLIED INTELLIGENCE, 2025, 55 (01)
  • [22] A human activity recognition method based on Vision Transformer
    Han, Huiyan
    Zeng, Hongwei
    Kuang, Liqun
    Han, Xie
    Xue, Hongxin
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [23] Facial Expression Recognition Based on Squeeze Vision Transformer
    Kim, Sangwon
    Nam, Jaeyeal
    Ko, Byoung Chul
    SENSORS, 2022, 22 (10)
  • [24] Review of Research on Application of Vision Transformer in Medical Image Analysis
    Shi, Lei
    Ji, Qingyu
    Chen, Qingwei
    Zhao, Hengyi
    Zhang, Junxing
    Computer Engineering and Applications, 2023, 59 (08): : 41 - 55
  • [25] Research on writer identification based on vision transformer
    Li, Zhenjiang
    Zhang, Qianxue
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (03) : 5169 - 5179
  • [26] DeepFake detection algorithm based on improved vision transformer
    Heo, Young-Jin
    Yeo, Woon-Ha
    Kim, Byung-Gyu
    APPLIED INTELLIGENCE, 2023, 53 (07) : 7512 - 7527
  • [27] DeepFake detection algorithm based on improved vision transformer
    Young-Jin Heo
    Woon-Ha Yeo
    Byung-Gyu Kim
    Applied Intelligence, 2023, 53 : 7512 - 7527
  • [28] Intrusion detection: A model based on the improved vision transformer
    Yang, Yu-Guang
    Fu, Hong-Mei
    Gao, Shang
    Zhou, Yi-Hua
    Shi, Wei-Min
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2022, 33 (09)
  • [29] Vehicle Classification Algorithm Based on Improved Vision Transformer
    Dong, Xinlong
    Shi, Peicheng
    Tang, Yueyue
    Yang, Li
    Yang, Aixi
    Liang, Taonian
    WORLD ELECTRIC VEHICLE JOURNAL, 2024, 15 (08):
  • [30] Efficient Image Captioning Based on Vision Transformer Models
    Elbedwehy, Samar
    Medhat, T.
    Hamza, Taher
    Alrahmawy, Mohammed F.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (01): : 1483 - 1500