RESEARCH ON IMAGE RECOGNITION OF ETHNIC MINORITY CLOTHING BASED ON IMPROVED VISION TRANSFORMER

被引:0
|
作者
Wang, Taishen [1 ]
Wen, Bin [1 ,2 ]
机构
[1] Yunnan Normal Univ, Sch Informat Sci & Technol, Kunming 650500, Peoples R China
[2] Yunnan Normal Univ, Yunnan Key Lab Smart Educ, Kunming 650500, Peoples R China
来源
关键词
Image recognition; ethnic clothing recognition; Vision Transformer; self-attention mechanism;
D O I
10.3934/mfc.2022054
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
. Due to the complex ornamentation and special composition of ethnic minority costumes, the performance of current costume image recognition algorithms is limited.Models based on convolutional neural networks can extract deep semantic features from clothing images, and perform better in datasets with more images, but ignore the large-scale features of images along the dimensional direction. Therefore, we propose an improved model based on Vision Transformer, which extracts the features of the image along the height and width directions through asymmetric convolution, and then inputs them into the Transformer encoder for serialization and encoding, and uses its output to get the recognition result. Using the accuracy as the evaluation index on the minority clothing dataset, the results show that the method we proposed performs better than ResNet34, and is 1.2% higher than the classic Vision Transformer.
引用
收藏
页码:84 / 97
页数:14
相关论文
共 50 条
  • [1] Analysis of Blood Cell Image Recognition Methods Based on Improved CNN and Vision Transformer
    Wang, Pingping
    Zhang, Xinyi
    Zhao, Yuyan
    Li, Yueti
    Xu, Kaisheng
    Zhao, Shuaiyin
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2024, E107A (06) : 899 - 908
  • [2] Recognition Algorithm of Popular Elements of Ethnic Minority Traditional Clothing Based on PCA
    Juan, Hu
    [J]. SCIENTIFIC PROGRAMMING, 2021, 2021
  • [3] Research on facial recognition of sika deer based on vision transformer
    Gong, He
    Luo, Tianye
    Ni, Lingyun
    Li, Ji
    Guo, Jie
    Liu, Tonghe
    Feng, Ruilong
    Mu, Ye
    Hu, Tianli
    Sun, Yu
    Guo, Ying
    Li, Shijun
    [J]. ECOLOGICAL INFORMATICS, 2023, 78
  • [4] Engagement Recognition in Online Learning Based on an Improved Video Vision Transformer
    Guo, Zijian
    Zhou, Zhuoyi
    Pan, Jiahui
    Liang, Yan
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [5] Colorectal cancer image recognition algorithm based on improved transformer
    Qin, Zhuanping
    Sun, Wenhao
    Guo, Tinghang
    Lu, Guangda
    [J]. DISCOVER APPLIED SCIENCES, 2024, 6 (08)
  • [6] An improved Vision Transformer model for the recognition of blood cells
    Sun, Tianyu
    Zhu, Qingtao
    Yang, Jian
    Zeng, Liang
    [J]. Shengwu Yixue Gongchengxue Zazhi/Journal of Biomedical Engineering, 2022, 39 (06): : 1097 - 1107
  • [7] Recognition of penetration state in GTAW based on vision transformer using weld pool image
    Zhenmin Wang
    Haoyu Chen
    Qiming Zhong
    Sanbao Lin
    Jianwen Wu
    Mengjia Xu
    Qin Zhang
    [J]. The International Journal of Advanced Manufacturing Technology, 2022, 119 : 5439 - 5452
  • [8] Recognition of penetration state in GTAW based on vision transformer using weld pool image
    Wang, Zhenmin
    Chen, Haoyu
    Zhong, Qiming
    Lin, Sanbao
    Wu, Jianwen
    Xu, Mengjia
    Zhang, Qin
    [J]. INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2022, 119 (7-8): : 5439 - 5452
  • [9] IMPROVED LIGHTWEIGHT MULTISCALE FINGER VEIN RECOGNITION FOR VISION TRANSFORMER
    Tao, Zhiyong
    Gao, Yajing
    Lin, Sen
    [J]. UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2024, 86 (02): : 285 - 296
  • [10] SPRNet: Sitting Posture Recognition Using improved Vision Transformer
    Fang, Yi
    Shi, Shoudong
    Fang, Jingsen
    Yin, Wenting
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,