Research on facial recognition of sika deer based on vision transformer

Cited by: 3
Authors
Gong, He [1, 2, 3, 4]
Luo, Tianye [1]
Ni, Lingyun [1]
Li, Ji [1]
Guo, Jie [1]
Liu, Tonghe [1]
Feng, Ruilong [1]
Mu, Ye [1, 2, 3, 4]
Hu, Tianli [1, 2, 3, 4]
Sun, Yu [1, 2, 3, 4]
Guo, Ying [1, 2, 3, 4]
Li, Shijun [5, 6]
Affiliations
[1] Jilin Agr Univ, Coll Informat Technol, Changchun 130118, Peoples R China
[2] Jilin Prov Agr Internet Things Technol Collaborat, Changchun 130118, Peoples R China
[3] Jilin Prov Intelligent Environm Engn Res Ctr, Changchun 130118, Peoples R China
[4] Jilin Prov Coll & Univ 13 Five Year Engn Res Ctr, Changchun 130118, Peoples R China
[5] Wuzhou Univ, Coll Informat Technol, Wuzhou 543003, Peoples R China
[6] Guangxi Key Lab Machine Vis & Intelligent Control, Wuzhou 543003, Peoples R China
Keywords
Sika deer; Vision transformer; DenseNet; Face recognition; Patch flattening
DOI
10.1016/j.ecoinf.2023.102334
Chinese Library Classification
Q14 [Ecology (bioecology)]
Discipline classification codes
071012; 0713
Abstract
With global concern about endangered ecosystems growing, identifying individual animals is vital. In this work, a Vision Transformer (ViT) based model was designed for recognizing individual sika deer from facial data. Satisfactory results require low-level aspects such as texture and color to be considered in addition to high-level semantic information, so relying only on high-level retrieval features makes good results difficult to obtain. The standard ViT, or a ViT with ResNet (residual neural network) as the backbone network, may not be the best solution, because the direct patch flattening used for feature embedding in the conventional ViT is not well suited to deer face recognition. Therefore, a DenseNet (densely connected convolutional network) block was used as Module 1 to extract low-level features. DenseNet layers enable feature reuse through dense connections, and any two layers can communicate directly, which maximizes the flow of information between layers in the network. In Module 2, a mask approach was used to eliminate extraneous information from the images and reduce the interference of complicated backgrounds with identification accuracy. In addition, pixel-wise multiplication of the feature maps output by the two modules fused local features with global features, thereby enriching the expressiveness of the feature map. Finally, a pre-trained ViT structure was applied. Experimental results showed that the proposed model reaches an accuracy of 97.68% in identifying individual sika deer and exhibits excellent generalization capability. This work also provides a valid database for the individual identification of sika deer, contributing significantly to the conservation and promotion of the ecosystem.
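The fusion step and the patch flattening mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the array shapes, the random stand-in feature maps, the square binary mask, and the function names `fuse` and `flatten_patches` are all assumptions made for illustration. It only shows what pixel-wise multiplication of two same-shaped feature maps and ViT-style flattening of non-overlapping patches look like in code.

```python
import numpy as np


def fuse(local_features, masked_features):
    """Pixel-wise (element-wise) product of two feature maps of identical shape."""
    if local_features.shape != masked_features.shape:
        raise ValueError("feature maps must have the same shape")
    return local_features * masked_features


def flatten_patches(feature_map, patch):
    """Split an (H, W, C) array into non-overlapping patch x patch tiles
    and flatten each tile into one token vector, as in ViT patch embedding."""
    h, w, c = feature_map.shape
    if h % patch or w % patch:
        raise ValueError("height and width must be divisible by the patch size")
    tiles = feature_map.reshape(h // patch, patch, w // patch, patch, c)
    tiles = tiles.transpose(0, 2, 1, 3, 4)  # (rows, cols, patch, patch, C)
    return tiles.reshape(-1, patch * patch * c)


# Hypothetical stand-ins for the outputs of the two modules.
rng = np.random.default_rng(0)
local = rng.random((8, 8, 4))             # assumed low-level feature map (Module 1)
mask = np.zeros((8, 8, 1))
mask[2:6, 2:6] = 1.0                      # assumed foreground region
masked = rng.random((8, 8, 4)) * mask     # assumed background-suppressed map (Module 2)

fused = fuse(local, masked)
tokens = flatten_patches(fused, patch=4)
print(tokens.shape)  # (4, 64)
```

In this sketch the mask zeroes every position outside the assumed foreground square, so the product keeps low-level detail only where the second map is non-zero. The resulting token matrix is the kind of input a transformer encoder would then consume.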
Pages: 14