One Model is Not Enough: Ensembles for Isolated Sign Language Recognition

被引:0
|
作者
Hruz, Marek [1 ]
Gruber, Ivan [1 ]
Kanis, Jakub [1 ]
Bohacek, Matyas [1 ,2 ]
Hlavac, Miroslav [1 ]
Krnoul, Zdenek [1 ]
机构
[1] Univ West Bohemia, Dept Cybernet & New Technol Informat Soc, Tech 8, Plzen 30100, Czech Republic
[2] Gymnasium Johannes Kepler, Parlerova 2-118, Prague 16900, Czech Republic
关键词
sign language recognition; CNN; Transformer; ensemble; TASK;
D O I
10.3390/s22135043
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In this paper, we dive into sign language recognition, focusing on the recognition of isolated signs. The task is defined as a classification problem, where a sequence of frames (i.e., images) is recognized as one of the given sign language glosses. We analyze two appearance-based approaches, I3D and TimeSformer, and one pose-based approach, SPOTER. The appearance-based approaches are trained on a few different data modalities, whereas the performance of SPOTER is evaluated on different types of preprocessing. All the methods are tested on two publicly available datasets: AUTSL and WLASL300. We experiment with ensemble techniques to achieve new state-of-the-art results of 73.84% accuracy on the WLASL300 dataset by using the CMA-ES optimization method to find the best ensemble weight parameters. Furthermore, we present an ensembling technique based on the Transformer model, which we call Neural Ensembler.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Isolated sign language characters recognition
    [J]. Santosa, P.I. (insap@jteti.gadjahmada.edu), 1600, Universitas Ahmad Dahlan (11):
  • [2] Recognition of isolated words of the Polish sign language
    Kapuscinski, T
    Wysocki, M
    [J]. Computer Recognition Systems, Proceedings, 2005, : 697 - 704
  • [3] Isolated Sign Language Recognition with Depth Cameras
    Oszust, Mariusz
    Krupski, Jakub
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 2085 - 2094
  • [4] Real-Time Isolated Sign Language Recognition
    Hori, Noriaki
    Yamamoto, Masahito
    [J]. FRONTIERS OF ARTIFICIAL INTELLIGENCE, ETHICS, AND MULTIDISCIPLINARY APPLICATIONS, FAIEMA 2023, 2024, : 445 - 458
  • [5] Isolated Sign Language Recognition with Fast Hand Descriptors
    Ozdemir, Gulcan
    Kindiroglu, Ahmet Alp
    Akarun, Lale
    [J]. 2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [6] Isolated Sign Language Recognition with Grassmann Covariance Matrices
    Wang, Hanjie
    Chai, Xiujuan
    Hong, Xiaopeng
    Zhao, Guoying
    Chen, Xilin
    [J]. ACM TRANSACTIONS ON ACCESSIBLE COMPUTING, 2016, 8 (04)
  • [7] Isolated Sign Language Recognition Using Deep Learning
    Das, Sukanya
    Yadav, Sumit Kumar
    Samanta, Debasis
    [J]. COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT I, 2024, 2009 : 343 - 356
  • [8] Unraveling a Decade: A Comprehensive Survey on Isolated Sign Language Recognition
    Sarhan, Noha
    Frintrop, Simone
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3202 - 3211
  • [9] ISOLATED SIGN LANGUAGE RECOGNITION USING IMPROVED DENSE TRAJECTORIES
    Ozdemir, Ogulcan
    Camgoz, Necati Cihan
    Akarun, Lalc
    [J]. 2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 1961 - 1964
  • [10] Hand pose aware multimodal isolated sign language recognition
    Rastgoo, Razieh
    Kiani, Kourosh
    Escalera, Sergio
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (01) : 127 - 163