Adversarial autoencoder for continuous sign language recognition

被引:0
|
作者
Kamal, Suhail Muhammad [1 ,2 ,3 ]
Chen, Yidong [1 ,2 ]
Li, Shaozi [1 ,2 ]
机构
[1] Xiamen Univ, Sch Informat, Xiamen, Fujian, Peoples R China
[2] Xiamen Univ, Key Lab Digital Protect & Intelligent Proc Intangi, Minist Culture & Tourism, Xiamen, Fujian, Peoples R China
[3] Bayero Univ Kano, Fac Comp, Dept Informat Technol, Kano, Nigeria
关键词
adversarial autoencoder; continuous sign language recognition; vision-language;
D O I
10.1002/cpe.8220
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Sign language serves as a vital communication medium for the deaf community, encompassing a diverse array of signs conveyed through distinct hand shapes along with non-manual gestures like facial expressions and body movements. Accurate recognition of sign language is crucial for bridging the communication gap between deaf and hearing individuals, yet the scarcity of large-scale datasets poses a significant challenge in developing robust recognition technologies. Existing works address this challenge by employing various strategies, such as enhancing visual modules, incorporating pretrained visual models, and leveraging multiple modalities to improve performance and mitigate overfitting. However, the exploration of the contextual module, responsible for modeling long-term dependencies, remains limited. This work introduces an Adversarial Autoencoder for Continuous Sign Language Recognition, AA-CSLR, to address the constraints imposed by limited data availability, leveraging the capabilities of generative models. The integration of pretrained knowledge, coupled with cross-modal alignment, enhances the representation of sign language by effectively aligning visual and textual features. Through extensive experiments on publicly available datasets (PHOENIX-2014, PHOENIX-2014T, and CSL-Daily), we demonstrate the effectiveness of our proposed method in achieving competitive performance in continuous sign language recognition.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Improving Continuous Sign Language Recognition with Cross-Lingual Signs
    Wei, Fangyun
    Chen, Yutong
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 23555 - 23564
  • [42] Recognition of Non-Manual Content in Continuous Japanese Sign Language
    Brock, Heike
    Farag, Iva
    Nakadai, Kazuhiro
    [J]. SENSORS, 2020, 20 (19) : 1 - 21
  • [43] Possibility Theory Based Continuous Indian Sign Language Gesture Recognition
    Baranwal, Neha
    Tripathi, Kumud
    Nandi, G. C.
    [J]. TENCON 2015 - 2015 IEEE REGION 10 CONFERENCE, 2015,
  • [44] Recognition of Continuous Sign Language Alphabet Using Leap Motion Controller
    Cohen, Miri Weiss
    Ben Zikri, Nir Nir
    Velkovich, Alexander
    [J]. 2018 11TH INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTION (HSI), 2018, : 193 - 199
  • [45] Continuous Sign Language Recognition with Iterative Spatiotemporal Fine-tuning
    Koishybay, Kenessary
    Mukushev, Medet
    Sandygulova, Anara
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10211 - 10218
  • [46] Distilling Cross-Temporal Contexts for Continuous Sign Language Recognition
    Guo, Leming
    Xue, Wanli
    Guo, Qing
    Liu, Bo
    Zhang, Kaihua
    Yuan, Tiantian
    Chen, Shengyong
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10771 - 10780
  • [47] Continuous sign language recognition based on hierarchical memory sequence network
    Xue, Cuihong
    Jia, Jingli
    Yu, Ming
    Yan, Gang
    Guo, Yingchun
    Liu, Yuehao
    [J]. IET COMPUTER VISION, 2024, 18 (02) : 247 - 259
  • [48] Continuous Sign Language Recognition Based on Pseudo-supervised Learning
    Pei, Xiankun
    Guo, Dan
    Zhao, Ye
    [J]. PROCEEDINGS OF THE 2ND WORKSHOP ON MULTIMEDIA FOR ACCESSIBLE HUMAN COMPUTER INTERFACES (MAHCI '19), 2019, : 33 - 39
  • [49] Cross-modal knowledge distillation for continuous sign language recognition
    Gao, Liqing
    Shi, Peng
    Hu, Lianyu
    Feng, Jichao
    Zhu, Lei
    Wan, Liang
    Feng, Wei
    [J]. NEURAL NETWORKS, 2024, 179
  • [50] Transition movement models for large vocabulary continuous sign language recognition
    Gao, W
    Fang, GL
    Zhao, DB
    Chen, YQ
    [J]. SIXTH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, PROCEEDINGS, 2004, : 553 - 558