Adversarial autoencoder for continuous sign language recognition

被引:0
|
作者
Kamal, Suhail Muhammad [1 ,2 ,3 ]
Chen, Yidong [1 ,2 ]
Li, Shaozi [1 ,2 ]
机构
[1] Xiamen Univ, Sch Informat, Xiamen, Fujian, Peoples R China
[2] Xiamen Univ, Key Lab Digital Protect & Intelligent Proc Intangi, Minist Culture & Tourism, Xiamen, Fujian, Peoples R China
[3] Bayero Univ Kano, Fac Comp, Dept Informat Technol, Kano, Nigeria
关键词
adversarial autoencoder; continuous sign language recognition; vision-language;
D O I
10.1002/cpe.8220
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Sign language serves as a vital communication medium for the deaf community, encompassing a diverse array of signs conveyed through distinct hand shapes along with non-manual gestures like facial expressions and body movements. Accurate recognition of sign language is crucial for bridging the communication gap between deaf and hearing individuals, yet the scarcity of large-scale datasets poses a significant challenge in developing robust recognition technologies. Existing works address this challenge by employing various strategies, such as enhancing visual modules, incorporating pretrained visual models, and leveraging multiple modalities to improve performance and mitigate overfitting. However, the exploration of the contextual module, responsible for modeling long-term dependencies, remains limited. This work introduces an Adversarial Autoencoder for Continuous Sign Language Recognition, AA-CSLR, to address the constraints imposed by limited data availability, leveraging the capabilities of generative models. The integration of pretrained knowledge, coupled with cross-modal alignment, enhances the representation of sign language by effectively aligning visual and textual features. Through extensive experiments on publicly available datasets (PHOENIX-2014, PHOENIX-2014T, and CSL-Daily), we demonstrate the effectiveness of our proposed method in achieving competitive performance in continuous sign language recognition.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Continuous Sign Language Recognition through a Context-Aware Generative Adversarial Network
    Papastratis, Ilias
    Dimitropoulos, Kosmas
    Daras, Petros
    [J]. SENSORS, 2021, 21 (07)
  • [2] Pattern recognition considerations for continuous sign language recognition
    Sherry, G
    Foulds, R
    [J]. PROCEEDINGS OF THE IEEE 29TH ANNUAL NORTHEAST BIOENGINEERING CONFERENCE, 2003, : 291 - 293
  • [3] Subunit sign modeling framework for continuous sign language recognition
    Elakkiya, R.
    Selvamani, K.
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2019, 74 : 379 - 390
  • [4] Video Analysis for Continuous Sign Language Recognition
    Piater, Justus
    Hoyoux, Thomas
    Du, Wei
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : A192 - A195
  • [5] Continuous Sign Language Recognition with Correlation Network
    Hu, Lianyu
    Gao, Liqing
    Liu, Zekang
    Feng, Wei
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2529 - 2539
  • [6] Continuous Sign Language Recognition with Correlation Network
    Hu, Lianyu
    Gao, Liqing
    Liu, Zekang
    Feng, Wei
    [J]. arXiv, 2023,
  • [7] CONTINUOUS SIGN LANGUAGE RECOGNITION VIA REINFORCEMENT LEARNING
    Zhang, Zhihao
    Pu, Junfu
    Zhuang, Liansheng
    Zhou, Wengang
    Li, Houqiang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 285 - 289
  • [8] Multiple Proposals for Continuous Arabic Sign Language Recognition
    Hassan, Mohamed
    Assaleh, Khaled
    Shanableh, Tamer
    [J]. SENSING AND IMAGING, 2019, 20 (1):
  • [9] Iterative Alignment Network for Continuous Sign Language Recognition
    Pu, Junfu
    Zhou, Wengang
    Li, Houqiang
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4160 - 4169
  • [10] Multiple Proposals for Continuous Arabic Sign Language Recognition
    Mohamed Hassan
    Khaled Assaleh
    Tamer Shanableh
    [J]. Sensing and Imaging, 2019, 20