HANDS IN FOCUS: SIGN LANGUAGE RECOGNITION VIA TOP-DOWN ATTENTION

被引:1
|
作者
Sarhan, Noha [1 ]
Wilms, Christian [1 ]
Closius, Vanessa [2 ]
Brefeld, Ulf [2 ]
Frintrop, Simone [1 ]
机构
[1] Univ Hamburg, Dept Informat, Hamburg, Germany
[2] Leuphana Univ Luneburg, Inst Informat Syst, Luneburg, Germany
关键词
Sign language recognition; top-down attention; deep learning;
D O I
10.1109/ICIP49359.2023.10222729
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel Sign Language Recognition (SLR) model that leverages the task-specific knowledge to incorporate Top-Down (TD) attention to focus the processing of the network on the most relevant parts of the input video sequence. For SLR, this includes information about the hands' shape, orientation and positions, and motion trajectory. Our model consists of three streams that process RGB, optical flow and TD attention data. For the TD attention, we generate pixel-precise attention maps focusing on both hands, thereby retaining valuable hand information, while eliminating distracting background information. Our proposed method outperforms state-of-the-art on a challenging large-scale dataset by over 2%, and achieves strong results with a much simpler architecture compared to other systems on the newly released AUTSL dataset [1].
引用
收藏
页码:2555 / 2559
页数:5
相关论文
共 50 条
  • [1] Top-Down Color Attention for Object Recognition
    Khan, Fahad Shahbaz
    van de Weijer, Joost
    Vanrell, Maria
    2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 979 - 986
  • [2] Top-Down Deep Appearance Attention for Action Recognition
    Anwer, Rao Muhammad
    Khan, Fahad Shahbaz
    de Weijer, Joost van
    Laaksonen, Jorma
    IMAGE ANALYSIS, SCIA 2017, PT I, 2017, 10269 : 297 - 309
  • [3] Combining ICA and top-down attention for robust speech recognition
    Bae, UM
    Lee, SY
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 765 - 771
  • [4] Sequential recognition of superimposed patterns with top-down selective attention
    Kim, BT
    Lee, SY
    COMPUTATIONAL NEUROSCIENCE: TRENDS IN RESEARCH 2004, 2004, : 633 - 640
  • [5] Sequential recognition of superimposed patterns with top-down selective attention
    Kim, BT
    Lee, SY
    NEUROCOMPUTING, 2004, 58 : 633 - 640
  • [6] Mechanisms of top-down attention
    Baluchi, Farhan
    Itti, Laurent
    TRENDS IN NEUROSCIENCES, 2011, 34 (04) : 210 - 224
  • [7] TOP-DOWN LANGUAGE ANALYZER
    SMITH, JW
    THARP, AL
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1979, 11 (03): : 325 - 338
  • [8] Top-down attention control at feature space for robust pattern recognition
    Lee, SI
    Lee, SY
    BIOLOGICALLY MOTIVATED COMPUTER VISION, PROCEEDING, 2000, 1811 : 129 - 138
  • [9] Top-down attention recurrent VLAD encoding for action recognition in videos
    Sudhakaran, Swathikiran
    Lanz, Oswald
    INTELLIGENZA ARTIFICIALE, 2019, 13 (01) : 107 - 118
  • [10] Top-Down Attention Recurrent VLAD Encoding for Action Recognition in Videos
    Sudhakaran, Swathikiran
    Lanz, Oswald
    AI*IA 2018 - ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, 11298 : 375 - 386