HANDS IN FOCUS: SIGN LANGUAGE RECOGNITION VIA TOP-DOWN ATTENTION

被引：1

作者：

Sarhan, Noha ^{[1
]}

Wilms, Christian ^{[1
]}

Closius, Vanessa ^{[2
]}

Brefeld, Ulf ^{[2
]}

Frintrop, Simone ^{[1
]}

机构：

[1] Univ Hamburg, Dept Informat, Hamburg, Germany

[2] Leuphana Univ Luneburg, Inst Informat Syst, Luneburg, Germany

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023年

关键词：

Sign language recognition; top-down attention; deep learning;

D O I：

10.1109/ICIP49359.2023.10222729

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose a novel Sign Language Recognition (SLR) model that leverages the task-specific knowledge to incorporate Top-Down (TD) attention to focus the processing of the network on the most relevant parts of the input video sequence. For SLR, this includes information about the hands' shape, orientation and positions, and motion trajectory. Our model consists of three streams that process RGB, optical flow and TD attention data. For the TD attention, we generate pixel-precise attention maps focusing on both hands, thereby retaining valuable hand information, while eliminating distracting background information. Our proposed method outperforms state-of-the-art on a challenging large-scale dataset by over 2%, and achieves strong results with a much simpler architecture compared to other systems on the newly released AUTSL dataset [1].

引用

页码：2555 / 2559

页数：5

共 50 条

[1] Top-Down Color Attention for Object Recognition
Khan, Fahad Shahbaz
van de Weijer, Joost
Vanrell, Maria
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 979 - 986
[2] Top-Down Deep Appearance Attention for Action Recognition
Anwer, Rao Muhammad
Khan, Fahad Shahbaz
de Weijer, Joost van
Laaksonen, Jorma
IMAGE ANALYSIS, SCIA 2017, PT I, 2017, 10269 : 297 - 309
[3] Combining ICA and top-down attention for robust speech recognition
Bae, UM
Lee, SY
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 765 - 771
[4] Sequential recognition of superimposed patterns with top-down selective attention
Kim, BT
Lee, SY
COMPUTATIONAL NEUROSCIENCE: TRENDS IN RESEARCH 2004, 2004, : 633 - 640
[5] Sequential recognition of superimposed patterns with top-down selective attention
Kim, BT
Lee, SY
NEUROCOMPUTING, 2004, 58 : 633 - 640
[6] Mechanisms of top-down attention
Baluchi, Farhan
Itti, Laurent
TRENDS IN NEUROSCIENCES, 2011, 34 (04) : 210 - 224
[7] TOP-DOWN LANGUAGE ANALYZER
SMITH, JW
THARP, AL
INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1979, 11 (03): : 325 - 338
[8] Top-down attention control at feature space for robust pattern recognition
Lee, SI
Lee, SY
BIOLOGICALLY MOTIVATED COMPUTER VISION, PROCEEDING, 2000, 1811 : 129 - 138
[9] Top-down attention recurrent VLAD encoding for action recognition in videos
Sudhakaran, Swathikiran
Lanz, Oswald
INTELLIGENZA ARTIFICIALE, 2019, 13 (01) : 107 - 118
[10] Top-Down Attention Recurrent VLAD Encoding for Action Recognition in Videos
Sudhakaran, Swathikiran
Lanz, Oswald
AI*IA 2018 - ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, 11298 : 375 - 386

← 1 2 3 4 5 →