Multi-scale context-aware network for continuous sign language recognition

被引:0
|
作者
Senhua XUE
Liqing GAO
Liang WAN
Wei FENG
机构
[1] CollegeofIntelligenceandComputing,TianjinUniversity
关键词
D O I
暂无
中图分类号
学科分类号
摘要
The hands and face are the most important parts for expressing sign language morphemes in sign language videos. However, we find that existing Continuous Sign Language Recognition(CSLR) methods lack the mining of hand and face information in visual backbones or use expensive and time-consuming external extractors to explore this information. In addition, the signs have different lengths, whereas previous CSLR methods typically use a fixed-length window to segment the video to capture sequential features and then perform global temporal modeling, which disturbs the perception of complete signs. In this study, we propose a Multi-Scale Context-Aware network(MSCA-Net) to solve the aforementioned problems. Our MSCA-Net contains two main modules:(1) Multi-Scale Motion Attention(MSMA), which uses the differences among frames to perceive information of the hands and face in multiple spatial scales, replacing the heavy feature extractors; and(2) Multi-Scale Temporal Modeling(MSTM), which explores crucial temporal information in the sign language video from different temporal scales. We conduct extensive experiments using three widely used sign language datasets, i.e., RWTH-PHOENIX-Weather-2014, RWTH-PHOENIX-Weather-2014T, and CSL-Daily. The proposed MSCA-Net achieve state-of-the-art performance, demonstrating the effectiveness of our approach.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Continuous Sign Language Recognition through a Context-Aware Generative Adversarial Network
    Papastratis, Ilias
    Dimitropoulos, Kosmas
    Daras, Petros
    [J]. SENSORS, 2021, 21 (07)
  • [2] Multi-scale Fusion with Context-aware Network for Object Detection
    Wang, Hanyuan
    Xu, Jie
    Li, Linke
    Tian, Ye
    Xu, Du
    Xu, Shizhong
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2486 - 2491
  • [3] Local Context-aware Self-attention for Continuous Sign Language Recognition
    Zuo, Ronglai
    Mak, Brian
    [J]. INTERSPEECH 2022, 2022, : 4810 - 4814
  • [4] Context-Aware Multi-Scale Aggregation Network for Congested Crowd Counting
    Huang, Liangjun
    Shen, Shihui
    Zhu, Luning
    Shi, Qingxuan
    Zhang, Jianwei
    [J]. SENSORS, 2022, 22 (09)
  • [5] Multi-scale inputs and context-aware aggregation network for stereo matching
    Shi, Liqing
    Xiong, Taiping
    Cui, Gengshen
    Pan, Minghua
    Cheng, Nuo
    Wu, Xiangjie
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (30) : 75171 - 75194
  • [6] MSCANet: A multi-scale context-aware network for remote sensing object detection
    Zhou, Huaping
    Liu, Weidong
    Sun, Kelei
    Wu, Jin
    Wu, Tao
    [J]. EARTH SCIENCE INFORMATICS, 2024,
  • [7] CONTEXT-AWARE HIERARCHICAL FEATURE ATTENTION NETWORK FOR MULTI-SCALE OBJECT DETECTION
    Xu, Xuelong
    Luo, Xiangfeng
    Ma, Liyan
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2011 - 2015
  • [8] Global context-aware multi-scale features aggregative network for salient object detection
    Ullah, Inam
    Jian, Muwei
    Hussain, Sumaira
    Lian, Li
    Ali, Zafar
    Qureshi, Imran
    Guo, Jie
    Yin, Yilong
    [J]. NEUROCOMPUTING, 2021, 455 : 139 - 153
  • [9] Bridging Multi-Scale Context-Aware Representation for Object Detection
    Wang, Boying
    Ji, Ruyi
    Zhang, Libo
    Wu, Yanjun
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (05) : 2317 - 2329
  • [10] Multi-Scale Based Context-Aware Net for Action Detection
    Liu, Haijun
    Wang, Shiguang
    Wang, Wen
    Cheng, Jian
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (02) : 337 - 348