Semantic Boundary Detection With Reinforcement Learning for Continuous Sign Language Recognition

被引:30
|
作者
Wei, Chengcheng [1 ]
Zhao, Jian [1 ]
Zhou, Wengang [1 ]
Li, Houqiang [1 ]
机构
[1] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Peoples R China
关键词
Semantics; Hidden Markov models; Assistive technology; Reinforcement learning; Gesture recognition; Task analysis; Measurement; Sign language recognition; reinforcement learning; semantic boundary; weakly supervised learning; VISUAL TRACKING;
D O I
10.1109/TCSVT.2020.2999384
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Sign language recognition (SLR) is a significant and promising technique to facilitate the communication for the hearing-impaired people. In this paper, we are dedicated to weakly supervised continuous SLR, where for each sign video, there are only ordered gloss labels without temporal boundary along frames. To explicitly align video frames to the sign words in a sign video, we propose a novel semantic boundary detection method based on reinforcement learning for accurate continuous SLR. In our approach, we first propose a multi-scale perception scheme to learn discriminative representation for video clips. Then, we formulate the semantic boundary detection as a reinforcement learning problem. We define the state as the feature representation of a video segment, and the action as the determination of the semantic boundary's location. The reward is computed by the quantitative performance metric between the prediction sentence and the ground truth sentence. The policy network is trained with a policy gradient algorithm. Extensive experiments are conducted on CSL Split II and RWTH-PHOENIX-Weather 2014 datasets, and the results demonstrate the effectiveness and superiority of our method.
引用
收藏
页码:1138 / 1149
页数:12
相关论文
共 50 条
  • [1] CONTINUOUS SIGN LANGUAGE RECOGNITION VIA REINFORCEMENT LEARNING
    Zhang, Zhihao
    Pu, Junfu
    Zhuang, Liansheng
    Zhou, Wengang
    Li, Houqiang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 285 - 289
  • [2] Visual feature segmentation with reinforcement learning for continuous sign language recognition
    Yuchun Fang
    Liangjun Wang
    Shiquan Lin
    Lan Ni
    [J]. International Journal of Multimedia Information Retrieval, 2023, 12
  • [3] Visual feature segmentation with reinforcement learning for continuous sign language recognition
    Fang, Yuchun
    Wang, Liangjun
    Lin, Shiquan
    Ni, Lan
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2023, 12 (02)
  • [4] Dynamical semantic enhancement network for continuous sign language recognition
    Wang, Suyang
    Guo, Leming
    Xue, Wanli
    [J]. Multimedia Systems, 2024, 30 (06)
  • [5] Continuous Sign Language Recognition Based on Pseudo-supervised Learning
    Pei, Xiankun
    Guo, Dan
    Zhao, Ye
    [J]. PROCEEDINGS OF THE 2ND WORKSHOP ON MULTIMEDIA FOR ACCESSIBLE HUMAN COMPUTER INTERFACES (MAHCI '19), 2019, : 33 - 39
  • [6] Self-Mutual Distillation Learning for Continuous Sign Language Recognition
    Hao, Aiming
    Min, Yuecong
    Chen, Xilin
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11283 - 11292
  • [7] Deepsign: Sign Language Detection and Recognition Using Deep Learning
    Kothadiya, Deep
    Bhatt, Chintan
    Sapariya, Krenil
    Patel, Kevin
    Gil-Gonzalez, Ana-Belen
    Corchado, Juan M.
    [J]. ELECTRONICS, 2022, 11 (11)
  • [8] Pattern recognition considerations for continuous sign language recognition
    Sherry, G
    Foulds, R
    [J]. PROCEEDINGS OF THE IEEE 29TH ANNUAL NORTHEAST BIOENGINEERING CONFERENCE, 2003, : 291 - 293
  • [9] Continuous Sign Language Recognition System Using Deep Learning with MediaPipe Holistic
    Srivastava, Sharvani
    Singh, Sudhakar
    Pooja, Shiv
    Prakash, Shiv
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2024, 137 (03) : 1455 - 1468
  • [10] An Unsupervised Learning Approach to Handle Movement Epenthesis in Continuous Sign Language Recognition
    Nayan, Navneet
    Ghosh, Debashis
    Pradhan, Pyari Mohan
    [J]. 2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 862 - 867