Isolated Sign Language Recognition with Grassmann Covariance Matrices

被引:40
|
作者
Wang, Hanjie [1 ,2 ]
Chai, Xiujuan [1 ,2 ]
Hong, Xiaopeng [3 ]
Zhao, Guoying [3 ]
Chen, Xilin [1 ,2 ]
机构
[1] Chinese Acad Sci, Key Lab Intelligent Informat Proc, Inst Comp Technol, Beijing 100190, Peoples R China
[2] Cooperat Medianet Innovat Ctr, Beijing, Peoples R China
[3] Univ Oulu, Fac Informat Technol & Elect Engn, SF-90100 Oulu, Finland
关键词
Hearing loss; sign language; covariance matrix; Grassmann manifold;
D O I
10.1145/2897735
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this article, to utilize long-term dynamics over an isolated sign sequence, we propose a covariance matrix-based representation to naturally fuse information from multimodal sources. To tackle the drawback induced by the commonly used Riemannian metric, the proximity of covariance matrices is measured on the Grassmann manifold. However, the inherent Grassmann metric cannot be directly applied to the covariance matrix. We solve this problem by evaluating and selecting the most significant singular vectors of covariance matrices of sign sequences. The resulting compact representation is called the Grassmann covariance matrix. Finally, the Grassmann metric is used to be a kernel for the support vector machine, which enables learning of the signs in a discriminative manner. To validate the proposed method, we collect three challenging sign language datasets, on which comprehensive evaluations show that the proposed method outperforms the state-of-the-art methods both in accuracy and computational cost.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Australian sign language recognition
    Eun-Jung Holden
    Gareth Lee
    Robyn Owens
    Machine Vision and Applications, 2005, 16 : 312 - 320
  • [42] On the use of graph parsing for recognition of isolated hand postures of Polish Sign Language
    Flasinski, Mariusz
    Myslinski, Szymon
    PATTERN RECOGNITION, 2010, 43 (06) : 2249 - 2264
  • [43] TIM-SLR: a lightweight network for video isolated sign language recognition
    Wang, Fei
    Zhang, Libo
    Yan, Hao
    Han, Shuai
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (30): : 22265 - 22280
  • [44] Block-based histogram of optical flow for isolated sign language recognition
    Lim, Kian Ming
    Tan, Alan W. C.
    Tan, Shing Chiang
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 40 : 538 - 545
  • [45] Covariance of the grassmann star product
    Daoud, M
    REPORTS ON MATHEMATICAL PHYSICS, 2003, 52 (02) : 281 - 294
  • [46] A recognition method of sign language sentences for Japanese sign language translation
    Sagawa, H
    Takeuchi, M
    PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : A858 - A861
  • [47] Collection and Preprocessing of Czech Sign Language Corpus for Sign Language Recognition
    Campr, Pavel
    Hruz, Marek
    Trojanova, Jana
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 3175 - 3178
  • [48] Sign Language Recognition: Learning American Sign Language in a Virtual Environment
    Schioppo, Jacob
    Meyer, Zachary
    Fabiano, Diego
    Canavan, Shaun
    CHI EA '19 EXTENDED ABSTRACTS: EXTENDED ABSTRACTS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
  • [49] RECOGNITION OF MULTIDIMENSIONAL NORMAL ENSEMBLES WITH VARIOUS COVARIANCE MATRICES
    FOMIN, YA
    SAVICH, AV
    RADIOTEKHNIKA I ELEKTRONIKA, 1987, 32 (11): : 2311 - 2319
  • [50] A combination between VQ and Covariance Matrices for speaker recognition
    Faúndez-Zanuy, M
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 453 - 456