Study of sign segmentation in the text of Chinese sign language

被引:0
|
作者
Dengfeng Yao
Minghu Jiang
Yunlong Huang
Abudoukelimu Abulizi
Hanjing Li
机构
[1] Beijing Union University,Beijing Key Lab of Information Service Engineering
[2] Tsinghua University,Lab of Computational Linguistics, School of Humanities
关键词
Chinese sign language (CSL); Phonology; Backward maximum matching (BMM); Conditional random fields (CRFs);
D O I
暂无
中图分类号
学科分类号
摘要
The natural language processing (NLP) of sign language aims to make human sign language “understandable” to computers. In achieving this goal, the text of sign language should first be segmented into sign sequences for computers to recognize. This segmentation process constitutes the basis for the information processing of sign language. With an aim to solve the problems in expressing Chinese sign language (CSL), this paper analyzes the lexical features of CSL and discusses various sign segmentation algorithms used in obtaining computer-read files. Sign segmentation involves two main approaches: The first is rule based, whereas the second is statistics based. Backward maximum matching (BMM) is an important rule-based method widely used in Chinese NLP fields. The recently proposed conditional random fields (CRFs) have also demonstrated excellent performance as a statistical method in international tests. In this study, both the BMM and CRFs methods are employed on the same dataset to explore the practical issues in the sign segmentation of CSL. The results of the CRFs method are then presented and discussed. Our corpus contains only hundreds of sentences; therefore, cross-validation based on CRFs is also performed to avoid the unreliable function that may arise from using an exceedingly small corpus scale within limited processing time. Specifically, three-group twofold cross-validation is applied to analyze the design of the annotation specification and the selection of a feature template. The results validate the effectiveness of our proposed segmentation strategy and confirm that CRFs outperform the BMM method. The proposed approach yields an F-score of 77.4% in sign segmentation in the CSL corpus. The CRFs perform effectively in sign segmentation because they can capture the arbitrary, overlapping features of the input in a Markov model. However, to obtain more satisfactory results, we must rely on the technological development of the sign language corpus.
引用
下载
收藏
页码:725 / 737
页数:12
相关论文
共 50 条
  • [1] Study of sign segmentation in the text of Chinese sign language
    Yao, Dengfeng
    Jiang, Minghu
    Huang, Yunlong
    Abulizi, Abudoukelimu
    Li, Hanjing
    UNIVERSAL ACCESS IN THE INFORMATION SOCIETY, 2017, 16 (03) : 725 - 737
  • [2] Research on word segmentation for Chinese sign language
    Cheng, Yinchao
    Yin, Baocai
    Sun, Yanfeng
    PACLIC 20: PROCEEDINGS OF THE 20TH PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, 2006, : 407 - 413
  • [3] Research on word segmentation for Chinese sign language
    Cheng, Yinchao
    Yin, Baocai
    Sun, Yanfeng
    PACLIC 20 - Proceedings of the 20th Pacific Asia Conference on Language, Information and Computation, 2006, : 407 - 413
  • [4] Text-Driven Chinese Sign Language Synthesis
    徐琳
    高文
    晏洁
    Journal of Harbin Institute of Technology(New series), 1998, (03) : 93 - 98
  • [5] Study on translating Chinese into Chinese sign language
    Xu, L
    Gao, W
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2000, 15 (05) : 485 - 490
  • [6] Study on Translating Chinese into Chinese Sign Language
    徐琳
    高文
    Journal of Computer Science & Technology, 2000, (05) : 485 - 490
  • [7] Study on translating Chinese into Chinese sign language
    Lin Xu
    Wen Gao
    Journal of Computer Science and Technology, 2000, 15 : 485 - 490
  • [8] Sign language to text by SVM
    Travieso, CM
    Alonso, JB
    Ferrer, MA
    SEVENTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOL 2, PROCEEDINGS, 2003, : 435 - 438
  • [9] A Translator for Bangla Text to Sign Language
    Sarkar, Biswajit
    Datta, Kaushik
    Datta, C. D.
    Sarkar, Debranjan
    Dutta, Shashanka J.
    Das Roy, Indranil
    Paul, Amalesh
    Molla, Joshim Uddin
    Paul, Anirban
    2009 ANNUAL IEEE INDIA CONFERENCE (INDICON 2009), 2009, : 406 - +
  • [10] A SEGMENTATION METHOD FOR SIGN LANGUAGE RECOGNITION
    OHIRA, E
    SAGAWA, H
    SAKIYAMA, T
    OHKI, M
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1995, E78D (01) : 49 - 57