Glyce: Glyph-vectors for Chinese Character Representations

被引:0
|
作者
Meng, Yuxian [1 ]
Wu, Wei [1 ]
Wang, Fei [1 ]
Li, Xiaoya [1 ]
Nie, Ping [1 ]
Yin, Fan [1 ]
Li, Muyu [1 ]
Han, Qinghong [1 ]
Sun, Xiaofei [1 ]
Li, Jiwei [1 ]
机构
[1] Shannon AI, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is intuitive that NLP tasks for logographic languages like Chinese should benefit from the use of the glyph information in those languages. However, due to the lack of rich pictographic evidence in glyphs and the weak generalization ability of standard computer vision models on character data, an effective way to utilize the glyph information remains to be found. In this paper, we address this gap by presenting Glyce, the glyph-vectors for Chinese character representations. We make three major innovations: (1) We use historical Chinese scripts (e.g., bronzeware script, seal script, traditional Chinese, etc) to enrich the pictographic evidence in characters; (2) We design CNN structures (called tianzege-CNN) tailored to Chinese character image processing; and (3) We use image-classification as an auxiliary task in a multi-task learning setup to increase the model's ability to generalize. We show that glyph-based models are able to consistently outperform word/char ID-based models in a wide range of Chinese NLP tasks. We are able to set new state-of-the-art results for a variety of Chinese NLP tasks, including tagging (NER, CWS, POS), sentence pair classification, single sentence classification tasks, dependency parsing, and semantic role labeling. For example, the proposed model achieves an F1 score of 80.6 on the OntoNotes dataset of NER, +1.5 over BERT; it achieves an almost perfect accuracy of 99.8% on the Fudan corpus for text classification. (1 2)
引用
收藏
页数:12
相关论文
共 50 条
  • [1] LOCAL CONTEXT INTERACTION-AWARE GLYPH-VECTORS FOR CHINESE SEQUENCE TAGGING
    Lu, Junyu
    Zhang, Pingjian
    [J]. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2022, 2022-May : 8152 - 8156
  • [2] LOCAL CONTEXT INTERACTION-AWARE GLYPH-VECTORS FOR CHINESE SEQUENCE TAGGING
    Lu, Junyu
    Zhang, Pingjian
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8152 - 8156
  • [3] A Structure Character Modeling for Chinese Character Glyph Description
    Wu, Shixiao
    Zheng, Shijue
    [J]. ICECT: 2009 INTERNATIONAL CONFERENCE ON ELECTRONIC COMPUTER TECHNOLOGY, PROCEEDINGS, 2009, : 245 - 248
  • [4] An XML-Based Approach for Chinese Character Glyph Description
    Wu, Shixiao
    Zheng, Shijue
    [J]. PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL II, 2009, : 674 - 677
  • [5] Chinese text recognition enhanced by glyph and character semantic information
    Wu, Shilian
    Li, Yongrui
    Wang, Zengfu
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2024, 27 (01) : 45 - 56
  • [6] Chinese text recognition enhanced by glyph and character semantic information
    Shilian Wu
    Yongrui Li
    Zengfu Wang
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2024, 27 : 45 - 56
  • [7] Beautification of Chinese Character Stroke-Segment-Mesh Glyph Stroke Curve
    Zhang, MaiKu
    Lin, Min
    Huang, HanQuan
    [J]. ADVANCES IN MULTIMEDIA, SOFTWARE ENGINEERING AND COMPUTING, VOL 2, 2011, 129 : 101 - 110
  • [8] Glyph Enhanced Chinese Character Pre-Training for Lexical Sememe Prediction
    Lyu, Boer
    Chen, Lu
    Yu, Kai
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 4549 - 4555
  • [9] A Research on the Stroke-Segment-Mesh (SSM) Glyph Depiction Method of Chinese Character
    Lin, Min
    Song, Rou
    Ge, Shi-Li
    [J]. ALPIT 2008: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, PROCEEDINGS, 2008, : 269 - +
  • [10] Integrating Character Representations into Chinese Word Embedding
    Leshan Normal University, China
    [J]. Lect. Notes Comput. Sci.,