Glyce: Glyph-vectors for Chinese Character Representations

Authors
Meng, Yuxian [1]
Wu, Wei [1]
Wang, Fei [1]
Li, Xiaoya [1]
Nie, Ping [1]
Yin, Fan [1]
Li, Muyu [1]
Han, Qinghong [1]
Sun, Xiaofei [1]
Li, Jiwei [1]
Affiliation
[1] Shannon AI, Beijing, People's Republic of China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
It is intuitive that NLP tasks for logographic languages like Chinese should benefit from the glyph information in those languages. However, because glyphs lack rich pictographic evidence and standard computer vision models generalize poorly on character data, an effective way to use glyph information remains to be found. In this paper, we address this gap by presenting Glyce, the glyph-vectors for Chinese character representations. We make three major innovations: (1) we use historical Chinese scripts (e.g., bronzeware script, seal script, traditional Chinese, etc.) to enrich the pictographic evidence in characters; (2) we design CNN structures (called tianzege-CNN) tailored to Chinese character image processing; and (3) we use image classification as an auxiliary task in a multi-task learning setup to increase the model's ability to generalize. We show that glyph-based models consistently outperform word/char ID-based models in a wide range of Chinese NLP tasks. We set new state-of-the-art results for a variety of Chinese NLP tasks, including tagging (NER, CWS, POS), sentence pair classification, single sentence classification, dependency parsing, and semantic role labeling. For example, the proposed model achieves an F1 score of 80.6 on the OntoNotes dataset of NER, +1.5 over BERT; it achieves an almost perfect accuracy of 99.8% on the Fudan corpus for text classification.
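Innovation (3) in the abstract, image classification as an auxiliary task, can be illustrated with a minimal sketch of a combined training objective. The abstract does not give the loss formula, so everything specific here is an assumption made for illustration: the function names, the convex combination of the two losses, and the geometrically decaying weight with its constants `lam0` and `decay`.

```python
import math


def cross_entropy(logits, target):
    """Negative log-likelihood of class `target` under softmax(logits)."""
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]


def aux_weight(step, lam0=0.1, decay=0.8):
    """Hypothetical schedule: the auxiliary weight shrinks geometrically."""
    return lam0 * decay ** step


def multitask_loss(task_logits, task_label, glyph_logits, char_id, step):
    """Mix the main NLP task loss with the glyph image-classification loss.

    task_logits / task_label: scores and gold label for the main task.
    glyph_logits / char_id: scores from the glyph image encoder and the
    identity of the character shown in the image (the auxiliary target).
    """
    lam = aux_weight(step)
    task = cross_entropy(task_logits, task_label)
    aux = cross_entropy(glyph_logits, char_id)
    return (1.0 - lam) * task + lam * aux
```

Under these assumptions the auxiliary term acts as a regularizer early in training and fades as `step` grows, so the main task dominates later updates. A call such as `multitask_loss([2.0, 0.5], 0, [0.1, 0.3, 1.2], 2, step=3)` returns a single scalar to minimize.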
Pages: 12