Empirical Exploring Word-Character Relationship for Chinese Sentence Representation

Cited by: 8
Authors
Wang, Shaonan [1]
Zhang, Jiajun [1]
Zong, Chengqing [2]
Affiliations
[1] Univ Chinese Acad Sci, Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Intelligence Bldg,498 95 Zhongguancun East Rd, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Chinese Acad Sci, CAS Ctr Excellence Brain Sci & Intelligence Techn, Natl Lab Pattern Recognit,Inst Automat, Intelligence Bldg,498 95 Zhongguancun East Rd, Beijing 100190, Peoples R China
Keywords
Sentence representation; composition model; inner-word character; mixed character-word representation; mask gate; max pooling
DOI
10.1145/3156778
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This article addresses the problem of learning compositional Chinese sentence representations, which represent the meaning of a sentence by composing the meanings of its constituent words. Unlike an English word, a Chinese word is composed of characters that themselves carry rich semantic information, yet existing methods have not fully exploited this information. In this work, we introduce a novel mixed character-word architecture that improves Chinese sentence representations by utilizing the semantic information of inner-word characters. We propose two strategies to this end. The first applies a mask gate to characters to learn the relations among the characters in a word. The second applies a max-pooling operation to words to adaptively find the optimal mixture of the atomic and compositional word representations. The proposed architecture is then applied to various sentence composition models and achieves substantial performance gains over baseline models on the sentence similarity task. To further verify the generalization ability of our model, we use the learned sentence representations as features in sentence classification, question classification, and sentence entailment tasks. The results show that the proposed mixed character-word sentence representation models outperform both character-based and word-based models.
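The two strategies named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the sigmoid form of the gate, the summation used to compose characters, and the fact that gate logits are passed in directly (rather than produced by a learned layer) are all assumptions made for illustration.

```python
import math


def sigmoid(x):
    """Logistic function, used here as a hypothetical gate activation."""
    return 1.0 / (1.0 + math.exp(-x))


def mask_gate_compose(char_vecs, gate_logits):
    """Compose a word vector from its character vectors.

    Each character vector is scaled element-wise by a gate in (0, 1),
    then the gated vectors are summed. In a trained model the logits
    would come from learned parameters; here they are plain inputs
    (an assumption of this sketch).
    """
    dim = len(char_vecs[0])
    composed = [0.0] * dim
    for vec, logits in zip(char_vecs, gate_logits):
        for d in range(dim):
            composed[d] += sigmoid(logits[d]) * vec[d]
    return composed


def mix_word(atomic_vec, composed_vec):
    """Element-wise max pooling over the atomic (whole-word) vector
    and the compositional (character-derived) vector."""
    return [max(a, c) for a, c in zip(atomic_vec, composed_vec)]


# Toy two-character word with 2-dimensional embeddings.
chars = [[1.0, -2.0], [0.5, 4.0]]
logits = [[0.0, 0.0], [0.0, 0.0]]  # sigmoid(0) = 0.5 for every gate
atomic = [0.2, 3.0]

composed = mask_gate_compose(chars, logits)
print(mix_word(atomic, composed))  # prints [0.75, 3.0]
```

In this toy case the first dimension of the mixed vector comes from the character composition and the second from the atomic word embedding, which is the kind of per-dimension selection that max pooling allows.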
Pages: 18