Neural Character-Level Syntactic Parsing for Chinese

Times cited: 0
Authors
Li, Zuchao [1]
Zhou, Junru [1]
Zhao, Hai [1]
Zhang, Zhisong [2]
Li, Haonan [3]
Ju, Yuqi [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
[2] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA USA
[3] Univ Melbourne, Sch Comp & Informat Syst, Melbourne, Vic, Australia
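Funding
National Natural Science Foundation of China
Keywords
WORD SEGMENTATION
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract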
In this work, we explore character-level neural syntactic parsing for Chinese with two typical syntactic formalisms: the constituent formalism and a dependency formalism based on a newly released character-level dependency treebank. Prior works in Chinese parsing have struggled with whether to define words when modeling character interactions. We choose to integrate full character-level syntactic dependency relationships using neural representations from character embeddings and richer linguistic syntactic information from human-annotated character-level Parts-Of-Speech and dependency labels. This has the potential to better understand the deeper structure of Chinese sentences and provides a better structural formalism for avoiding unnecessary structural ambiguities. Specifically, we first compare two different character-level syntax annotation styles: constituency and dependency. Then, we discuss two key problems for character-level parsing: (1) how to combine constituent and dependency syntactic structure in full character-level trees and (2) how to convert from character-level to word-level for both constituent and dependency trees. In addition, we also explore several other key parsing aspects, including different character-level dependency annotations and joint learning of Parts-Of-Speech and syntactic parsing. Finally, we evaluate our models on the Chinese Penn Treebank (CTB) and our published Shanghai Jiao Tong University Chinese Character Dependency Treebank (SCDT). The results show the effectiveness of our model on both constituent and dependency parsing. We further provide empirical analysis and suggest several directions for future study.
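As a rough illustration of the second problem named in the abstract (converting a character-level dependency tree to a word-level one), the following is a minimal sketch and not the authors' procedure. It assumes a hypothetical encoding in which every character carries a 1-based head index (0 for the root) and a boolean flag marking whether its arc is intra-word, and it assumes words are contiguous character spans. The function name `chars_to_words` and the `intra` flag are invented for this example.

```python
from typing import List, Tuple


def chars_to_words(
    chars: List[str], heads: List[int], intra: List[bool]
) -> Tuple[List[str], List[int]]:
    """Collapse a character-level dependency tree into a word-level one.

    chars : the characters of the sentence
    heads : 1-based head index of each character, 0 for the sentence root
    intra : True if the arc from a character to its head stays inside a word
            (hypothetical encoding, assumed for this sketch)
    Returns the words and their 1-based word-level heads (0 for the root).
    """
    n = len(chars)

    def word_root(i: int) -> int:
        # Follow intra-word arcs upward until reaching the character
        # whose head lies outside the word (or is the sentence root).
        while intra[i] and heads[i] != 0:
            i = heads[i] - 1
        return i

    roots = [word_root(i) for i in range(n)]

    words: List[str] = []
    root_of_word: List[int] = []   # index of each word's root character
    char_to_word = [0] * n         # word index of each character
    for i in range(n):
        # Assumption: consecutive characters with the same root form one word.
        if i > 0 and roots[i] == roots[i - 1]:
            words[-1] += chars[i]
        else:
            words.append(chars[i])
            root_of_word.append(roots[i])
        char_to_word[i] = len(words) - 1

    # A word's head is the word containing the head of its root character.
    word_heads = [
        0 if heads[r] == 0 else char_to_word[heads[r] - 1] + 1
        for r in root_of_word
    ]
    return words, word_heads


# Toy example: 我 / 喜欢 / 你, with 欢 attached to 喜 by an intra-word arc.
words, word_heads = chars_to_words(
    list("我喜欢你"), [2, 0, 2, 2], [False, False, True, False]
)
print(words, word_heads)
```

In this toy input the intra-word arc merges 喜 and 欢 into one word, and the remaining inter-word arcs are re-indexed to point at words rather than characters.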
Pages: 461-509
Number of pages: 49
Related papers
50 in total
  • [1] Neural Character-Level Syntactic Parsing for Chinese
    Li, Zuchao
    Zhou, Junru
    Zhao, Hai
    Zhang, Zhisong
    Li, Haonan
    Ju, Yuqi
    [J]. Journal of Artificial Intelligence Research, 2022, 73 : 461 - 509
  • [2] Neural Character-Level Dependency Parsing for Chinese
    Li, Haonan
    Zhang, Zhisong
    Ju, Yuqi
    Zhao, Hai
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5205 - 5212
  • [3] Character-Level Chinese Dependency Parsing
    Zhang, Meishan
    Zhang, Yue
    Che, Wanxiang
    Liu, Ting
    [J]. PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 1326 - 1336
  • [4] Deep Graph-Based Character-Level Chinese Dependency Parsing
    Wu, Linzhi
    Zhang, Meishan
    [J]. IEEE/ACM Transactions on Audio Speech and Language Processing, 2021, 29 : 1329 - 1339
  • [5] Deep Graph-Based Character-Level Chinese Dependency Parsing
    Wu, Linzhi
    Zhang, Meishan
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1329 - 1339
  • [6] A Top-Down Model for Character-Level Chinese Dependency Parsing
    Chen, Yuanmeng
    Liu, Hang
    Zhang, Yujie
    Xu, Jinan
    Chen, Yufeng
    [J]. CHINESE COMPUTATIONAL LINGUISTICS, CCL 2019, 2019, 11856 : 677 - 688
  • [7] Hybrid Attention for Chinese Character-Level Neural Machine Translation
    Wang, Feng
    Chen, Wei
    Yang, Zhen
    Xu, Shuang
    Xu, Bo
    [J]. NEUROCOMPUTING, 2019, 358 : 44 - 52
  • [8] Character Decomposition for Japanese-Chinese Character-Level Neural Machine Translation
    Zhang, Jinyi
    Matsumoto, Tadahiro
    [J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 35 - 40
  • [9] Character-Level Dependency Model for Joint Word Segmentation, POS Tagging, and Dependency Parsing in Chinese
    Guo, Zhen
    Zhang, Yujie
    Su, Chen
    Xu, Jinan
    Isahara, Hitoshi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (01): : 257 - 264
  • [10] Automatically Classifying Chinese Judgment Documents Using Character-Level Convolutional Neural Networks
    Zhou, Xiaosong
    Li, Chuanyi
    Ge, Jidong
    Li, Zhongjin
    Zhou, Xiaoyu
    Luo, Bin
    [J]. PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 430 - 437