Vietnamese Span-based Constituency Parsing with BERT Embedding

被引:0
|
作者
Phan, Thi-Phuong-Uyen [1 ]
Huynh, Ngoc-Thanh-Tung [1 ]
Truong, Hung-Thinh [1 ]
机构
[1] Univ Sci VNU HCMC, Fac Informat Technol, Ho Chi Minh City, Vietnam
关键词
constituency parsing; span-based parsing; contextualized word representation;
D O I
10.1109/kse.2019.8919467
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Syntactic structure of sentences obtained from Constituency Parsing is fundamental information in many Natural Language Processing tasks. However, due to the lack of available resources and the complex linguistic features of Vietnamese, the research into Constituency Parsing has not received enough attention in this language. To the best of our knowledge, the study presented in this paper is one of the first investigations to explore this task in Vietnamese. In this work, we present a Span-based approach which focuses on representing spans through the use of contextualized pre-trained embeddings to obtain optimal parse trees for Vietnamese sentences. The conducted experiments indicate that our system achieved promising results on the VLSP Vietnamese Treebank dataset by significantly outperforming existing methods. The results of this study support the view that encoding context information into the representation of words is effective in improving the parsing performance of Vietnamese. Consequently, this idea can be generalized to apply to other tasks such as Dependency Parsing or other low-resource languages.
引用
收藏
页码:293 / 299
页数:7
相关论文
共 50 条
  • [1] A Minimal Span-Based Neural Constituency Parser
    Stern, Mitchell
    Andreas, Jacob
    Klein, Dan
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 818 - 827
  • [2] A New Representation for Span-based CCG Parsing
    Kato, Yoshihide
    Matsubara, Shigeki
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 10579 - 10584
  • [3] Span-Based LCFRS-2 Parsing
    Stanojevic, Milos
    Steedman, Mark
    [J]. 16TH INTERNATIONAL CONFERENCE ON PARSING TECHNOLOGIES AND IWPT 2020 SHARED TASK ON PARSING INTO ENHANCED UNIVERSAL DEPENDENCIES, 2020, : 111 - 121
  • [4] Span-based Semantic Parsing for Compositional Generalization
    Herzig, Jonathan
    Berant, Jonathan
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 908 - 921
  • [5] Improving Constituency Parsing with Span Attention
    Tian, Yuanhe
    Song, Yan
    Xia, Fei
    Zhang, Tong
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1691 - 1703
  • [6] Span-based Hierarchical Semantic Parsing for Task-Oriented Dialog
    Pasupae, Panupong
    Gupta, Sonal
    Mandyam, Karishma
    Shah, Rushin
    Lewis, Mike
    Zettlemoyer, Luke
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1520 - 1526
  • [7] An Empirical Study for Vietnamese Constituency Parsing with Pre-training
    Tuan-Vi Tran
    Xuan-Thien Pham
    Duc-Vu Nguyen
    Kiet Van Nguyen
    Ngan Luu-Thuy Nguyen
    [J]. 2021 RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF 2021), 2021, : 234 - 239
  • [8] A Span-based Target-aware Relation Model for Frame-semantic Parsing
    Su, Xuefeng
    Li, Ru
    Li, Xiaoli
    Chang, Baobao
    Hu, Zhiwei
    Han, Xiaoqi
    Yan, Zhichao
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (03)
  • [9] Span-based discontinuous constituency parsing: a family of exact chart-based algorithms with time complexities from O(n6) down to O(n3)
    Corro, Caio
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2753 - 2764
  • [10] BERT-Proof Syntactic Structures: Investigating Errors in Discontinuous Constituency Parsing
    Coavoux, Maximin
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 3259 - 3272