Constituency Parsing of Bulgarian: Word- vs. Class-based Parsing

Authors
Ghayoomi, Masood [1]
Simov, Kiril [2]
Osenova, Petya [2]
Affiliations
[1] Free Univ Berlin, Dept Math & Comp Sci, Berlin, Germany
[2] IICT BAS, Linguist Modelling Dept, Sofia, Bulgaria
Keywords
Constituency Parsing; Word Clustering; the Bulgarian Language; Treebanking
DOI
Not available
Chinese Library Classification
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
In this paper, we report the results of two constituency parsers trained on BulTreeBank, an HPSG-based treebank for Bulgarian. To reduce the data sparsity problem, we propose using Brown word clustering to perform an off-line clustering and map the words in the treebank to their clusters, creating a class-based treebank. The observations show that when the classes outnumber the POS tags, the results are better. Since this approach adds another dimension of abstraction (in comparison to the lemma), its coarse-grained representation can be used further for training statistical parsers.
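
The word-to-class mapping described in the abstract can be illustrated with a minimal sketch. It assumes a Penn-style bracketed tree and a precomputed word-to-cluster table. Neither is taken from the paper: the tree format, the table contents, the `UNK` fallback, and the names `WORD_TO_CLASS` and `to_class_based` are all hypothetical, and the clustering step itself is not shown.

```python
import re

# Hypothetical word-to-cluster table, e.g. loaded from the output of a
# Brown clustering run. The words and bit strings are invented for illustration.
WORD_TO_CLASS = {
    "the": "0010",
    "dog": "1101",
    "cat": "1101",
    "sleeps": "0111",
}

UNKNOWN_CLASS = "UNK"  # assumed fallback for words that have no cluster

# Matches a preterminal "(TAG word)": a tag, whitespace, then a word
# containing no brackets or whitespace.
LEAF = re.compile(r"\(([^\s()]+)\s+([^\s()]+)\)")


def to_class_based(tree: str, word_to_class: dict[str, str]) -> str:
    """Replace each terminal word in a bracketed tree with its cluster ID."""

    def swap(match: re.Match) -> str:
        tag, word = match.group(1), match.group(2)
        # Lookup is lowercased here; this normalization is an assumption.
        cls = word_to_class.get(word.lower(), UNKNOWN_CLASS)
        return f"({tag} {cls})"

    return LEAF.sub(swap, tree)


tree = "(S (NP (DT The) (NN dog)) (VP (VBZ sleeps)))"
print(to_class_based(tree, WORD_TO_CLASS))
```

In this toy table "dog" and "cat" share one class, so trees containing either word yield the same terminal symbol, which is the sparsity reduction the abstract refers to.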
Pages: 4056-4060
Number of pages: 5