Construction of an English Dependency Corpus incorporating Compound Function Words

被引:0
|
作者
Kato, Akihiko [1 ]
Shindo, Hiroyuki [1 ]
Matsumoto, Yuji [1 ]
机构
[1] Nara Inst Sci Technol, 8916-5 Takayama, Nara 6300192, Japan
关键词
MultiWord Expressions; Dependency Parsing;
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
The recognition of multiword expressions (MWEs) in a sentence is important for such linguistic analyses as syntactic and semantic parsing, because it is known that combining an MWE into a single token improves accuracy for various NLP tasks, such as dependency parsing and constituency parsing. However, MWEs are not annotated in Penn Treebank. Furthermore, when converting word-based dependency to MWE-aware dependency directly, one could combine nodes in an MWE into a single node. Nevertheless, this method often leads to the following problem: A node derived from an MWE could have multiple heads and the whole dependency structure including MWE might be cyclic. Therefore we converted a phrase structure to a dependency structure after establishing an MWE as a single subtree. This approach can avoid an occurrence of multiple heads and/or cycles. In this way, we constructed an English dependency corpus taking into account compound function words, which are one type of MWEs that serve as functional expressions. In addition, we report experimental results of dependency parsing using a constructed corpus.
引用
收藏
页码:1667 / 1671
页数:5
相关论文
共 50 条
  • [1] The Construction of English New Words Corpus Based on Decision Tree Algorithm
    Gao, Hongxia
    MACHINE LEARNING, IMAGE PROCESSING, NETWORK SECURITY AND DATA SCIENCES, MIND 2022, PT I, 2022, 1762 : 337 - 344
  • [2] A Gold Standard Dependency Corpus for English
    Silveira, Natalia
    Dozat, Timothy
    de Marneffe, Marie-Catherine
    Bowman, Samuel R.
    Connor, Miriam
    Bauer, John
    Manning, Christopher D.
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 2897 - 2904
  • [3] The Stress of English Compound Words
    Hsu Hai-lan
    武汉大学学报(人文科学), 1963, (02) : 101 - 116
  • [4] EXTRACTING ENGLISH WORDS FROM A CORPUS OF CROATIAN
    Borucinsky, Mirjana
    Bogunovic, Irena
    FLUMINENSIA, 2022, 34 (02): : 435 - 461
  • [5] COMPOUND WORDS IN MILTON ENGLISH POETRY
    BURNETT, A
    MODERN LANGUAGE REVIEW, 1980, 75 (JUL): : 492 - 506
  • [6] Incorporating sociolinguistic information into a diachronic corpus of English
    Raumolin-Brunberg, H
    TRACING THE TRAIL OF TIME, 1997, (18): : 105 - 117
  • [7] Experience with compound words influences their processing: An eye movement investigation with English compound words
    Juhasz, Barbara J.
    QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2018, 71 (01): : 103 - 112
  • [8] Ambiguity Avoidance by Means of Function Words in English? Providing Additional Corpus-based Counterevidence
    Rohdenburg, Guenter
    ZEITSCHRIFT FUR ANGLISTIK UND AMERIKANISTIK, 2021, 69 (03): : 207 - 236
  • [9] Joinings: Compound Words in Old English Literature
    Dance, Richard
    SPECULUM-A JOURNAL OF MEDIEVAL STUDIES, 2017, 92 (04): : 1182 - 1183
  • [10] JOININGS: COMPOUND WORDS IN OLD ENGLISH LITERATURE
    Magennis, Hugh
    JOURNAL OF ENGLISH AND GERMANIC PHILOLOGY, 2018, 117 (03): : 404 - 406