Lexical Representation of Multiword Expressions in Morphologically-complex Languages

被引:6
|
作者
Al-Haj, Hassan [1 ]
Itai, Alon [2 ]
Wintner, Shuly [1 ]
机构
[1] Univ Haifa, Dept Comp Sci, IL-31999 Haifa, Israel
[2] Technion Israel Inst Technol, Dept Comp Sci, IL-32000 Haifa, Israel
基金
以色列科学基金会;
关键词
HEBREW;
D O I
10.1093/ijl/ect036
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
In spite of the surging interest in multiword expressions (MWEs) in recent years, it is still unclear how such expressions should be stored in computational lexicons. This problem is amplified in morphologically-complex languages, where the unique properties of MWEs interact with non-trivial morphological processes. We propose an architecture for lexical representation of MWEs, augmented by a protocol for integrating MWEs into a morphological processing system. The proposal is applied to Modern Hebrew, a Semitic language with complex morphology and a problematic orthography. The result is an integrated system that can morphologically process Hebrew multiword expressions of various types. In light of the complexity of Hebrew morphology and orthography, we are confident that the proposed architecture is general enough so as to accommodate MWEs in a large number of languages.
引用
收藏
页码:130 / 170
页数:41
相关论文
共 42 条
  • [1] LEXICAL REPRESENTATION OF MORPHOLOGICALLY COMPLEX WORDS
    DREWS, E
    [J]. BULLETIN OF THE PSYCHONOMIC SOCIETY, 1991, 29 (06) : 491 - 491
  • [2] A lexical database of Portuguese multiword expressions
    Antunes, Sandra
    Bacelar do Nascimento, Maria Fernanda
    Casteleiro, Joao Miguel
    Mendes, Amalia
    Pereira, Luisa
    Sa, Tiago
    [J]. COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROCEEDINGS, 2006, 3960 : 238 - 243
  • [3] Multiword Expressions Dataset for Indian Languages
    Singh, Dhirendra
    Bhingardive, Sudha
    Bhattacharyya, Pushpak
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 2331 - 2335
  • [4] MENTAL REPRESENTATION OF MORPHOLOGICALLY COMPLEX WORDS AND LEXICAL ACCESS
    SEGUI, J
    ZUBIZARRETA, ML
    [J]. LINGUISTICS, 1985, 23 (05) : 759 - 774
  • [5] Predicting Morphologically-Complex Unknown Words in Igbo
    Onyenwe, Ikechukwu E.
    Hepple, Mark
    [J]. TEXT, SPEECH, AND DIALOGUE, 2016, 9924 : 206 - 214
  • [6] Creation of lexical resources for a characterisation of multiword expressions in Italian
    Zaninello, Andrea
    Nissim, Malvina
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010,
  • [7] Dictionary of Multiword Expressions for Translation into Highly Inflected Languages
    Deksne, Daiga
    Skadins, Raivis
    Skadina, Inguna
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1401 - 1405
  • [8] Influence of Treebank Design on Representation of Multiword Expressions
    Bejcek, Eduard
    Stranak, Pavel
    Zeman, Daniel
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PT I, 2011, 6608 : 1 - 14
  • [9] Representation Learning of Multiword Expressions with Compositionality Constraint
    Li, Minglei
    Lu, Qin
    Long, Yunfei
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2017): 10TH INTERNATIONAL CONFERENCE, KSEM 2017, MELBOURNE, VIC, AUSTRALIA, AUGUST 19-20, 2017, PROCEEDINGS, 2017, 10412 : 507 - 519
  • [10] Comparing lexical and grammatical development in morphologically different languages
    Kovacevic, M
    Jelaska, Z
    Brozovic, B
    [J]. PERSPECTIVES ON LANGUAGE ACQUISITION, 1998, : 368 - 383