Lexical Representation of Multiword Expressions in Morphologically-complex Languages

被引:6
|
作者
Al-Haj, Hassan [1 ]
Itai, Alon [2 ]
Wintner, Shuly [1 ]
机构
[1] Univ Haifa, Dept Comp Sci, IL-31999 Haifa, Israel
[2] Technion Israel Inst Technol, Dept Comp Sci, IL-32000 Haifa, Israel
基金
以色列科学基金会;
关键词
HEBREW;
D O I
10.1093/ijl/ect036
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
In spite of the surging interest in multiword expressions (MWEs) in recent years, it is still unclear how such expressions should be stored in computational lexicons. This problem is amplified in morphologically-complex languages, where the unique properties of MWEs interact with non-trivial morphological processes. We propose an architecture for lexical representation of MWEs, augmented by a protocol for integrating MWEs into a morphological processing system. The proposal is applied to Modern Hebrew, a Semitic language with complex morphology and a problematic orthography. The result is an integrated system that can morphologically process Hebrew multiword expressions of various types. In light of the complexity of Hebrew morphology and orthography, we are confident that the proposed architecture is general enough so as to accommodate MWEs in a large number of languages.
引用
下载
收藏
页码:130 / 170
页数:41
相关论文
共 42 条