Compositionality and lexical alignment of multi-word terms

Authors
Emmanuel Morin
Béatrice Daille
Affiliations
[1] Université de Nantes
[2] LINA-UMR CNRS 6241
Source
Language Resources and Evaluation, 2010, 44 (1-2)
Keywords
Terminology mining; Comparable corpora; Lexical alignment; Compositional translation
DOI
Not available
Abstract
The automatic compilation of bilingual term lists from specialized comparable corpora using lexical alignment has been successful for single-word terms (SWTs), but remains disappointing for multi-word terms (MWTs). The main reported problems are the low frequency of MWTs and the variability of their syntactic structures in the source and target languages. This paper defines a general framework dedicated to the lexical alignment of MWTs from comparable corpora that includes a compositional translation process and the standard lexical context analysis. Because the compositional method, which is based on the translation of lexical items, is restrictive, we introduce an extended compositional method that bridges the gap between MWTs of different syntactic structures through morphological links. We experimented with the two compositional methods on the French–Japanese alignment task. The results show a significant improvement for the translation of MWTs and advocate further morphological analysis in lexical alignment.
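The two compositional methods mentioned in the abstract can be illustrated with a minimal sketch. It shows one common reading of compositional translation, not the authors' implementation: each component of a source MWT is looked up in a bilingual dictionary, the component translations are recombined in every order, and only combinations attested in a list of target terms are kept. The extended variant additionally follows a morphological link (here, from a relational adjective to its base noun) when a component has no dictionary entry. The dictionary, the morphological links, the target term list, and all function names are hypothetical toy data chosen for illustration.

```python
from itertools import permutations, product

# Hypothetical toy resources (illustrative only, not taken from the paper).
TOY_DICT = {
    "fatigue": ["疲労"],
    "chronique": ["慢性"],
    "pression": ["圧"],
    "sang": ["血"],
}
# Assumed morphological links: relational adjective -> base noun.
MORPH_LINKS = {"sanguin": "sang"}
# Assumed target-language term candidates extracted from the target corpus.
TARGET_TERMS = {"慢性疲労", "血圧"}


def translate_component(word, use_morph_links):
    """Return dictionary translations of one lemma, optionally via a morphological link."""
    if word in TOY_DICT:
        return TOY_DICT[word]
    if use_morph_links and word in MORPH_LINKS:
        return TOY_DICT.get(MORPH_LINKS[word], [])
    return []


def compositional_candidates(term, use_morph_links=False):
    """Translate a lemmatized MWT component by component and keep attested recombinations."""
    options = [translate_component(w, use_morph_links) for w in term.split()]
    if not all(options):
        return []  # at least one component has no translation
    found = set()
    for combo in product(*options):
        for order in permutations(combo):  # word order may differ across languages
            candidate = "".join(order)
            if candidate in TARGET_TERMS:
                found.add(candidate)
    return sorted(found)


print(compositional_candidates("fatigue chronique"))
print(compositional_candidates("pression sanguin"))
print(compositional_candidates("pression sanguin", use_morph_links=True))
```

In this toy setting the plain method fails on the lemmatized term "pression sanguin" because the adjective has no dictionary entry, while the extended variant reaches a candidate through the assumed adjective-to-noun link.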
Pages: 79-95
Number of pages: 16
Related papers
  • [1] Compositionality and lexical alignment of multi-word terms
    Morin, Emmanuel
    Daille, Beatrice
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2010, 44 (1-2) : 79 - 95
  • [2] Lexical selection in multi-word production
    Janssen, Niels
    Caramazza, Alfonso
    [J]. FRONTIERS IN PSYCHOLOGY, 2011, 2
  • [3] On the Structural Disambiguation of Multi-word Terms
    Cabezas-Garcia, Melania
    Leon-Arauz, Pilar
    [J]. COMPUTATIONAL AND CORPUS-BASED PHRASEOLOGY, EUROPHRAS 2019, 2019, 11755 : 46 - 60
  • [4] Multi-word terms selection for information retrieval
    Bechikh Ali, Chedi
    Haddad, Hatem
    Slimani, Yahya
    [J]. INFORMATION DISCOVERY AND DELIVERY, 2023, 51 (01) : 74 - 87
  • [5] Towards a graded lexical inventory of multi-word combinations
    Vicente, Rocio Cuberos
    Villegas, Elisa Rosado
    Navarrete, Iban Manas
    [J]. ITL-INTERNATIONAL JOURNAL OF APPLIED LINGUISTICS, 2024,
  • [6] Word Embedding Approach for Synonym Extraction of Multi-Word Terms
    Hazem, Amir
    Daille, Beatrice
    [J]. PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018 : 297 - 303
  • [7] Semantic prosody and semantic preference in multi-word terms
    Cabezas-Garcia, Melania
    Faber, Pamela
    [J]. FACHSPRACHE-JOURNAL OF PROFESSIONAL AND SCIENTIFIC COMMUNICATION, 2019, 41 (1-2) : 2 - 21
  • [8] Head to Head: Semantic Similarity of Multi-Word Terms
    Spasic, Irena
    Corcoran, Padraig
    Gagarin, Andrei
    Buerki, Andreas
    [J]. IEEE ACCESS, 2018, 6 : 20545 - 20557
  • [9] Vector representations of multi-word terms for semantic relatedness
    Henry, Sam
    Cuffy, Clint
    McInnes, Bridget T.
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 77 : 111 - 119
  • [10] Planning and production of grammatical and lexical verbs in multi-word messages
    Lange, Violaine Michel
    Messerschmidt, Maria
    Harder, Peter
    Siebner, Hartwig Roman
    Boye, Kasper
    [J]. PLOS ONE, 2017, 12 (11)