Towards the Construction of Language Resources for Greek Multiword Expressions: Extraction and Evaluation

被引:0
|
作者
Linardaki, Evita [1 ]
Ramisch, Carlos [2 ,3 ]
Villavicencio, Aline [3 ]
Fotopoulou, Aggeliki [1 ]
机构
[1] Inst Language & Speech Proc, Athens, Greece
[2] Univ Grenoble, Lab Informat Grenoble, GETALP, Grenoble, France
[3] Univ Fed Rio Grande do Sul, Inst Informat, BR-90046900 Porto Alegre, RS, Brazil
关键词
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
Multiword Expressions have been posing problems for Natural Language Processing systems for many years. Their automatic identification has, as a result, been in the focus of NLP research for almost two decades now. The advances in the most widely spoken languages like English, French, German, etc. are remarkable and have been extensively documented. This paper presents our work towards the creation of a dictionary of Multiword Expressions for Greek using automatic extraction and human validation. We investigate the use of a knowledge-poor statistical approach based on four association measures. The results obtained by these measures on the Greek Europarl corpus are compared and contrasted with those obtained by the same measures using the web as a corpus. The manual evaluation of the results by Greek native speakers shows that the automatic approach performs well enough to help in the construction of a lexical resource, despite of the difficulty of the task.
引用
收藏
页码:H31 / H40
页数:10
相关论文
共 50 条
  • [1] Multiword Expressions in Child Language
    Wilkens, Rodrigo
    Idiart, Marco
    Villavicencio, Aline
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 2307 - 2311
  • [2] Models of Language and Multiword Expressions
    Kallens, Pablo Contreras
    Christiansen, Morten H.
    [J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
  • [3] IDION: A database for Modern Greek multiword expressions
    Markantonatou, Stella
    Minos, Panagiotis
    Zakis, George
    Moutzouri, Vassiliki
    Chantou, Maria
    [J]. JOINT WORKSHOP ON MULTIWORD EXPRESSIONS AND WORDNET (MWE-WN 2019), 2019, : 130 - 134
  • [4] Automatic extraction of fixed multiword expressions
    Hore, C
    Asahara, M
    Matsumoto, Y
    [J]. NATURAL LANGUAGE PROCESSING - IJCNLP 2005, PROCEEDINGS, 2005, 3651 : 565 - 575
  • [5] Alignment-based extraction of multiword expressions
    Helena Medeiros de Caseli
    Carlos Ramisch
    Maria das Graças Volpe Nunes
    Aline Villavicencio
    [J]. Language Resources and Evaluation, 2010, 44 : 59 - 77
  • [6] Analyzing and identifying multiword expressions in spoken language
    Strik, Helmer
    Hulsbosch, Micha
    Cucchiarini, Catia
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2010, 44 (1-2) : 41 - 58
  • [7] Analyzing and identifying multiword expressions in spoken language
    Helmer Strik
    Micha Hulsbosch
    Catia Cucchiarini
    [J]. Language Resources and Evaluation, 2010, 44 : 41 - 58
  • [8] Alignment-based extraction of multiword expressions
    Caseli, Helena de Medeiros
    Ramisch, Carlos
    Volpe Nunes, Maria das Gracas
    Villavicencio, Aline
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2010, 44 (1-2) : 59 - 77
  • [9] Dedicated Language Resources for Interdisciplinary Research on Multiword Expressions: Best Thing since Sliced Bread
    Hubers, Ferdy
    Cucchiarini, Catia
    Strik, Helmer
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4418 - 4425
  • [10] Creation of lexical resources for a characterisation of multiword expressions in Italian
    Zaninello, Andrea
    Nissim, Malvina
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010,