Creation of lexical resources for a characterisation of multiword expressions in Italian

被引:0
|
作者
Zaninello, Andrea [1 ]
Nissim, Malvina [1 ]
机构
[1] Univ Bologna, Alma Mater Studiorum, Dipartimento Studi Linguistici & Orientali, I-40126 Bologna, Italy
关键词
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
The theoretical characterisation of multiword expressions (MWEs) is tightly connected to their actual occurrences in data and to their representation in lexical resources. We present three lexical resources for Italian MWEs, namely an electronic lexicon, a series of example corpora and a database of MWEs represented around morphosyntactic patterns. These resources are matched against, and created from, a very large web-derived corpus for Italian that spans across registers and domains. We can thus test expressions coded by lexicographers in a dictionary, thereby discarding unattested expressions, revisiting lexicographers's choices on the basis of frequency information, and at the same time creating an example sub-corpus for each entry. We organise MWEs on the basis of the morphosyntactic information obtained from the data in an electronic, flexible knowledge-base containing structured annotation exploitable for multiple purposes. We also suggest further work directions towards characterising MWEs by analysing the data organised in our database through lexico-semantic information available in WordNet or MultiWordNet-like resources, also in the perspective of expanding their set through the extraction of other similar compact expressions.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] A lexical database of Portuguese multiword expressions
    Antunes, Sandra
    Bacelar do Nascimento, Maria Fernanda
    Casteleiro, Joao Miguel
    Mendes, Amalia
    Pereira, Luisa
    Sa, Tiago
    [J]. COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROCEEDINGS, 2006, 3960 : 238 - 243
  • [2] A QUANTITATIVE STUDY OF THE MORPHOLOGY OF ITALIAN MULTIWORD EXPRESSIONS
    Nissim, Malvina
    Zaninello, Andrea
    [J]. LINGUE E LINGUAGGIO, 2011, 10 (02) : 283 - 299
  • [3] Lexical Representation of Multiword Expressions in Morphologically-complex Languages
    Al-Haj, Hassan
    Itai, Alon
    Wintner, Shuly
    [J]. INTERNATIONAL JOURNAL OF LEXICOGRAPHY, 2014, 27 (02) : 130 - 170
  • [4] Determining the Importance of Frequency and Contextual Diversity in the Lexical Organization of Multiword Expressions
    Senaldi, Marco S. G.
    Titone, Debra A.
    Johns, Brendan T.
    [J]. CANADIAN JOURNAL OF EXPERIMENTAL PSYCHOLOGY-REVUE CANADIENNE DE PSYCHOLOGIE EXPERIMENTALE, 2022, 76 (02): : 87 - 98
  • [5] Towards the Construction of Language Resources for Greek Multiword Expressions: Extraction and Evaluation
    Linardaki, Evita
    Ramisch, Carlos
    Villavicencio, Aline
    Fotopoulou, Aggeliki
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : H31 - H40
  • [6] Multiword Expressions and Lexicalism
    Findlay, Jamie Y.
    [J]. PROCEEDINGS OF LFG'17 CONFERENCE, 2017, : 209 - 229
  • [7] Discovering multiword expressions
    Villavicencio, Aline
    Idiart, Marco
    [J]. NATURAL LANGUAGE ENGINEERING, 2019, 25 (06) : 715 - 733
  • [8] Prepositional multiword expressions
    Ivankovic, Ivana Matas
    [J]. RASPRAVE, 2016, 42 (02): : 543 - 562
  • [9] Computational Phraseology light: automatic translation of multiword expressions without translation resources
    Mitkov, Ruslan
    [J]. YEARBOOK OF PHRASEOLOGY, 2016, 7 (01) : 149 - 166
  • [10] Extracting multiword expressions from texts with the aid of online resources A classroom experiment
    Bui, Thuy
    Boers, Frank
    Coxhead, Averil
    [J]. ITL-INTERNATIONAL JOURNAL OF APPLIED LINGUISTICS, 2020, 171 (02) : 221 - 252