Using Wiktionary to Create Specialized Lexical Resources and Datasets

被引:0
|
作者
Bajcetic, Lenka [1 ]
Declerck, Thierry [2 ]
机构
[1] Austrian Acad Sci, Austrian Ctr Digital Human & Cultural Heritage, Vienna, Austria
[2] DFKI GmbH, Multilingual & Language Technol Lab, Saarland Univ Campus D3 2, Saarbrucken, Germany
基金
欧盟地平线“2020”;
关键词
Wiktionary; ambiguities; pronunciation; grammatical number; grammatical gender;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper describes an approach aiming at utilizing Wiktionary data for creating specialized lexical datasets which can be used for enriching other lexical (semantic) resources or for generating datasets that can be used for evaluating or improving NLP tasks, like Word Sense Disambiguation, Word-in-Context challenges, or Sense Linking across lexicons and dictionaries. We have focused on Wiktionary data about pronunciation information in English, and grammatical number and grammatical gender in German.
引用
收藏
页码:3457 / 3460
页数:4
相关论文
共 50 条
  • [1] Building Specialized Multilingual Lexical Graphs Using Community Resources
    Daoud, Mohammad
    Boitet, Christian
    Kageura, Kyo
    Kitamoto, Asanobu
    Mangeot, Mathieu
    Daoud, Daoud
    [J]. RESOURCE DISCOVERY, 2010, 6162 : 94 - +
  • [2] Semi-automatic Endogenous Enrichment of Collaboratively Constructed Lexical Resources: Piggybacking onto Wiktionary
    Sajous, Franck
    Navarro, Emmanuel
    Gaume, Bruno
    Prevot, Laurent
    Chudy, Yannick
    [J]. ADVANCES IN NATURAL LANGUAGE PROCESSING, 2010, 6233 : 332 - +
  • [3] Definition patterns for predicative terms in specialized lexical resources
    San Martin, Antonio
    L'Homme, Marie-Claude
    [J]. LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 3748 - 3755
  • [4] Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary
    Zesch, Torsten
    Mueller, Christof
    Gurevych, Iryna
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1646 - 1652
  • [5] MIss RoBERTa WiLDe: Metaphor Identification Using Masked Language Model with Wiktionary Lexical Definitions
    Babieno, Mateusz
    Takeshita, Masashi
    Radisavljevic, Dusan
    Rzepka, Rafal
    Araki, Kenji
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (04):
  • [6] Wikipedia and Wiktionary as resources for chemical text mining
    Sayle, Roger
    Lowe, Daniel
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2015, 250
  • [7] What Lexical Factors Drive Look-Ups in the English Wiktionary?
    Lew, Robert
    Wolfer, Sascha
    [J]. SAGE OPEN, 2024, 14 (01):
  • [8] DBnary: Wiktionary as a Lemon-based multilingual lexical resource in RDF
    Serasset, Gilles
    [J]. SEMANTIC WEB, 2015, 6 (04) : 355 - 361
  • [9] Create, Analyze, and Visualize Phylogenomic Datasets Using PhyloFisher
    Jones, Robert E.
    Tice, Alexander K.
    Elias, Marek
    Eme, Laura
    Kolisko, Martin
    Nenarokov, Serafim
    Panek, Tomas
    Rokas, Antonis
    Salomaki, Eric
    Strassert, Juergen F. H.
    Shen, Xing-Xing
    Zihala, David
    Brown, Matthew W.
    [J]. CURRENT PROTOCOLS, 2024, 4 (01):
  • [10] General and Specialized Lexical Resources: A Study on the Potential of Combining Efforts to Enrich Formal Lexicons
    Pimentel, Janine
    L'Homme, Marie-Claude
    Laneville, Marie-Eve
    [J]. INTERNATIONAL JOURNAL OF LEXICOGRAPHY, 2012, 25 (02) : 152 - 190