PERSISTENT SEMANTIC IDENTITY IN WORDNET

被引:0
|
作者
Kafe, Eric [1 ]
机构
[1] MegaDoc, Charlottenlund, Denmark
来源
关键词
wordnets; semantic identifiers; sense keys; key violations; synsets; mappings;
D O I
10.11649/cs.1717
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
Although rarely studied, the persistence of semantic identity in the WordNet lexical database is crucial for the interoperability of all the resources that use WordNet data. The present study investigates the stability of the two primary entities of the WordNet database (the word senses and the synonym sets), by following their respective identifiers (the sense keys and the synset offsets) across all the versions released between 1995 and 2012, while also considering drifts of identical definitions and semantic relations. Contrary to expectations, 94.4% of the WordNet 1.5 synsets still persisted in the latest 2012 version, compared to only 89.1% of the corresponding sense keys. Meanwhile, the splits and merges between synonym sets remained few and simple. These results are presented in tables that allow to estimate the lexicographic effort needed for updating WordNet-based resources to newer WordNet versions. We discuss the specific challenges faced by both the dominant synset-based mapping paradigm (a moderate amount of split synsets), and the recommended sense key-based approach (very few identity violations), and conclude that stable synset identifiers are viable, but need to be complemented by stable sense keys in order to adequately handle the split synonym sets.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Semantic Opposition and WordNet
    Sandiway Fong
    Journal of Logic, Language and Information, 2004, 13 (2) : 159 - 171
  • [2] Measuring semantic similarity in WordNet
    Liu, Xiao-Ying
    Zhou, Yi-Ming
    Zheng, Ruo-Shi
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 3431 - +
  • [3] ADJECTIVES IN WORDNET: SEMANTIC ISSUES
    Dimitrova, Tsvetana
    Stefanova, Valentina
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE 'LINQUISTIC RESOURCES AND TOOLS FOR PROCESSING THE ROMANIAN LANGUAGE', 2016, : 131 - 141
  • [4] Chinese WordNet domains: Bootstrapping Chinese WordNet with semantic domain labels
    Lee, Lung-Hao
    Yu, Yu-Ting
    Huang, Chu-Ren
    PACLIC 23 - Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, 2009, 1 : 288 - 296
  • [5] Integration of semantic resources based on WordNet
    Gutierrez Vazquez, Yoan
    Fernandez Orquin, Antonio
    Montoyo Guijarro, Andres
    Vazquez Perez, Sonia
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (45): : 161 - 168
  • [6] Building Semantic Corpus from WordNet
    Stanchev, Lubomir
    2012 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2012,
  • [7] Parse Ranking with Semantic Dependencies and WordNet
    Yin, Xiaocheng
    Kim, Jungjae
    Pozen, Zinaida
    Bond, Francis
    PROCEEDINGS OF THE SEVENTH GLOBAL WORDNET CONFERENCE, GWC 2014, 2014, : 186 - 193
  • [8] WordNet Gloss for Semantic Concept Relatedness
    Bijaksana, Moch Arif
    Permadi, Rakhmad Indra
    RECENT ADVANCES ON SOFT COMPUTING AND DATA MINING, 2017, 549 : 406 - 413
  • [9] Measuring Semantic Similarity Based On WordNet
    Zhao, Zhongcheng
    Yan, Jianzhuo
    Fang, Liying
    Wang, Pu
    2009 SIXTH WEB INFORMATION SYSTEMS AND APPLICATIONS CONFERENCE, PROCEEDINGS, 2009, : 89 - 92
  • [10] Classification of Semantic Documents based on WordNet
    Shi, Bin
    Fang, Liying
    Yan, Jianzhuo
    Wang, Pu
    Dong, Chen
    IEEE: 2009 INTERNATIONAL CONFERENCE ON E-LEARNING, E-BUSINESS, ENTERPRISE INFORMATION SYSTEMS AND E-GOVERNMENT, 2009, : 173 - 176