WORD SENSE DISAMBIGUATION USING WORD ONTOLOGY AND CONCEPT DISTRIBUTION

被引:1
|
作者
Hung, Jason C. [1 ]
Yang, Che-Yu [2 ]
机构
[1] Overseas Chinese Inst Technol, Dept Informat Technol, Taichung 407, Taiwan
[2] China Univ Technol, Dept Informat Management, Hsinchu 300, Taiwan
关键词
word sense disambiguation; semantic relatedness; semantic similarity; natural language processing; wordnet;
D O I
10.1080/02533839.2009.9671494
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper presents a method of word sense disambiguation that assigns a target word the sense that is most related to the senses of its neighbor words. We explore the use of measures of relatedness between word senses based on a novel hybrid approach. First, we investigate how to "literally" and "regularly" express a "concept". We apply set algebra to Wordnet's synsets cooperating with Wordnet's word ontology. In this way we establish regular rules for constructing various representations (lexical notations) of a concept using Boolean operators and word forms in various synset(s) defined in Wordnet. Then we establish a formal mechanism for quantifying and estimating the semantic relatedness between concepts - we facilitate "concept distribution statistics" to determine the degree of semantic relatedness between two lexically expressed concepts. Human languages have words that can mean different things in different contexts, such words with multiple meanings are potentially "ambiguous". The process of "deciding which of several meanings of a term is intended in a given context" is known as "Word Sense Disambiguation (WSD)". The proposed method is not supervised, and does not require any manually created sense-tagged training examples. The experimental results showed good performance on Semcor, a subset of the Brown corpus. We observe that measures of semantic relatedness are useful sources of information for word sense disambiguation.
引用
收藏
页码:153 / 168
页数:16
相关论文
共 50 条
  • [1] Word-Sense Disambiguation for Ontology Mapping: Concept Disambiguation using Virtual Documents and Information Retrieval Techniques
    Schadd, Frederik C.
    Roos, Nico
    JOURNAL ON DATA SEMANTICS, 2015, 4 (03) : 167 - 186
  • [2] Word Sense Disambiguation with Distribution Estimation
    Chan, Yee Seng
    Ng, Hwee Tou
    19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 1010 - 1015
  • [3] Unsupervised Word Sense Disambiguation Using Word Embeddings
    Moradi, Behzad
    Ansari, Ebrahim
    Zabokrtsky, Zdenek
    PROCEEDINGS OF THE 2019 25TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT), 2019, : 228 - 233
  • [4] Word sense disambiguation as the primary step of ontology integration
    Banek, Marko
    Vrdoljak, Boris
    Tjoa, A. Min
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2008, 5181 : 65 - +
  • [5] Word Sense Disambiguation using KeNet
    Cetiner, Meltem
    Yildirim, Ahmet
    Onay, Bahadir
    Oksuz, Cuneyt
    29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [6] Word Sense Disambiguation Using PolyWordNet
    Dhungana, Udaya Raj
    Shakya, Subarna
    2016 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT), VOL 2, 2016, : 597 - 602
  • [7] Word sense disambiguation based on word sense clustering
    Anaya-Sanchez, Henry
    Pons-Porrata, Aurora
    Berlanga-Llavori, Rafael
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA-SBIA 2006, PROCEEDINGS, 2006, 4140 : 472 - 481
  • [8] Domain Adaptation for Word Sense Disambiguation Using Word Embeddings
    Komiya, Kanako
    Suzuki, Shota
    Sasaki, Minoru
    Shinnou, Hiroyuki
    Okumura, Manabu
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2017), PT I, 2018, 10761 : 195 - 206
  • [9] Using WordNet for word sense disambiguation to support concept map construction
    Cañas, AJ
    Valerio, A
    Lalinde-Pulido, J
    Carvalho, M
    Arguedas, M
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2003, 2857 : 350 - 359
  • [10] Corpus-based ontology learning for word sense disambiguation
    Kang, SJ
    PACLIC 17: Language, Information and Computation, Proceedings, 2003, : 399 - 407