Semantic similarity using first and second order co-occurrence matrices and information content vectors

被引:0
|
作者
Pesaranghader, Ahmad [1 ]
Muthaiyah, Saravanan [2 ]
机构
[1] Multimedia University, Jalan Multimedia, 63100 Cyberjaya, Malaysia
[2] Faculty of Management, Multimedia University, Jalan Multimedia, 63100 Cyberjaya, Malaysia
来源
WSEAS Transactions on Computers | 2013年 / 12卷 / 03期
关键词
Natural language processing systems - Medical information systems - Ontology;
D O I
暂无
中图分类号
学科分类号
摘要
Massiveness of data on the Web demands automated Knowledge Engineering techniques enabling machines to achieve integrated definition of all available data to make a unique understanding of all discrete data sources. This research deals with Measures of Semantic Similarity resolving foregoing issue. These measures are widely used in ontology alignment, information retrieval and natural language processing. The study also introduces new normalized functions based on first and second order context and information content vectors of concepts in a corpus. By applying these measures to Unified Medical Language System (UMLS) using WordNet as a general taxonomy and MEDLINE abstract as the corpus to extract information content and information content vectors, these functions get evaluated against a created test bed of 301 biomedical concept pairs scored by medical residents. The paper shows newly proposed Semantic Similarity Measures outperform previous functions.
引用
下载
收藏
页码:95 / 104
相关论文
共 50 条
  • [21] Constructing gene similarity networks using co-occurrence probabilities
    Golrokh Mirzaei
    BMC Genomics, 24
  • [22] Using Information Content to Evaluate Semantic Similarity on HowNet
    You Bin
    Liu Xiao-ran
    Li Ning
    Yan Yue-song
    PROCEEDINGS OF THE 2012 EIGHTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2012), 2012, : 142 - 145
  • [23] Valence extraction using EM selection and co-occurrence matrices
    Łukasz Dębowski
    Language Resources and Evaluation, 2009, 43 : 301 - 327
  • [24] Textile recognition using Tchebichef moments of co-occurrence matrices
    Cheong, Marc
    Loke, Kar-Seng
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2008, 5226 : 1017 - +
  • [25] Valence extraction using EM selection and co-occurrence matrices
    Debowski, Lukasz
    LANGUAGE RESOURCES AND EVALUATION, 2009, 43 (04) : 301 - 327
  • [26] The textural analysis of gravity data using co-occurrence matrices
    Cooper, GRJ
    COMPUTERS & GEOSCIENCES, 2004, 30 (01) : 107 - 115
  • [27] Handwritten arabic character recognition using co-occurrence matrices
    Assaleh, K
    Al-Rousan, M
    Ghazal, M
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL VI, PROCEEDINGS: IMAGE, ACOUSTIC, SIGNAL PROCESSING AND OPTICAL SYSTEMS, TECHNOLOGIES AND APPLICATIONS, 2004, : 191 - 194
  • [28] An unsupervised language independent method of name discrimination using second order co-occurrence features
    Pedersen, T
    Kulkarni, A
    Angheluta, R
    Kozareva, Z
    Solorio, T
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2006, 3878 : 208 - 222
  • [29] Comparing Computational Models of Selectional Preferences Second-order Co-Occurrence vs. Latent Semantic Clusters
    Walde, Sabine Schulte Im
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010,
  • [30] Co-occurrence matrices and their applications in information science: Extending ACA to the Web environment
    Leydesdorff, Loet
    Vaughan, Liwen
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2006, 57 (12): : 1616 - 1628