Semantic similarity using first and second order co-occurrence matrices and information content vectors

被引:0
|
作者
Pesaranghader, Ahmad [1 ]
Muthaiyah, Saravanan [2 ]
机构
[1] Multimedia University, Jalan Multimedia, 63100 Cyberjaya, Malaysia
[2] Faculty of Management, Multimedia University, Jalan Multimedia, 63100 Cyberjaya, Malaysia
来源
WSEAS Transactions on Computers | 2013年 / 12卷 / 03期
关键词
Natural language processing systems - Medical information systems - Ontology;
D O I
暂无
中图分类号
学科分类号
摘要
Massiveness of data on the Web demands automated Knowledge Engineering techniques enabling machines to achieve integrated definition of all available data to make a unique understanding of all discrete data sources. This research deals with Measures of Semantic Similarity resolving foregoing issue. These measures are widely used in ontology alignment, information retrieval and natural language processing. The study also introduces new normalized functions based on first and second order context and information content vectors of concepts in a corpus. By applying these measures to Unified Medical Language System (UMLS) using WordNet as a general taxonomy and MEDLINE abstract as the corpus to extract information content and information content vectors, these functions get evaluated against a created test bed of 301 biomedical concept pairs scored by medical residents. The paper shows newly proposed Semantic Similarity Measures outperform previous functions.
引用
下载
收藏
页码:95 / 104
相关论文
共 50 条
  • [31] Corpus Co-Occurrence, Dictionary and Wikipedia Entries as Resources for Semantic Relatedness Information
    Roth, Michael
    Walde, Sabine Schulte Im
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1852 - 1859
  • [32] Measuring similarity in co-occurrence data using ego-networks
    Wang, Xiaomeng
    Ran, Yijun
    Jia, Tao
    CHAOS, 2020, 30 (01)
  • [33] The normalization of occurrence and Co-occurrence matrices in bibliometrics using Cosine similarities and Ochiai coefficients
    Zhou, Qiuju
    Leydesdorff, Loet
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2016, 67 (11) : 2805 - 2814
  • [34] The analysis and measurement of building patterns using texton co-occurrence matrices
    Yu, Wenhao
    Ai, Tinghua
    Liu, Pengcheng
    Cheng, Xiaoqiang
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2017, 31 (06) : 1079 - 1100
  • [35] Material microstructures analyzed by using gray level Co-occurrence matrices
    胡延苏
    王志军
    樊晓光
    李俊杰
    高昂
    Chinese Physics B, 2017, (09) : 487 - 494
  • [36] Texture defect detection using subband domain co-occurrence matrices
    Amet, Ahmet Latif
    Ertuzun, Aysin
    Ercil, Aytul
    Proceedings of the IEEE Southwest Symposium on Image Analysis and Interpretation, 1998, : 205 - 210
  • [37] Color texture analysis using CFA chromatic co-occurrence matrices
    Losson, O.
    Porebski, A.
    Vandenbroucke, N.
    Macaire, L.
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (07) : 747 - 763
  • [38] Barrett's Esophagus Identification Using Color Co-occurrence Matrices
    de Souza, Luis A., Jr.
    Ebigbo, Alanna
    Probst, Andreas
    Messmann, Helmut
    Papa, Joao P.
    Mendel, Robert
    Palm, Christoph
    PROCEEDINGS 2018 31ST SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI), 2018, : 166 - 173
  • [39] Action recognition using graph embedding and the co-occurrence matrices descriptor
    Zheng, Feng
    Shao, Ling
    Song, Zhan
    Chen, Xi
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2011, 88 (18) : 3896 - 3914
  • [40] Material microstructures analyzed by using gray level Co-occurrence matrices
    Hu, Yansu
    Wang, Zhijun
    Fan, Xiaoguang
    Li, Junjie
    Gao, Ang
    CHINESE PHYSICS B, 2017, 26 (09)