An improved Chinese word semantic similarity algorithm based on CiLin

被引:0
|
作者
Li, Fei [1 ]
Zhu, Xinhua [1 ]
Chen, Hongchao [1 ]
Ma, Runcong [1 ]
Deng, Han [1 ]
机构
[1] Guangxi Key Lab. of Multi-Source Information Mining & Security and College of Computer Science & Information Technology, Guangxi Normal University, Guilin, China
来源
Journal of Information and Computational Science | 2015年 / 12卷 / 10期
关键词
Correlation methods;
D O I
10.12733/jics20106030
中图分类号
O212 [数理统计];
学科分类号
摘要
The CiLin is a famous semantic dictionary of Chinese synonyms; its structure and function are quite like the WordNet in English. This paper improves the existing algorithm of Chinese word semantic similarity based on CiLin, which integrates the word distance, the density of lowest common parent node and branch layer spacing. Firstly, the initial value of word semantic similarity is calculated through word distance, and then an adjusting parameter that depends on the lowest common parent node density n and the branch interval k is set to revise the initial value downward. Through the fourth root of an expression for the parameters k and n, the revision range of initial similarity can be limited below 16%, thus avoiding the unreasonable phenomenon that the word pairs with near distance have a low similarity because of a far branch interval. This method obtains an as high as 0.8464 value of Pearson correlation coefficient compared with artificial judgment for the word pair set of Miller & Charles. 1548-7741/Copyright © 2015 Binary Information Press
引用
收藏
页码:3799 / 3807
相关论文
共 50 条
  • [1] An approach based on tongyici cilin and word similarity for Chinese word sense induction
    Sun, Rui
    Jin, Peng
    Yang, Xia
    ICIC Express Letters, 2013, 7 (06): : 1767 - 1772
  • [2] An Improved Algorithm of Word Semantic Similarity Based on HowNet
    Kang, Bocheng
    Qi, Junpeng
    2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 266 - 271
  • [3] Chinese Sentence Similarity based on Word Context and Semantic
    Gu, Tianjiao
    Ren, Fuji
    IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 535 - 539
  • [4] An Improved Algorithm for Semantic Similarity Based on HowNet
    Bai Jinhong
    Bu Yan
    2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND BUSINESS ANALYTICS (ICDSBA 2018), 2018, : 65 - 70
  • [5] Improved Semantic Similarity Algorithm Based on Ontology
    Wei, Junying
    Zhong, Peisi
    Guo, Chunfen
    MECHANICAL, MATERIALS AND MANUFACTURING ENGINEERING, PTS 1-3, 2011, 66-68 : 709 - 714
  • [6] Semantic Similarity Calculation of Chinese Word
    Pan, Liqiang
    Zhang, Pu
    Xiong, Anping
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (08) : 8 - 12
  • [7] Using Tongyici Cilin to compute word semantic polarity
    Lu, Bin
    Wan, Xiaojun
    Yang, Jianwu
    Chen, Xiaoou
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 11 - 16
  • [8] An Improved Semantic Similarity Measure for Word Pairs
    Cai, Songmei
    Lu, Zhao
    2010 INTERNATIONAL CONFERENCE ON E-EDUCATION, E-BUSINESS, E-MANAGEMENT AND E-LEARNING: IC4E 2010, PROCEEDINGS, 2010, : 212 - 216
  • [9] A Method of Building Chinese Basic Semantic Lexicon Based on Word Similarity
    Zhu, Yanhui
    Wen, ZhiQiang
    Wang, Ping
    Peng, Zhaoyi
    PROCEEDINGS OF THE 2009 CHINESE CONFERENCE ON PATTERN RECOGNITION AND THE FIRST CJK JOINT WORKSHOP ON PATTERN RECOGNITION, VOLS 1 AND 2, 2009, : 608 - 611
  • [10] A chinese short text similarity algorithm based on semantic and syntax
    Liao, Zhi-Fang (zfliao@csu.edu.cn), 1600, Hunan University (43):