A New Method of Computing Chinese Word Similarity Based on Statistics

被引:1
|
作者
Zhang, Bo [1 ]
Hong, Lei [1 ]
Song, Shubin [1 ]
He, Liang [1 ]
Li, Guorong [2 ]
机构
[1] East China Normal Univ, Dept Comp Sci & Technol, Shanghai 200062, Peoples R China
[2] Shanghai KINGSWAY CO LTD, Shanghai, Peoples R China
来源
2012 FIFTH INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING (BIFE) | 2012年
基金
国家高技术研究发展计划(863计划);
关键词
Semantic similarity; co-occurrence; Tongyici Cilin;
D O I
10.1109/BIFE.2012.17
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Word semantic similarity is a very subjective concept and it is very difficult to get a similarity value close to human judgment. Chinese word semantic similarity research is relatively scarce due to its inherent complexity. This paper presents an approach to compute Chinese word semantic similarity based on statistical methods with word frequency contrast introduced (WFC-WS). Word semantic vectors are first obtained using co-occurrence and then extended with HIT-IR Tongyici Cilin (Extended). Word frequency contrast is introduced to filter the semantic vectors. Experiments show that the results of WFC-WS are closer to artificial standard compared with some similar methods.
引用
收藏
页码:43 / 46
页数:4
相关论文
共 50 条
  • [21] An improved Chinese word semantic similarity algorithm based on CiLin
    Li, Fei
    Zhu, Xinhua
    Chen, Hongchao
    Ma, Runcong
    Deng, Han
    Journal of Information and Computational Science, 2015, 12 (10): : 3799 - 3807
  • [22] A new indexing method based on word proximity for Chinese text retrieval
    Lin Du
    Yufang Sun
    Journal of Computer Science and Technology, 2000, 15 : 280 - 286
  • [23] A New Indexing Method Based on Word Proximity for Chinese Text Retrieval
    杜林
    孙玉芳
    Journal of Computer Science and Technology, 2000, (03) : 280 - 286
  • [24] An adaptive method for Chinese new word detection based on hypothesis testing
    Jiang, Dongchen
    Jiang, Aoyuan
    Tang, Shuai
    PATTERN ANALYSIS AND APPLICATIONS, 2022, 25 (04) : 993 - 999
  • [25] A new word detection method for chinese based on local context information
    Zeng, Hua-Lin
    Zhou, Chang-Le
    Zheng, Xu-Ling
    Journal of Donghua University (English Edition), 2010, 27 (02) : 189 - 192
  • [26] An adaptive method for Chinese new word detection based on hypothesis testing
    Dongchen Jiang
    Aoyuan Jiang
    Shuai Tang
    Pattern Analysis and Applications, 2022, 25 : 993 - 999
  • [27] A New Word Detection Method for Chinese Based on Local Context Information
    曾华琳
    周昌乐
    郑旭玲
    JournalofDonghuaUniversity(EnglishEdition), 2010, 27 (02) : 189 - 192
  • [28] A new indexing method based on word proximity for Chinese text retrieval
    Du, L
    Sun, YF
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2000, 15 (03) : 280 - 286
  • [29] Out-domain Chinese new word detection with statistics-based character embedding
    Liang, Yuzhi
    Yang, Min
    Zhu, Jia
    Yiu, S. M.
    NATURAL LANGUAGE ENGINEERING, 2019, 25 (02) : 239 - 255
  • [30] Semantic Similarity Calculation of Chinese Word
    Pan, Liqiang
    Zhang, Pu
    Xiong, Anping
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (08) : 8 - 12