A New Method of Computing Chinese Word Similarity Based on Statistics

被引:1
|
作者
Zhang, Bo [1 ]
Hong, Lei [1 ]
Song, Shubin [1 ]
He, Liang [1 ]
Li, Guorong [2 ]
机构
[1] East China Normal Univ, Dept Comp Sci & Technol, Shanghai 200062, Peoples R China
[2] Shanghai KINGSWAY CO LTD, Shanghai, Peoples R China
来源
2012 FIFTH INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING (BIFE) | 2012年
基金
国家高技术研究发展计划(863计划);
关键词
Semantic similarity; co-occurrence; Tongyici Cilin;
D O I
10.1109/BIFE.2012.17
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Word semantic similarity is a very subjective concept and it is very difficult to get a similarity value close to human judgment. Chinese word semantic similarity research is relatively scarce due to its inherent complexity. This paper presents an approach to compute Chinese word semantic similarity based on statistical methods with word frequency contrast introduced (WFC-WS). Word semantic vectors are first obtained using co-occurrence and then extended with HIT-IR Tongyici Cilin (Extended). Word frequency contrast is introduced to filter the semantic vectors. Experiments show that the results of WFC-WS are closer to artificial standard compared with some similar methods.
引用
收藏
页码:43 / 46
页数:4
相关论文
共 50 条
  • [1] CHINESE WORD SIMILARITY COMPUTING
    Li, Lei
    Wang, Zhiqing
    PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC 2012), 2012, : 619 - 623
  • [2] Chinese Word Similarity Computing Based on Combination Strategy
    Guo, Shaoru
    Guan, Yong
    Li, Ru
    Zhang, Qi
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 744 - 752
  • [3] A new similarity computing method based on concept similarity in Chinese text processing
    Peng Jing
    Yang DongQing
    Tang ShiWei
    Wang TengJiao
    Gao Jun
    SCIENCE IN CHINA SERIES F-INFORMATION SCIENCES, 2008, 51 (09): : 1215 - 1230
  • [4] A new similarity computing method based on concept similarity in Chinese text processing
    PENG Jing1
    2 Department of Science and Technology
    Science in China(Series F:Information Sciences), 2008, (09) : 1215 - 1230
  • [5] A new similarity computing method based on concept similarity in Chinese text processing
    Jing Peng
    DongQing Yang
    ShiWei Tang
    TengJiao Wang
    Jun Gao
    Science in China Series F: Information Sciences, 2008, 51
  • [6] A word similarity calculate method base on CSD and statistics
    Zhong, Tonny
    Zhang, Yangsen
    11TH CHINESE LEXICAL SEMANTICS WORKSHOP (CKSW2010), 2010, : 257 - 264
  • [7] A Method of Building Chinese Basic Semantic Lexicon Based on Word Similarity
    Zhu, Yanhui
    Wen, ZhiQiang
    Wang, Ping
    Peng, Zhaoyi
    PROCEEDINGS OF THE 2009 CHINESE CONFERENCE ON PATTERN RECOGNITION AND THE FIRST CJK JOINT WORKSHOP ON PATTERN RECOGNITION, VOLS 1 AND 2, 2009, : 608 - 611
  • [8] Chinese word classification based on statistics
    Zhao, SW
    Xia, Y
    Ma, SP
    Wang, Y
    Su, Z
    PROCEEDINGS OF THE 3RD WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-5, 2000, : 2753 - 2756
  • [9] An Improved Method of Computing Chinese Sentence Similarity
    Wang, Lu
    He, Zhongshi
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND COGNITIVE INFORMATICS, 2015, : 29 - 33
  • [10] A new method for Chinese sentence similarity computing and its weighting coefficients determination
    Zhou, Faguo
    Yang, Bingru
    Li, Linna
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 143 - 146