A New Method of Computing Chinese Word Similarity Based on Statistics

Cited by: 1

Authors:

Zhang, Bo [1]

Hong, Lei [1]

Song, Shubin [1]

He, Liang [1]

Li, Guorong [2]

Affiliations:

[1] East China Normal Univ, Dept Comp Sci & Technol, Shanghai 200062, Peoples R China

[2] Shanghai KINGSWAY CO LTD, Shanghai, Peoples R China

Source:

2012 FIFTH INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING (BIFE) | 2012

Funding:

National High Technology Research and Development Program of China (863 Program)

Keywords:

Semantic similarity; co-occurrence; Tongyici Cilin

DOI:

10.1109/BIFE.2012.17

Chinese Library Classification (CLC) number:

TP39 [Computer applications]

Discipline classification codes:

081203; 0835

Abstract:

Word semantic similarity is a highly subjective concept, and it is difficult to obtain similarity values close to human judgment. Research on Chinese word semantic similarity is relatively scarce because of its inherent complexity. This paper presents a statistical approach to computing Chinese word semantic similarity that introduces word frequency contrast (WFC-WS). Word semantic vectors are first obtained from co-occurrence and then extended with HIT-IR Tongyici Cilin (Extended). Word frequency contrast is then used to filter the semantic vectors. Experiments show that the results of WFC-WS are closer to the manually constructed standard than those of some similar methods.
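
The three stages named in the abstract (co-occurrence vectors, thesaurus extension, frequency-contrast filtering) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no formulas, so the window size, the way synonym weights are copied, the ratio used as "frequency contrast", and the use of cosine similarity are all assumptions made for illustration. The function names and the toy corpus are hypothetical, and a small dictionary stands in for Tongyici Cilin.

```python
from collections import Counter, defaultdict
from math import sqrt


def cooccurrence_vectors(sentences, window=2):
    """Count, for each word, the words appearing within `window` tokens of it."""
    vectors = defaultdict(Counter)
    for tokens in sentences:
        for i, word in enumerate(tokens):
            start, end = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(start, end):
                if j != i:
                    vectors[word][tokens[j]] += 1
    return vectors


def extend_with_synonyms(vector, synonyms):
    """Copy each feature's weight onto its synonyms (toy stand-in for a thesaurus)."""
    extended = Counter(vector)
    for feature, weight in vector.items():
        for synonym in synonyms.get(feature, ()):
            extended[synonym] += weight
    return extended


def filter_by_frequency_contrast(vector, corpus_counts, threshold=1.0):
    """Keep features whose share in the vector is at least `threshold` times
    their share in the whole corpus.

    This ratio is an assumed definition of "contrast", not the paper's formula.
    """
    local_total = sum(vector.values())
    corpus_total = sum(corpus_counts.values())
    kept = Counter()
    for feature, weight in vector.items():
        global_share = corpus_counts.get(feature, 0) / corpus_total
        # Features absent from the corpus cannot be contrasted, so keep them.
        if global_share == 0 or (weight / local_total) / global_share >= threshold:
            kept[feature] = weight
    return kept


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as Counters."""
    dot = sum(weight * v[feature] for feature, weight in u.items())
    norm_u = sqrt(sum(w * w for w in u.values()))
    norm_v = sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


# Toy pre-segmented corpus: cat/dog, eat/drink, fish/meat/water.
sentences = [
    ["猫", "吃", "鱼"],
    ["狗", "吃", "肉"],
    ["猫", "喝", "水"],
    ["狗", "喝", "水"],
]
synonyms = {"鱼": {"肉"}, "肉": {"鱼"}}  # hypothetical synonym groups

vectors = cooccurrence_vectors(sentences)
corpus_counts = Counter(token for tokens in sentences for token in tokens)

raw_similarity = cosine(vectors["猫"], vectors["狗"])
extended_similarity = cosine(
    extend_with_synonyms(vectors["猫"], synonyms),
    extend_with_synonyms(vectors["狗"], synonyms),
)
filtered_cat = filter_by_frequency_contrast(vectors["猫"], corpus_counts, threshold=2.0)
```

On this toy corpus, extending the vectors raises the similarity between the two target words because their distinct contexts are treated as synonyms, while the filter keeps only the context word that is disproportionately frequent near the target.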

Pages: 43-46

Page count: 4
