Word Clustering based on Word2vec and Semantic Similarity

被引:0
|
作者
Luo Jie [1 ]
Wang Qinglin [1 ]
Li Yuan [1 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
关键词
word2vec; semantic similarity; word clustering; domain ontology;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Domain words clustering have important theoretical and practical significance in text categorization, the ontology research, machine learning and many other research areas. The domain words clustering method in this article is a method based on word2vec and semantic similarity computation. First of all, we get the candidate word set with word2vec tools to preliminary clustering of words. Then we tectonic domain category semantic core word set and screening candidate word set by means of semantic similarity computation. Finally we get new word set belongs to the target domain and get the word set in the field of clustering. Experiments show that this method has higher recall ratio and accuracy.
引用
收藏
页码:517 / 521
页数:5
相关论文
共 2 条
  • [1] Bassiou N., 2003, PATTERN RECOGN, V01, P145
  • [2] Jardino M., 1993, IEEE INT C AC SPEECH, P4