Spectral Methods for Thesaurus Construction

被引:0
|
作者
Shimizu, Nobuyuki [1 ]
Sugiyama, Masashi [2 ]
Nakagawa, Hiroshi [1 ]
机构
[1] Univ Tokyo, Ctr Informat Technol, Tokyo 1130033, Japan
[2] Tokyo Inst Technol, Dept Comp Sci, Tokyo 1528550, Japan
来源
关键词
synonym acquisition; synonym extraction; thesaurus; spectral clustering; graph laplacian;
D O I
10.1587/transinf.E93.D.1378
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditionally, popular synonym acquisition methods are based on the distributional hypothesis, and a metric such as Jaccard coefficients is used to evaluate the similarity between the contexts of words to obtain synonyms for a query. On the other hand, when one tries to compile and clean a thesaurus, one often already has a modest number of synonym relations at hand. Could something be done with a half-built thesaurus alone? We propose the use of spectral methods and discuss their relation to other network-based algorithms in natural language processing (NLP), such as Page Rank and Bootstrapping. Since compiling a thesaurus is very laborious, we believe that adding the proposed method to the toolkit of thesaurus constructors would significantly ease the pain in accomplishing this task.
引用
收藏
页码:1378 / 1385
页数:8
相关论文
共 50 条