Spectral Methods for Thesaurus Construction

Authors
Shimizu, Nobuyuki [1]
Sugiyama, Masashi [2]
Nakagawa, Hiroshi [1]
Affiliations
[1] Univ Tokyo, Ctr Informat Technol, Tokyo 1130033, Japan
[2] Tokyo Inst Technol, Dept Comp Sci, Tokyo 1528550, Japan
Keywords
synonym acquisition; synonym extraction; thesaurus; spectral clustering; graph Laplacian
DOI
10.1587/transinf.E93.D.1378
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Popular synonym acquisition methods are traditionally based on the distributional hypothesis: a metric such as the Jaccard coefficient is used to evaluate the similarity between the contexts of words in order to obtain synonyms for a query. When compiling and cleaning a thesaurus, however, one often already has a modest number of synonym relations at hand. Could something be done with a half-built thesaurus alone? We propose the use of spectral methods and discuss their relation to other network-based algorithms in natural language processing (NLP), such as PageRank and bootstrapping. Since compiling a thesaurus is very laborious, we believe that adding the proposed method to the toolkit of thesaurus constructors would significantly ease this task.
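The distributional baseline that the abstract contrasts with can be illustrated with a minimal sketch. This is not the authors' spectral method; it only shows how a Jaccard coefficient over context sets might rank synonym candidates for a query. The function names `jaccard` and `rank_candidates` and the toy `contexts` data are hypothetical and chosen purely for illustration.

```python
def jaccard(a: set, b: set) -> float:
    """Jaccard coefficient |a & b| / |a | b|; taken to be 0.0 when both sets are empty."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


# Hypothetical toy data (an assumption, not from the paper):
# each word maps to the set of context words it co-occurs with.
contexts = {
    "car": {"drive", "road", "engine", "park"},
    "automobile": {"drive", "road", "engine", "buy"},
    "banana": {"eat", "peel", "yellow", "buy"},
}


def rank_candidates(query: str, contexts: dict) -> list:
    """Return (word, score) pairs for every other word, highest Jaccard score first."""
    scores = [
        (word, jaccard(contexts[query], ctx))
        for word, ctx in contexts.items()
        if word != query
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


ranked = rank_candidates("car", contexts)
print(ranked)
```

In this toy setting, "automobile" shares three of five distinct context words with "car" and so ranks above "banana", which shares none. A real system would derive the context sets from a corpus rather than hand-written sets.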
Pages: 1378-1385
Page count: 8