Unsupervised methods for developing taxonomies by combining syntactic and statistical information

Cited by: 0
Authors
Widdows, D [1]
Affiliation
[1] Stanford Univ, Ctr Study Language & Informat, Stanford, CA 94305 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper describes an unsupervised algorithm for placing unknown words into a taxonomy and evaluates its accuracy on a large and varied sample of words. The algorithm works by first using a large corpus to find semantic neighbors of the unknown word, which we accomplish by combining latent semantic analysis with part-of-speech information. We then place the unknown word in the part of the taxonomy where these neighbors are most concentrated, using a class-labelling algorithm developed especially for this task. This method is used to reconstruct parts of the existing WordNet database, obtaining results for common nouns, proper nouns and verbs. We evaluate the contribution made by part-of-speech tagging and show that automatic filtering using the class-labelling algorithm gives a fourfold improvement in accuracy.
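The placement step described in the abstract (choosing the part of the taxonomy where the semantic neighbors are most concentrated) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the toy taxonomy `PARENT`, the function names, the inverse-square distance weighting, and the fixed `penalty` for neighbors that a candidate class does not cover.

```python
from collections import defaultdict

# Hypothetical toy taxonomy (child -> hypernym); not data from the paper.
PARENT = {
    "animal": "entity",
    "plant": "entity",
    "mammal": "animal",
    "bird": "animal",
    "dog": "mammal",
    "horse": "mammal",
    "cat": "mammal",
    "sparrow": "bird",
    "oak": "plant",
}


def ancestors(word):
    """Yield (ancestor, distance) pairs for the proper ancestors of `word`."""
    dist = 1
    node = PARENT.get(word)
    while node is not None:
        yield node, dist
        node = PARENT.get(node)
        dist += 1


def best_class(neighbors, penalty=0.25):
    """Return the class label that best concentrates the neighbors.

    Assumed scoring rule: each neighbor below a candidate class adds
    1 / distance**2, and each neighbor not below it subtracts `penalty`.
    """
    gain = defaultdict(float)   # summed closeness of covered neighbors
    covered = defaultdict(int)  # number of neighbors under each class
    for n in neighbors:
        for node, dist in ancestors(n):
            gain[node] += 1.0 / dist ** 2
            covered[node] += 1
    scores = {
        node: gain[node] - penalty * (len(neighbors) - covered[node])
        for node in gain
    }
    return max(scores, key=scores.get), scores


# Neighbors that would, in practice, come from the corpus-based step.
label, scores = best_class(["dog", "horse", "cat", "sparrow"])
print(label)  # mammal
```

In this sketch the penalty lets a specific class such as `mammal` win even though it misses one neighbor, while the distance weighting keeps a very general class such as `entity` from winning merely because it covers everything.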
Pages: 276-283
Page count: 8