Part-of-speech induction by singular value decomposition and hierarchical clustering

Times cited: 1
Authors
Rapp, R. [1]
Affiliation
[1] Johannes Gutenberg Univ Mainz, Fachbereich Angewandte Sprach & Kulturwissensch, D-76711 Germersheim, Germany
DOI
10.1007/3-540-31314-1_51
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Part-of-speech induction involves the automatic discovery of word classes and the assignment of each word of a vocabulary to one or several of these classes. The approach proposed here is based on the analysis of word distributions in a large collection of German newspaper texts. Its main advantage over other attempts is that it combines the hierarchical clustering of context vectors with a previous step of dimensionality reduction that minimizes the effects of sampling errors.
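The pipeline the abstract describes (context vectors, then dimensionality reduction, then hierarchical clustering) can be illustrated with a minimal sketch. This is not the author's implementation: the toy English corpus, the one-word context window, the cosine distance, the average-linkage criterion, and all function names are assumptions made for illustration, and the dimensionality reduction is done with a plain truncated SVD as named in the title.

```python
import numpy as np


def context_vectors(tokens, vocab, window=1):
    """Count, for each vocabulary word, its neighbours at each relative position."""
    index = {w: i for i, w in enumerate(vocab)}
    n = len(vocab)
    offsets = [o for o in range(-window, window + 1) if o != 0]
    # One block of n columns per relative position (left neighbours, right neighbours, ...).
    counts = np.zeros((n, len(offsets) * n))
    for pos, word in enumerate(tokens):
        if word not in index:
            continue
        for block, off in enumerate(offsets):
            j = pos + off
            if 0 <= j < len(tokens) and tokens[j] in index:
                counts[index[word], block * n + index[tokens[j]]] += 1
    return counts


def reduce_dimensions(matrix, k):
    """Keep the coordinates of each row along the k largest singular directions."""
    u, s, _ = np.linalg.svd(matrix, full_matrices=False)
    return u[:, :k] * s[:k]


def cosine_distance(a, b):
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return 1.0 if denom == 0 else 1.0 - float(a @ b) / denom


def agglomerate(vectors, n_clusters):
    """Naive average-linkage agglomerative clustering over row indices."""
    clusters = [[i] for i in range(len(vectors))]

    def linkage(c1, c2):
        return np.mean([cosine_distance(vectors[i], vectors[j]) for i in c1 for j in c2])

    while len(clusters) > n_clusters:
        # Find the closest pair of clusters and merge it.
        _, a, b = min(
            (linkage(clusters[a], clusters[b]), a, b)
            for a in range(len(clusters))
            for b in range(a + 1, len(clusters))
        )
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return clusters


# Hypothetical toy corpus (the paper itself uses German newspaper text).
corpus = (
    "the dog runs . the cat sleeps . a dog sleeps . "
    "a cat runs . the dog sleeps . a cat sleeps ."
).split()
vocab = ["the", "a", "dog", "cat", "runs", "sleeps", "."]

raw = context_vectors(corpus, vocab, window=1)
reduced = reduce_dimensions(raw, k=4)          # k is an illustrative choice
induced = [[vocab[i] for i in c] for c in agglomerate(reduced, n_clusters=4)]
print(induced)
```

On this toy input the reduced vectors of words with the same distribution become nearly parallel, so the clustering step groups the determiners, the nouns, and the verbs, and leaves the sentence delimiter on its own. On real data the number of retained dimensions and the number of clusters would have to be chosen empirically.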
Pages: 422-429
Page count: 8