Controlling Complexity in Part-of-Speech Induction

Cited by: 5
Authors
Graca, Joao V. [1]
Ganchev, Kuzman [2]
Coheur, Luisa [1]
Pereira, Fernando [3]
Taskar, Ben [4]
Affiliations
[1] L2F INESC ID, Lisbon, Portugal
[2] Google Inc, New York, NY USA
[3] Google Inc, Mountain View, CA USA
[4] Univ Penn, Philadelphia, PA 19104 USA
DOI
10.1613/jair.3348
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We consider the problem of fully unsupervised learning of grammatical (part-of-speech) categories from unlabeled text. The standard maximum-likelihood hidden Markov model for this task performs poorly, because of its weak inductive bias and large model capacity. We address this problem by refining the model and modifying the learning objective to control its capacity via parametric and non-parametric constraints. Our approach enforces word-category association sparsity, adds morphological and orthographic features, and eliminates hard-to-estimate parameters for rare words. We develop an efficient learning algorithm that is not much more computationally intensive than standard training. We also provide an open-source implementation of the algorithm. Our experiments on five diverse languages (Bulgarian, Danish, English, Portuguese, Spanish) achieve significant improvements compared with previous methods for the same task.
Pages: 527 - 551
Page count: 25
Related papers (50 in total)
  • [1] Part-of-Speech Induction for Vietnamese
    Phuong Le-Hong
    Thi Minh Huyen Nguyen
    [J]. KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2013), VOL 2, 2014, 245 : 261 - 272
  • [2] The computational complexity of rule-based part-of-speech tagging
    Oliva, K
    Kveton, P
    Ondruska, R
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2003, 2807 : 82 - 89
  • [3] Part-of-speech persistence: The influence of part-of-speech information on lexical processes
    Melinger, Alissa
    Koenig, Jean-Pierre
    [J]. JOURNAL OF MEMORY AND LANGUAGE, 2007, 56 (04) : 472 - 489
  • [4] Part-of-speech tagging
    Martinez, Angel R.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2012, 4 (01) : 107 - 113
  • [5] ADVERBIAL PART-OF-SPEECH
    CERVONI, J
    [J]. LANGUE FRANCAISE, 1990, (88) : 5 - 11
  • [6] Mutual Information Maximization for Simple and Accurate Part-Of-Speech Induction
    Stratos, Karl
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019 : 1095 - 1104
  • [7] Part-of-speech induction by singular value decomposition and hierarchical clustering
    Rapp, R
    [J]. FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006 : 422 - 429
  • [8] A Universal Part-of-Speech Tagset
    Petrov, Slav
    Das, Dipanjan
    McDonald, Ryan
    [J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012 : 2089 - 2096
  • [9] Part-of-speech tagging for Swedish
    Prütz, K
    [J]. PARALLEL CORPORA, PARALLEL WORLDS, 2002, (43) : 201 - 206
  • [10] Part-of-speech studies in Chinese
    Wang, Lu
    [J]. JOURNAL OF QUANTITATIVE LINGUISTICS, 2016, 23 (03) : 235 - 255