The VC dimension of constraint-based grammars

被引:6
|
作者
Bane, Max [1 ]
Riggle, Jason [1 ]
Sonderegger, Morgan [2 ]
机构
[1] Univ Chicago, Dept Linguist, Chicago, IL 60637 USA
[2] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
关键词
Complexity; Learnability; Optimality Theory; Harmonic Grammar; VC dimension;
D O I
10.1016/j.lingua.2009.07.001
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
We analyze the complexity of Harmonic Grammar (HG), a linguistic model in which licit underlying-to-surface-form mappings are determined by optimization over weighted constraints. We show that the Vapnik-Chervonenkis dimension of HG grammars with k constraints is k - 1. This establishes a fundamental bound on the complexity of HG in terms of its capacity to classify sets of linguistic data that has significant ramifications for learnability. The VC dimension of HG is the same as that of Optimality Theory (OT), which is similar to HG, but uses ranked rather than weighted constraints in optimization. The parity of the VC dimension in these two models is somewhat surprising because OT defines finite classes of grammars there are at most k! ways to rank k constraints while HG can define infinite classes of grammars because the weights associated with constraints are real-valued. The parity is also surprising because HG permits groups of constraints that interact through so-called 'gang effects' to generate languages that cannot be generated in OT. The fact that the VC dimension grows linearly with the number of constraints in both models means that, even in the worst case, the number of randomly chosen training samples needed to weight/rank a known set of constraints is a linear function of k. We conclude that though there may be factors that favor one model or the other, the complexity of learning weightings/rankings is not one of them. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:1194 / 1208
页数:15
相关论文
共 50 条
  • [1] Lexical rules in constraint-based grammars
    Briscoe, T
    Copestake, A
    [J]. COMPUTATIONAL LINGUISTICS, 1999, 25 (04) : 487 - 526
  • [2] Property Grammars: A fully constraint-based theory
    Blache, P
    [J]. CONSTRAINT SOLVING AND LANGUAGE PROCESSING, 2005, 3438 : 1 - 16
  • [3] An algebra for semantic construction in constraint-based grammars
    Copestake, A
    Lascarides, A
    Flickinger, D
    [J]. 39TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2001, : 132 - 139
  • [4] ATTRIBUTE GRAMMARS IN CONSTRAINT-BASED GRAPHICS SYSTEMS
    BARFORD, LA
    VANDERZANDEN, BT
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 1989, 19 (04): : 309 - 328
  • [5] Inducing constraint-based grammars using a domain ontology
    Muresan, S
    [J]. PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, : 995 - 996
  • [6] A constraint-based bottom-up counterpart to Definite Clause Grammars
    Christiansen, H
    [J]. Recent Advances in Natural Language Processing III, 2004, 260 : 227 - 236
  • [7] Offline compilation of chains for head-driven generation with constraint-based grammars
    Tuells, T
    Rigau, G
    Rodríguez, H
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PROCEEDINGS, 2003, 2588 : 180 - 190
  • [8] Constraint-based agents
    Nareyek, A
    [J]. CONSTRAINT-BASED AGENTS: AN ARCHITECTURE FOR CONSTRAINT-BASED MODELING AND LOCAL-SEARCH-BASED REASONING FOR PLANNING AND SCHEDULING IN OPEN AND DYNAMIC WORLDS, 2001, 2062 : 1 - +
  • [9] Constraint-Based Metrics
    Chris Golston
    [J]. Natural Language & Linguistic Theory, 1998, 16 : 719 - 770
  • [10] CONSTRAINT-BASED REASONING
    KASIF, S
    [J]. IEEE EXPERT-INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1991, 6 (06): : 55 - 55