Statistical properties of probabilistic context-free grammars

Author
Chi, ZY [1]
Affiliation
[1] Univ Chicago, Dept Stat, Chicago, IL 60637 USA
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We prove a number of useful results about probabilistic context-free grammars (PCFGs) and their Gibbs representations. We present a method, called the relative weighted frequency method, to assign production probabilities that impose proper PCFG distributions on finite parses. We demonstrate that these distributions have finite entropies. In addition, under these distributions, sizes of parses have finite moments of any order. We show that Gibbs distributions on CFG parses, which generalize PCFG distributions and are more powerful, become PCFG distributions if their features only include frequencies of production rules in parses. Under these circumstances, we prove the equivalence of the maximum-likelihood (ML) estimation procedures for these two types of probability distributions on parses. We introduce the renormalization of improper PCFGs to proper ones. We also study PCFGs from the perspective of stochastic branching processes. We prove that with their production probabilities assigned by the relative weighted frequency method, PCFGs are subcritical, i.e., their branching rates are less than one. We also show that by renormalization, connected supercritical PCFGs become subcritical ones. Finally, some minor issues, including identifiability and approximation of production probabilities of PCFGs, are discussed.
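The abstract does not spell out the relative weighted frequency method, so the following is only a minimal sketch under stated assumptions. It assumes each rule probability is a ratio of weighted rule counts over a finite set of parses (with unit weights this reduces to the ordinary relative-frequency estimate), and it assumes the branching rate can be read as the spectral radius of the mean matrix of nonterminal offspring, as in multitype branching processes. The toy grammar, the nested-tuple tree encoding, and all function names are hypothetical.

```python
from collections import Counter

def rules_of(tree):
    """Yield (lhs, rhs) for every internal node of a nested-tuple parse tree.
    Assumed encoding: (label, child, ...); terminal leaves are plain strings."""
    label, *children = tree
    rhs = tuple(c if isinstance(c, str) else c[0] for c in children)
    yield (label, rhs)
    for c in children:
        if not isinstance(c, str):
            yield from rules_of(c)

def estimate(weighted_trees):
    """Assumed form: p(A -> rhs) = weighted count of the rule
    divided by the weighted count of all rules with left-hand side A."""
    num, den = Counter(), Counter()
    for tree, weight in weighted_trees:
        for lhs, rhs in rules_of(tree):
            num[(lhs, rhs)] += weight
            den[lhs] += weight
    return {rule: num[rule] / den[rule[0]] for rule in num}

def mean_matrix(probs, nonterminals):
    """M[A][B] = expected number of B symbols produced by one rewrite of A."""
    idx = {a: i for i, a in enumerate(nonterminals)}
    m = [[0.0] * len(nonterminals) for _ in nonterminals]
    for (lhs, rhs), p in probs.items():
        for sym in rhs:
            if sym in idx:  # terminals do not reproduce
                m[idx[lhs]][idx[sym]] += p
    return m

def spectral_radius_estimate(m, k=64):
    """Gelfand-style estimate: the infinity norm of M^k, raised to 1/k.
    This is an upper bound on the spectral radius that tightens as k grows;
    it can underflow to 0 for very small matrices or very large k."""
    n = len(m)
    power = [[float(i == j) for j in range(n)] for i in range(n)]
    for _ in range(k):
        power = [[sum(power[i][t] * m[t][j] for t in range(n))
                  for j in range(n)] for i in range(n)]
    norm = max(sum(abs(x) for x in row) for row in power)
    return norm ** (1.0 / k)

# Toy treebank for the hypothetical grammar S -> S S | a, with unit weights.
treebank = [
    (("S", "a"), 1.0),
    (("S", ("S", "a"), ("S", "a")), 1.0),
]
probs = estimate(treebank)
rho = spectral_radius_estimate(mean_matrix(probs, ["S"]))
print(probs, rho)
```

In this toy case the rule S -> S S is used once and S -> a three times, so the estimates are 1/4 and 3/4, and the single-entry mean matrix is 2 × 1/4 = 0.5. A value below one matches the abstract's claim that such estimates give a subcritical grammar, though the sketch only illustrates that claim on one example and proves nothing.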
Pages: 131-160
Number of pages: 30