Language modeling with probabilistic left corner parsing

被引:5
|
作者
Van Uytsel, DH [1 ]
Van Compernolle, D [1 ]
机构
[1] Katholieke Univ Leuven, ESAT, B-3001 Heverlee, Belgium
来源
COMPUTER SPEECH AND LANGUAGE | 2005年 / 19卷 / 02期
关键词
D O I
10.1016/j.csl.2004.05.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel language model, suitable for large-vocabulary continuous speech recognition, based on parsing with a probabilistic left corner grammar (PLCG). The PLCG probabilities are conditioned on local and non-local features of the partial parse tree, and some of these features are lexical. They are not derived from another stochastic grammar, but directly induced from a treebank, a corpus of text sentences, annotated with parse trees. A context-enriched constituent represents all partial parse trees that are equivalent with respect to the probability of the next parse move. For computational efficiency the parsing problem is represented as a traversal through a compact stochastic network of constituents connected by PLCG moves. The efficiency of the algorithm is due to the fact that the network consists of recursively nested, shared subnetworks. The PLCG-based language model results from accumulating the probabilities of all (partial) paths through this network. Next word probabilities can be computed synchronously with the probabilistic left corner parsing algorithm in one single pass from left to right. They are guaranteed to be normalized, even when pruning less likely paths. Finally, it is shown experimentally that the PLCG-based language model is a competitive alternative to other syntax-based language models, both in efficiency and accuracy. (C) 2004 Elsevier Ltd. All rights reserved.
引用
收藏
页码:171 / 204
页数:34
相关论文
共 50 条
  • [1] A structured language model based on context-sensitive probabilistic left-corner parsing
    Van Uytsel, DH
    Van Aelten, F
    Van Compernolle, D
    [J]. 2ND MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2001, : 223 - 230
  • [2] Probabilistic top-down parsing and language modeling
    Roark, B
    [J]. COMPUTATIONAL LINGUISTICS, 2001, 27 (02) : 249 - 276
  • [3] A Sound and Complete Left-Corner Parsing for Minimalist Grammars
    Stanojevic, Milos
    Stabler, Edward P.
    [J]. COGNITIVE ASPECTS OF COMPUTATIONAL LANGUAGE LEARNING AND PROCESSING, 2018, : 65 - 74
  • [4] Probabilistic Treatment for Syntactic Gaps in Analytic Language Parsing
    Boonkwan, Prachya
    Supnithi, Thepchai
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (03) : 440 - 447
  • [5] ''Almost parsing'' technique for language modeling
    Srinivas, B
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1173 - 1176
  • [6] Structural bias in inducing representations for probabilistic natural language parsing
    Henderson, J
    [J]. ARTIFICAIL NEURAL NETWORKS AND NEURAL INFORMATION PROCESSING - ICAN/ICONIP 2003, 2003, 2714 : 19 - 26
  • [7] Improved left-corner chart parsing for large context-free grammars
    Moore, RC
    [J]. NEW DEVELOPMENTS IN PARSING TECHNOLOGY, 2004, : 185 - 201
  • [8] Left-Corner Parsing With Distributed Associative Memory Produces Surprisal and Locality Effects
    Rasmussen, Nathan E.
    Schuler, William
    [J]. COGNITIVE SCIENCE, 2018, 42 : 1009 - 1042
  • [9] Probabilistic parsing strategies
    Nederhof, Mark-Jan
    Satta, Giorgio
    [J]. JOURNAL OF THE ACM, 2006, 53 (03) : 406 - 436
  • [10] PROBMELA:: a modeling language for communicating probabilistic processes
    Baier, C
    Ciesinski, F
    Grösser, M
    [J]. SECOND ACM AND IEEE INTERNATIONAL CONFERENCE ON FORMAL METHODS AND MODELS FOR CO-DESIGN, PROCEEDINGS, 2004, : 57 - 66