A maximum entropy approach to adaptive statistical language modelling

被引:242
|
作者
Rosenfeld, R
机构
[1] Computer Science Department, Carnegie Mellon University, Pittsburgh
来源
COMPUTER SPEECH AND LANGUAGE | 1996年 / 10卷 / 03期
关键词
D O I
10.1006/csla.1996.0011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An adaptive statistical language model is described, which successfully integrates long distance linguistic information with other knowledge sources. Most existing statistical language models exploit only the immediate history of a text. To extract information from further back in the document's history, we propose and use trigger pairs as the basic information bearing elements. This allows the model to adapt its expectations to the topic of discourse. Next, statistical evidence from multiple sources must be combined. Traditionally, linear interpolation and its variants have been used, but these are shown here to be seriously deficient. Instead, we apply the principle of Maximum Entropy (ME). Each information source gives rise to a set of constraints, to be imposed on the combined estimate. The intersection of these constraints is the set of probability functions which are consistent with all the information sources. The function with the highest entropy within that set is the ME solution. Given consistent statistical evidence, a unique ME solution is guaranteed to exist, and an iterative algorithm exists which is guaranteed to converge to it. The ME framework is extremely general: any phenomenon that can be described in terms of statistics of the text can be readily incorporated. An adaptive language model based on the ME approach was trained on the Wall Street Journal corpus, and showed a 32-39% perplexity reduction over the baseline. When interfaced to SPHINX-II, Carnegie Mellon's speech recognizer, it reduced its error rate by 10-14%. This thus illustrates the feasibility of incorporating many diverse knowledge sources in a single, unified statistical framework. (C) 1996 Academic Press Limited
引用
收藏
页码:187 / 228
页数:42
相关论文
共 50 条
  • [1] Maximum entropy approach to adaptive statistical language modelling
    Carnegie Mellon Univ, Pittsburgh, United States
    Comput Speech Lang, 3 (187-228):
  • [2] A maximum entropy approach for integrating semantic information in statistical language models
    Chueh, CH
    Chien, JT
    Wang, HM
    2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 309 - 312
  • [3] Latent maximum entropy principle for statistical language modeling
    Wang, SJ
    Rosenfeld, R
    Zhao, YX
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 182 - 185
  • [4] Distant bigram language modelling using maximum entropy
    Simons, M
    Ney, H
    Martin, SC
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 787 - 790
  • [5] A maximum entropy approach to natural language processing
    Berger, AL
    DellaPietra, SA
    DellaPietra, VJ
    COMPUTATIONAL LINGUISTICS, 1996, 22 (01) : 39 - 71
  • [6] Combining Statistical Language Models via the Latent Maximum Entropy Principle
    Shaojun Wang
    Dale Schuurmans
    Fuchun Peng
    Yunxin Zhao
    Machine Learning, 2005, 60 : 229 - 250
  • [7] Combining statistical language models via the latent maximum entropy principle
    Wang, SJ
    Schuurmans, D
    Peng, FC
    Zhao, YX
    MACHINE LEARNING, 2005, 60 (1-3) : 229 - 250
  • [8] Maximum entropy approach to statistical inference for an ocean acoustic waveguide
    Knobles, D. P.
    Sagers, J. D.
    Koch, R. A.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (02): : 1087 - 1101
  • [9] Statistical models of complex brain networks: a maximum entropy approach
    Dichio, Vito
    Fallani, Fabrizio De Vico
    REPORTS ON PROGRESS IN PHYSICS, 2023, 86 (10)
  • [10] Adaptive Approach for a Maximum Entropy Algorithm in Ecological Niche Modeling
    Rodrigues, E. S. C.
    Rodrigues, F. A.
    Rocha, R. L. A.
    Correa, P. L. P.
    IEEE LATIN AMERICA TRANSACTIONS, 2011, 9 (03) : 331 - 338