A maximum entropy approach to adaptive statistical language modelling

被引:242
|
作者
Rosenfeld, R
机构
[1] Computer Science Department, Carnegie Mellon University, Pittsburgh
来源
COMPUTER SPEECH AND LANGUAGE | 1996年 / 10卷 / 03期
关键词
D O I
10.1006/csla.1996.0011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An adaptive statistical language model is described, which successfully integrates long distance linguistic information with other knowledge sources. Most existing statistical language models exploit only the immediate history of a text. To extract information from further back in the document's history, we propose and use trigger pairs as the basic information bearing elements. This allows the model to adapt its expectations to the topic of discourse. Next, statistical evidence from multiple sources must be combined. Traditionally, linear interpolation and its variants have been used, but these are shown here to be seriously deficient. Instead, we apply the principle of Maximum Entropy (ME). Each information source gives rise to a set of constraints, to be imposed on the combined estimate. The intersection of these constraints is the set of probability functions which are consistent with all the information sources. The function with the highest entropy within that set is the ME solution. Given consistent statistical evidence, a unique ME solution is guaranteed to exist, and an iterative algorithm exists which is guaranteed to converge to it. The ME framework is extremely general: any phenomenon that can be described in terms of statistics of the text can be readily incorporated. An adaptive language model based on the ME approach was trained on the Wall Street Journal corpus, and showed a 32-39% perplexity reduction over the baseline. When interfaced to SPHINX-II, Carnegie Mellon's speech recognizer, it reduced its error rate by 10-14%. This thus illustrates the feasibility of incorporating many diverse knowledge sources in a single, unified statistical framework. (C) 1996 Academic Press Limited
引用
收藏
页码:187 / 228
页数:42
相关论文
共 50 条
  • [21] Maximum entropy method in comminution modelling
    Otwinowski, Henryk
    GRANULAR MATTER, 2006, 8 (3-4) : 239 - 249
  • [22] Localized maximum entropy shape modelling
    Loog, Marco
    INFORMATION PROCESSING IN MEDICAL IMAGING, PROCEEDINGS, 2007, 4584 : 619 - 629
  • [23] Maximum Entropy Method in Comminution Modelling
    Henryk Otwinowski
    Granular Matter, 2006, 8 : 239 - 249
  • [24] An improved maximum entropy language model
    Fang, GL
    Wen, G
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 1083 - 1086
  • [25] DEVELOPMENT OF A MAXIMUM ENTROPY APPROACH FOR THE THERMOMECANICAL MODELLING OF THE ROTARY FRICTION WELDING PROCESS
    Foca, M.
    Racineux, G.
    Stainier, L.
    COMPUTATIONAL METHODS FOR COUPLED PROBLEMS IN SCIENCE AND ENGINEERING V, 2013, : 1354 - 1364
  • [26] Entropy minimization within the maximum entropy approach
    Vakarin, E. V.
    Badiali, J. P.
    4TH INTERNATIONAL WORKSHOP ON STATISTICAL PHYSICS AND MATHEMATICS FOR COMPLEX SYSTEMS (SPMCS2014), 2015, 604
  • [27] Optimized approach to maximum entropy
    Alter-Gartenberg, R
    Park, SK
    NONLINEAR IMAGE PROCESSING IX, 1998, 3304 : 2 - 13
  • [28] Maximum entropy approach to reliability
    Du, Yi-Mu
    Ma, Yu-Han
    Wei, Yuan-Fa
    Guan, Xuefei
    Sun, C. P.
    PHYSICAL REVIEW E, 2020, 101 (01)
  • [29] MAXIMUM ENTROPY ARGUMENTS IN RELATIVISTIC STATISTICAL MECHANICS
    JOHNS, KA
    LANDSBERG, PT
    JOURNAL OF PHYSICS PART A GENERAL, 1970, 3 (02): : 121 - +
  • [30] Classical thermodynamic-statistical approach of open systems deviating from maximum entropy
    Dejak, C
    Pastres, R
    Pecenik, G
    Polenghi, I
    TEMPOS IN SCIENCE AND NATURE: STRUCTURES, RELATIONS, AND COMPLEXITY, 1999, 879 : 400 - 405