Model-based estimation of word saliency in text

Cited by: 0
Authors
Wang, Xin [1]
Kaban, Ata [1]
Affiliation
[1] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104; 0812; 0835; 1405
Abstract
We investigate a generative latent variable model for model-based word saliency estimation in text modelling and classification. The derived estimation algorithm infers the saliency of words with respect to the mixture modelling objective. Experimental results show that common stop-words, as well as other corpus-specific common words, are automatically down-weighted, which enhances our ability to capture the essential structure in the data while ignoring irrelevant details. As a classifier, our approach improves over the class prediction accuracy of the Naive Bayes classifier in all our experiments. Compared with a recent state-of-the-art text classification method (the Dirichlet Compound Multinomial model), we obtained improved results on two out of three benchmark text collections tested and comparable results on the remaining one.
Pages: 279-290
Number of pages: 12