Model-based estimation of word saliency in text

Authors
Wang, Xin [1]
Kaban, Ata [1]
Affiliations
[1] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
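Source
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract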
We investigate a generative latent variable model for estimating word saliency in text modelling and classification. The derived estimation algorithm infers the saliency of words with respect to the mixture modelling objective. Our experiments show that common stop-words, as well as other corpus-specific common words, are automatically down-weighted, which enhances our ability to capture the essential structure in the data while ignoring irrelevant details. As a classifier, our approach improves on the class prediction accuracy of the Naive Bayes classifier in all our experiments. Compared with a recent state-of-the-art text classification method (the Dirichlet Compound Multinomial model), we obtained improved results on two of the three benchmark text collections tested and comparable results on the third.
Pages: 279-290
Number of pages: 12