Model-based estimation of word saliency in text

被引:0
|
作者
Wang, Xin [1 ]
Kaban, Ata [1 ]
机构
[1] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We investigate a generative latent variable model for model-based word saliency estimation for text modelling and classification. The estimation algorithm derived is able to infer the saliency of words with respect to the mixture modelling objective. We demonstrate experimental results showing that common stop-words as well as other corpus-specific common words are automatically down-weighted and this enhances our ability to capture the essential structure in the data, ignoring irrelevant details. As a classifier, our approach improves over the class prediction accuracy of the Naive Bayes classifier in all our experiments. Compared with a recent state of the art text classification method (Dirichlet Compound Multinomial model) we obtained improved results in two out of three benchmark text collections tested, and comparable results on one other data set.
引用
收藏
页码:279 / 290
页数:12
相关论文
共 50 条
  • [1] Adaptive Gradient-based Word Saliency for adversarial text attacks
    Qi, Yupeng
    Yang, Xinghao
    Liu, Baodi
    Zhang, Kai
    Liu, Weifeng
    [J]. NEUROCOMPUTING, 2024, 590
  • [2] Knowledge based word-concept model estimation and refinement for biomedical text mining
    Yepes, Antonio Jimeno
    Berlanga, Rafael
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 53 : 300 - 307
  • [3] Visual Saliency Model-Based Image Watermarking with Laplacian Distribution
    Liu, Hongmei
    Liu, Jinhua
    Zhao, Mingfeng
    [J]. INFORMATION, 2018, 9 (09)
  • [4] Model-based Clustering of Short Text Streams
    Yin, Jianhua
    Chao, Daren
    Liu, Zhongkun
    Zhang, Wei
    Yu, Xiaohui
    Wang, Jianyong
    [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 2634 - 2642
  • [5] Text to Region: Visual-Word Guided Saliency Detection
    Xing, Tengfei
    Wang, Zhaohui
    Yang, Jianyu
    Ji, Yi
    Liu, Chunping
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 740 - 749
  • [6] Saliency Estimation Model Based on Superpixel and Regions Contrast
    Xie, Zhaoxia
    [J]. 2017 10TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL. 1, 2017, : 466 - 469
  • [7] Model-Based Correlated Channels Estimation
    Alimorad Mahmoudi
    [J]. Wireless Personal Communications, 2017, 92 : 483 - 493
  • [8] Model-based road friction estimation
    Shim, T
    Margolis, D
    [J]. VEHICLE SYSTEM DYNAMICS, 2004, 41 (04) : 249 - 276
  • [9] MODEL-BASED ATTITUDE ESTIMATION FOR MULTICOPTERS
    Baranek, Radek
    Solc, Frantisek
    [J]. ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2014, 12 (05) : 501 - 510
  • [10] Model-Based Correlated Channels Estimation
    Mahmoudi, Alimorad
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2017, 92 (02) : 483 - 493