Supervised Topic Models for Microblog Classification

被引:9
|
作者
Kataria, Saurabh [1 ]
Agarwal, Arvind [1 ]
机构
[1] Palo Alto Res Ctr, Webster, NY 14580 USA
关键词
D O I
10.1109/ICDM.2015.148
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a topic model based approach for classifying micro-blog posts into a given topics of interests. The short nature of micro-blog posts make them challenging for directly learning a classification model. To overcome this limitation, we use content of the links embedded in these posts to improve the topic learning. The hypothesis is that since the link content is far richer than the content of the post itself, using link content along with the content of the post will help learning. However, how this link content can be used to construct features for classification remains a challenging issue. Furthermore, in previous methods, user based information is utilized in an ad-hoc manner that only work for certain type of classification, such as characterizing content of microblogs. In this paper, we propose supervised topic model, User-Labeled-LDA and its nonparametric variant that can avoid the ad-hoc feature construction task and model the topics in a discriminative way. Our experiments on a Twitter dataset shows that modeling user interests and link information helps in learning quality topics for sparse tweets as well as helps significantly in classification task. Our experiments further show that modeling this information in a principled way through topic models helps more than simply adding this information through features.
引用
收藏
页码:793 / 798
页数:6
相关论文
共 50 条
  • [1] Supervised topic models for multi-label classification
    Li, Ximing
    Ouyang, Jihong
    Zhou, Xiaotang
    NEUROCOMPUTING, 2015, 149 : 811 - 819
  • [2] Learning Supervised Topic Models for Classification and Regression from Crowds
    Rodrigues, Filipe
    Lourenco, Mariana
    Ribeiro, Bernardete
    Pereira, Francisco C.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2409 - 2422
  • [3] Microblog Sentiment Classification Based on Supervised and Unsupervised Combination
    Pei, Shaojie
    Zhang, Lumin
    Li, Aiping
    Liu, Yusha
    TRUSTWORTHY COMPUTING AND SERVICES, 2014, 426 : 138 - 145
  • [4] Heterogeneous Supervised Topic Models
    Sridhar, Dhanya
    Daume III, Hal
    Blei, David
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 732 - 745
  • [5] Supervised topic models with weighted words:multi-label document classification
    Yue-peng ZOU
    Ji-hong OUYANG
    Xi-ming LI
    Frontiers of Information Technology & Electronic Engineering, 2018, 19 (04) : 513 - 523
  • [6] Supervised topic models with word order structure for document classification and retrieval learning
    Jameel, Shoaib
    Lam, Wai
    Bing, Lidong
    INFORMATION RETRIEVAL JOURNAL, 2015, 18 (04): : 283 - 330
  • [7] Supervised topic models with weighted words: multi-label document classification
    Zou, Yue-peng
    Ouyang, Ji-hong
    Li, Xi-ming
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2018, 19 (04) : 513 - 523
  • [8] Supervised Nonparametric Multimodal Topic Models for Multi-class Video Classification
    Xue, Jianfei
    Eguchi, Koji
    ITE TRANSACTIONS ON MEDIA TECHNOLOGY AND APPLICATIONS, 2019, 7 (02): : 80 - 91
  • [9] Supervised topic models with weighted words: multi-label document classification
    Yue-peng Zou
    Ji-hong Ouyang
    Xi-ming Li
    Frontiers of Information Technology & Electronic Engineering, 2018, 19 : 513 - 523
  • [10] Supervised topic models with word order structure for document classification and retrieval learning
    Shoaib Jameel
    Wai Lam
    Lidong Bing
    Information Retrieval Journal, 2015, 18 : 283 - 330