ClassiNet - Predicting Missing Features for Short-Text Classification

被引:9
|
作者
Bollegala, Dan Ushka [1 ]
Atanasov, Vincent [1 ]
Maehara, Takanori [2 ]
Kawarabayashi, Ken-Ichi [3 ]
机构
[1] Univ Liverpool, Dept Comp Sci, Liverpool L16 3GL, Merseyside, England
[2] RIKEN Ctr Adv Intelligence Project, Discrete Optimizat Unit, Chuo Ku, 1-4-1 Nihonbashi, Tokyo 1030027, Japan
[3] Natl Inst Informat, Chiyoda Ku, 2-1-2 Hitotsubashi, Tokyo 1018430, Japan
基金
日本科学技术振兴机构;
关键词
Classifier networks; feature sparseness; short-texts; text classification;
D O I
10.1145/3201578
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Short and sparse texts such as tweets, search engine snippets, product reviews, and chat messages are abundant on the Web. Classifying such short-texts into a pre-defined set of categories is a common problem that arises in various contexts, such as sentiment classification, spam detection, and information recommendation. The fundamental problem in short-text classification is feature sparseness - the lack of feature overlap between a trained model and a test instance to be classified. We propose ClassiNet - a network of classifiers trained for predicting missing features in a given instance, to overcome the feature sparseness problem. Using a set of unlabeled training instances, we first learn binary classifiers as feature predictors for predicting whether a particular feature occurs in a given instance. Next, each feature predictor is represented as a vertex v(i) in the ClassiNet, where a one-to-one correspondence exists between feature predictors and vertices. The weight of the directed edge e(ij) connecting a vertex v(i) to a vertex v(j) represents the conditional probability that given v(i) exists in an instance, v(j) also exists in the same instance. We show that ClassiNets generalize word co-occurrence graphs by considering implicit co-occurrences between features. We extract numerous features from the trained ClassiNet to overcome feature sparseness. In particular, for a given instance x, we find similar features from ClassiNet that did not appear in x, and append those features in the representation of x. Moreover, we propose a method based on graph propagation to find features that are indirectly related to a given short-text. We evaluate ClassiNets on several benchmark datasets for short-text classification. Our experimental results show that by using ClassiNet, we can statistically significantly improve the accuracy in short-text classification tasks, without having to use any external resources such as thesauri for finding related features.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] Review of short-text classification
    Alsmadi, Issa
    Gan, Keng Hoon
    [J]. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2019, 15 (02) : 155 - 182
  • [2] Short-Text Classification Method with Text Features from Pre-trained Models
    Chen, Jie
    Ma, Jing
    Li, Xiaofeng
    [J]. Data Analysis and Knowledge Discovery, 2021, 5 (09) : 21 - 30
  • [3] Short-text classification based on ICA and LSA
    Pu, Qiang
    Yang, Guo-Wei
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 2, PROCEEDINGS, 2006, 3972 : 265 - 270
  • [4] Intent Classification of Short-Text on Social Media
    Purohit, Hemant
    Dong, Guozhu
    Shalin, Valerie
    Thirunarayan, Krishnaprasad
    Sheth, Amit
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SMART CITY/SOCIALCOM/SUSTAINCOM (SMARTCITY), 2015, : 222 - 228
  • [5] Language independent semantic kernels for short-text classification
    Kim, Kwanho
    Chung, Beom-suk
    Choi, Yerim
    Lee, Seungjun
    Jung, Jae-Yoon
    Park, Jonghun
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (02) : 735 - 743
  • [6] Knowledge Guided Short-Text Classification For Healthcare Applications
    Cao, Shilei
    Qian, Buyue
    Yin, Changchang
    Li, Xiaoyu
    Wei, Jishang
    Zheng, Qinghua
    Davidson, Ian
    [J]. 2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 31 - 40
  • [7] Deep Neural Network for Short-Text Sentiment Classification
    Li, Xiangsheng
    Pang, Jianhui
    Mo, Biyun
    Rao, Yanghui
    Wang, Fu Lee
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2016, 2016, 9645 : 168 - 175
  • [8] Classification of Sentiments in Short-Text: An approach using mSMTP measure
    Kumar, H. M. Keerthi
    Harish, B. S.
    Kumar, S. V. Aruna
    Aradhya, V. N. Manjunath
    [J]. 2ND INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING (ICMLSC 2018), 2015, : 145 - 150
  • [9] Regularized Supervised Distance Preserving Projections for Short-Text Classification
    Alencar, Alisson S. C.
    Gomes, Joao Paulo P.
    Souza Junior, Amauri H.
    Freire, Livio A. M.
    Silva, Jose Wellington F.
    Andrade, Rossana M. C.
    Castro, Miguel F.
    [J]. 2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 216 - 221
  • [10] Term weighting scheme for short-text classification: Twitter corpuses
    Issa Alsmadi
    Gan Keng Hoon
    [J]. Neural Computing and Applications, 2019, 31 : 3819 - 3831