A Text Classifier with Domain Adaptation for Sentiment Classification

被引:0
|
作者
Chen, Wei [1 ]
Zhou, Jingyu [1 ]
机构
[1] Shanghai Jiao Tong Univ, MOE MS Key Lab Intelligent Comp & Intelligent Sys, Sch Software, Shanghai 200240, Peoples R China
来源
关键词
domain adaptation; sentiment classification; Naive Bayes; EM; semi-supervised;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In sentiment classification, traditional classification algorithms cannot perform well when the number of labeled data is limited. EM-based Naive Bayes algorithm is often employed to argument the labeled data with the unlabeled ones. However, such an approach assumes the distributions of these two sets of data are identical, which may not hold in practice and often results in inferior performance. We propose a semi-supervised algorithm, called Ratio-Adjusted EM-based Naive Bayes (RAEMNB), for sentiment classification, which combines knowledge from a source domain and limited training instances from a target domain. In RAEMNB, the initial Bayes model is trained from labeled instances from both domains. During each EM iteration, we add an extra R-step to adjust the ratio of predicted positive instances to negative ones, which is approximated with labeled instances of target domain. Experimental results show that our RAEMNB approach outperforms the traditional supervised, semi-supervised classifiers.
引用
收藏
页码:61 / 72
页数:12
相关论文
共 50 条
  • [1] Text Length Adaptation in Sentiment Classification
    Amplayo, Reinald Kim
    Lim, Seonjae
    Hwang, Seung-won
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 646 - 661
  • [2] Multi-domain Adaptation for Sentiment Classification: using Multiple Classifier Combining Methods
    Li, Shoushan
    Zong, Chengqing
    IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2008, : 458 - 465
  • [3] Multi-Domain Sentiment Classification with Classifier Combination
    Shou-Shan Li
    Chu-Ren Huang
    Cheng-Qing Zong
    Journal of Computer Science and Technology, 2011, 26 : 25 - 33
  • [4] Multi-Domain Sentiment Classification with Classifier Combination
    李寿山
    黄居仁
    宗成庆
    Journal of Computer Science & Technology, 2011, 26 (01) : 25 - 33
  • [5] Multi-Domain Sentiment Classification with Classifier Combination
    Li, Shou-Shan
    Huang, Chu-Ren
    Zong, Cheng-Qing
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2011, 26 (01) : 25 - 33
  • [6] Text classification: Naïve bayes classifier with sentiment Lexicon
    Le, Cong-Cuong
    Prasad, P.W.C.
    Alsadoon, Abeer
    Pham, L.
    Elchouemi, A.
    IAENG International Journal of Computer Science, 2019, 46 (02) : 141 - 148
  • [7] Distant Domain Adaptation for Text Classification
    Zhu, Zhenlong
    Li, Yuhua
    Li, Ruixuan
    Gu, Xiwu
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2018), PT I, 2018, 11061 : 55 - 66
  • [8] Short Text Sentiment Classification Based on Feature extension and ensemble classifier
    Liu, Yang
    Zhu, Xie
    6TH INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN, MANUFACTURING, MODELING AND SIMULATION (CDMMS 2018), 2018, 1967
  • [9] Domain Adaptation with Common Feature Space for Sentiment Classification
    Hong W.
    Qi J.
    Wang W.
    Zheng X.
    Weng Y.
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2019, 52 (06): : 631 - 637
  • [10] Domain Adaptation for Sentiment Classification in Light of Multiple Sources
    Fang, Fang
    Dutta, Kaushik
    Datta, Anindya
    INFORMS JOURNAL ON COMPUTING, 2014, 26 (03) : 586 - 598