Classification of multi-lingual tweets, into multi-class model using Naïve Bayes and semi-supervised learning

Authors
Ayaz H. Khan
Muhammad Zubair
Affiliations
[1] Habib University, Computer Science Department
[2] Karachi Institute of Economics and Technology, College of Computing and Information Sciences
Keywords
Twitter; Sentiment analysis; Sentiment classification; Semi-supervised learning
DOI
Not available
Abstract
Twitter is a social media platform that has proven to be a valuable source of insight into public sentiment about products, policies, and similar topics. Its messages, called tweets, are limited to 280 characters and carry direct, unfiltered emotions from a large user population. Twitter has attracted the attention of many researchers because every tweet is public by default, which is not the case with Facebook. This paper proposes a model for multi-lingual (English and Roman Urdu) classification of tweets over a diverse, non-hierarchical set of classes. Previous work on tweet classification focuses narrowly either on a single language or, at most, on a uniform set of classes (Positive, Extremely Positive, Negative, and Extremely Negative). The proposed model is based on semi-supervised learning, and the proposed feature selection approach makes it less dependent and highly adaptive in capturing trending terms, which makes it a strong candidate for streaming data. Using the Naïve Bayes learning algorithm in each phase, the methodology achieved an accuracy of up to 87.16%, outperforming both KNN and SVM models, which are popular in the NLP and text classification domains.
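The abstract combines Naïve Bayes with semi-supervised learning but does not spell out the procedure. As a rough illustration only, the sketch below shows one common way to combine the two: self-training, where a multinomial Naïve Bayes model labels unlabeled tweets and absorbs the confident predictions into its training set. This is not the authors' method; their feature selection approach is not reproduced. The class names (`sports`, `politics`), the toy English and Roman Urdu tweets, the confidence `threshold`, and all function names are assumptions made for this example.

```python
import math
from collections import Counter, defaultdict


def train_nb(docs, labels):
    """Fit a multinomial Naive Bayes model (counts only; smoothing is applied at prediction)."""
    class_counts = Counter(labels)
    word_counts = defaultdict(Counter)
    vocab = set()
    for tokens, label in zip(docs, labels):
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab


def predict_proba(model, tokens):
    """Return {class: posterior probability} for one tokenized tweet."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    log_scores = {}
    for c, n_docs in class_counts.items():
        total_words = sum(word_counts[c].values())
        score = math.log(n_docs / total_docs)
        for t in tokens:
            if t in vocab:  # words never seen in training are ignored
                # Add-one (Laplace) smoothing over the vocabulary.
                score += math.log((word_counts[c][t] + 1) / (total_words + len(vocab)))
        log_scores[c] = score
    # Normalize in log space for numerical stability.
    peak = max(log_scores.values())
    exp_scores = {c: math.exp(s - peak) for c, s in log_scores.items()}
    norm = sum(exp_scores.values())
    return {c: v / norm for c, v in exp_scores.items()}


def self_train(labeled, unlabeled, threshold=0.8, max_rounds=5):
    """Self-training loop; `threshold` is an assumed confidence cutoff."""
    docs = [tokens for tokens, _ in labeled]
    labels = [label for _, label in labeled]
    pool = list(unlabeled)
    model = train_nb(docs, labels)
    for _ in range(max_rounds):
        remaining = []
        for tokens in pool:
            probs = predict_proba(model, tokens)
            best = max(probs, key=probs.get)
            if probs[best] >= threshold:
                docs.append(tokens)  # accept the pseudo-label
                labels.append(best)
            else:
                remaining.append(tokens)
        if len(remaining) == len(pool):
            break  # nothing new was accepted in this round
        pool = remaining
        model = train_nb(docs, labels)  # retrain with the pseudo-labeled tweets
    return model


def classify(model, text):
    """Predict the most probable class for a raw tweet string."""
    probs = predict_proba(model, text.lower().split())
    return max(probs, key=probs.get)


# Hypothetical toy data mixing English and Roman Urdu.
labeled = [
    ("great cricket match today".split(), "sports"),
    ("zabardast match tha cricket ka".split(), "sports"),
    ("new election policy announced".split(), "politics"),
    ("hukumat ki nayi policy".split(), "politics"),
]
unlabeled = [
    "cricket team ne match jeet liya".split(),
    "election mein hukumat ki policy".split(),
]

model = self_train(labeled, unlabeled)
print(classify(model, "team jeet liya"))
print(classify(model, "hukumat mein election"))
```

In this toy setup, words such as "team" and "jeet" appear only in the unlabeled tweets, so the model can use them only after self-training has pseudo-labeled those tweets. That loosely mirrors the abstract's point about adapting to trending terms in streaming data.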
Pages: 32749-32767
Number of pages: 18
Source
Khan, Ayaz H.; Zubair, Muhammad. Classification of multi-lingual tweets, into multi-class model using Naive Bayes and semi-supervised learning. Multimedia Tools and Applications, 2020, 79 (43-44): 32749-32767.