Classification of Sentiments in Short-Text: An approach using mSMTP measure

Citations: 0
Authors
Kumar, H. M. Keerthi [1 ]
Harish, B. S. [2 ]
Kumar, S. V. Aruna [2 ]
Aradhya, V. N. Manjunath [2 ]
Affiliations
[1] Sri Jayachamarajendra Coll Engn, JSS Res Fdn, Mysuru, India
[2] Sri Jayachamarajendra Coll Engn, Mysuru, India
Keywords
Sentiment Analysis; Short Text; Similarity Measure; Classification
DOI
10.1145/3184066.3184074
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Sentiment analysis, or opinion mining, is an automated process for recognizing the opinions, moods, emotions, and attitudes of individuals or communities through natural language processing, text analysis, and computational linguistics. In recent years, many studies have concentrated on blogs, tweets, forums, and consumer review websites to identify community sentiment. Information retrieved from social networking sites tends to be short, informal text because blogging sites and consumer review websites limit the number of characters. Sentiment analysis of short text is challenging: because of the character limit, users tend to shorten their messages, which leads to misspellings, slang terms, and shortened forms of words. Moreover, compared with regular text, short texts are characterized more strongly by the presence and absence of terms (features). In this work, our major goal is to classify sentiments into positive, negative, or neutral polarity using a new similarity measure. The proposed method embeds the modified Similarity Measure for Text Processing (mSMTP) in a K-Nearest Neighbor (KNN) classifier. The effectiveness of the proposed method is evaluated by comparing it with Euclidean Distance, Cosine Similarity, Jaccard Coefficient, and Correlation Coefficient. The proposed method is also compared with other classifiers, such as Support Vector Machine and Random Forest, on a benchmark dataset. The classification results are evaluated in terms of Accuracy, Precision, Recall, and F-measure.
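The general setup the abstract describes, a KNN classifier whose notion of "nearest" is supplied by an interchangeable similarity measure, can be sketched as follows. This is only an illustration under stated assumptions: the mSMTP formula is not given in this record, so the sketch plugs in two of the baseline measures the abstract names (Cosine Similarity and Jaccard Coefficient). The function names, the whitespace tokenizer, the toy training sentences, and the two-class labels are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def jaccard_coefficient(a, b):
    """Jaccard coefficient computed on term presence/absence only."""
    terms_a, terms_b = set(a), set(b)
    union = terms_a | terms_b
    return len(terms_a & terms_b) / len(union) if union else 0.0


def knn_predict(train, text, similarity, k=3):
    """Label `text` by majority vote among its k most similar training texts.

    `similarity` is any function of two Counters that returns a score where
    larger means more similar; a measure such as mSMTP would be passed here.
    """
    query = Counter(text.lower().split())  # naive whitespace tokenizer (assumption)
    scored = sorted(
        ((similarity(query, Counter(doc.lower().split())), label)
         for doc, label in train),
        key=lambda pair: pair[0],
        reverse=True,
    )
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy data, for illustration only.
TRAIN = [
    ("love this phone great battery", "positive"),
    ("great service love it", "positive"),
    ("awesome camera great screen", "positive"),
    ("hate this phone bad battery", "negative"),
    ("terrible service bad support", "negative"),
    ("worst camera bad screen", "negative"),
]

if __name__ == "__main__":
    for measure in (cosine_similarity, jaccard_coefficient):
        label = knn_predict(TRAIN, "bad battery terrible screen", measure)
        print(measure.__name__, label)
```

Because the similarity function is a parameter of `knn_predict`, comparing measures amounts to rerunning the same classifier with a different argument, which mirrors the kind of comparison the abstract reports.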
Pages: 145-150
Page count: 6