Supervised sentiment analysis in Czech social media

被引:53
|
作者
Habernal, Ivan [1 ,2 ]
Ptacek, Tomas [2 ]
Steinberger, Josef [1 ,2 ]
机构
[1] Univ W Bohemia, Fac Sci Appl, Dept Comp Sci & Engn, Plzen 830614, Czech Republic
[2] Univ W Bohemia, Fac Sci Appl, NTIS New Technol Informat Soc, Plzen 830614, Czech Republic
关键词
Sentiment analysis; Czech language; Social media; Machine learning; Feature selection; FEATURE-SELECTION; CLASSIFICATION;
D O I
10.1016/j.ipm.2014.05.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article describes in-depth research on machine learning methods for sentiment analysis of Czech social media. Whereas in English, Chinese, or Spanish this field has a long history and evaluation datasets for various domains are widely available, in the case of the Czech language no systematic research has yet been conducted. We tackle this issue and establish a common ground for further research by providing a large human-annotated Czech social media corpus. Furthermore, we evaluate state-of-the-art supervised machine learning methods for sentiment analysis. We explore different pre-processing techniques and employ various features and classifiers. We also experiment with five different feature selection algorithms and investigate the influence of named entity recognition and preprocessing on sentiment classification performance. Moreover, in addition to our newly created social media dataset, we also report results for other popular domains, such as movie and product reviews. We believe that this article will not only extend the current sentiment analysis research to another family of languages, but will also encourage competition, potentially leading to the production of high-end commercial solutions. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:693 / 707
页数:15
相关论文
共 50 条
  • [1] Sentiment Analysis for Social Media
    Iglesias, Carlos A.
    Moreno, Antonio
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (23):
  • [2] Sentiment Analysis on Social Media
    Neri, Federico
    Aliprandi, Carlo
    Capeci, Federico
    Cuadros, Montserrat
    By, Tomas
    [J]. 2012 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2012, : 919 - 926
  • [3] Distantly Supervised Lifelong Learning for Large-Scale Social Media Sentiment Analysis
    Xia, Rui
    Jiang, Jie
    He, Huihui
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2017, 8 (04) : 480 - 491
  • [4] Supervised sentiment analysis in Czech social media (Reprinted from Information Processing and Management, vol 50, pg 693-707, 2014)
    Habernal, Ivan
    Ptacek, Tomas
    Steinberger, Josef
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2015, 51 (04) : 532 - 546
  • [5] SENTIMENT ANALYSIS OF THE INFOGRAPHICS ON SOCIAL MEDIA
    Bratic, Diana
    Palic, Mirko
    Miljkovic, Petar
    [J]. PROCEEDINGS OF FEB ZAGREB 12TH INTERNATIONAL ODYSSEY CONFERENCE ON ECONOMICS AND BUSINESS, 2021, 2021, 3 : 1072 - 1081
  • [6] A survey of sentiment analysis in social media
    Lin Yue
    Weitong Chen
    Xue Li
    Wanli Zuo
    Minghao Yin
    [J]. Knowledge and Information Systems, 2019, 60 : 617 - 663
  • [7] Multi-Tier Sentiment Analysis of Social Media Text Using Supervised Machine Learning
    Rahman, Hameedur
    Tariq, Junaid
    Masood, M. Ali
    Subahi, Ahmad F.
    Khalaf, Osamah Ibrahim
    Alotaibi, Youseef
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (03): : 5527 - 5543
  • [8] Sentiment Analysis of Events in Social Media
    Petrescu, Alexandru
    Truica, Ciprian-Octavian
    Apostol, Elena-Simona
    [J]. 2019 IEEE 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP 2019), 2019, : 143 - 149
  • [9] Sentiment Analysis for Social Media: A Survey
    Patil, Harshali P.
    Atique, Mohammad
    [J]. 2015 2ND INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND SECURITY (ICISS), 2015, : 263 - 266
  • [10] A survey of sentiment analysis in social media
    Yue, Lin
    Chen, Weitong
    Li, Xue
    Zuo, Wanli
    Yin, Minghao
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 60 (02) : 617 - 663