Multilingual Sentiment Analysis using Emoticons and Keywords

被引:18
|
作者
Solakidis, Georgios S. [1 ]
Vavliakis, Konstantinos N.
Mitkas, Pericles A.
机构
[1] Aristotle Univ Thessaloniki, Dept Elect & Comp Engn, GR-54006 Thessaloniki, Greece
关键词
Sentiment Analysis; Greek; Forum; Semi Supervised Learning; Automatic Collection of Training Data; Emoticons; Keywords;
D O I
10.1109/WI-IAT.2014.86
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays the World Wide Web has evolved into a leading communication channel and information exchange medium. Especially after the introduction of the so-called web 2.0 and the explosion that followed regarding user generated content, the amount of data available over the internet has attracted the interest of both the scientific and business community. Their efforts focus on identifying the inner structures of data and the knowledge that can be derived by analyzing them. Web 2.0 is the subject of study and research in a number of areas. One of these areas is sentiment analysis, where the main goal is to study and draw conclusions about subjectivity, polarity and the feeling that is expressed in user generated content, which mainly consist of free text documents. The goal of this paper is to apply sentiment analysis on multilingual data, focusing on documents written in Greek. We developed an integrated framework that accepts user generated documents and then identifies the polarity of the text (neutral, negative or positive) and the sentiment expressed through it (joy, love, anger or sadness). We followed a semi-supervised approach which led to the development of two techniques for the automatic collection of training data without any human intervention. Our approach involves the detection and use of self-defining features that are available within the data. We take into account two emotionally rich features: a) emoticons and b) lists of emotionally intense keywords. These features are evaluated on data coming from a popular forum, using various classifiers and feature vectors. Our experimental results point to various conclusions about the effectiveness, advantages and limitations of applying such methods on Greek data. Using keywords we achieved 90% mean accuracy on identifying the subjectivity level and 93% on correctly identifying the polarity level, whereas using emoticons the mean accuracy for each of these levels was 74% and 77% respectively.
引用
收藏
页码:102 / 109
页数:8
相关论文
共 50 条
  • [1] Alternative method sentiment analysis using emojis and emoticons
    Surikov, Anatoliy
    Egorova, Evgeniia
    [J]. 9TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE, YSC2020, 2020, 178 : 182 - 193
  • [2] Building Corpus with Emoticons for Sentiment Analysis
    Li, Changliang
    Wang, Yongguan
    Li, Changsong
    Qi, Ji
    Liu, Pengyuan
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2018, PT II, 2018, 11109 : 309 - 318
  • [3] A Sentiment Analysis Method Based on Emoticons and Sentiment Words
    Gao, Baolin
    Zhou, Zhiguo
    Zou, Mingxue
    Deng, Chunyan
    [J]. PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ELECTRONIC, MECHANICAL, INFORMATION AND MANAGEMENT SOCIETY (EMIM), 2016, 40 : 1380 - 1383
  • [4] Using SentiWordNet for multilingual sentiment analysis
    Denecke, Kerstin
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, VOLS 1 AND 2, 2008, : 427 - 432
  • [5] Analysis of Tweets with Emoticons for Sentiment Detection Using Classification Techniques
    Kaur, Ravneet
    Majumdar, Ayush
    Sharma, Priya
    Tiple, Bhavana
    [J]. DISTRIBUTED COMPUTING AND INTELLIGENT TECHNOLOGY, ICDCIT 2023, 2023, 13776 : 208 - 223
  • [6] Aspect Extraction Approach for Sentiment Analysis Using Keywords
    Ayub, Nafees
    Talib, Muhammad Ramzan
    Hanif, Muhammad Kashif
    Awais, Muhammad
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (03): : 6879 - 6892
  • [7] Role of Emoticons for Multidimensional Sentiment Analysis of Twitter
    Yamamoto, Yuki
    Kumamoto, Tadahiko
    Nadamoto, Akiyo
    [J]. 16TH INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES (IIWAS 2014), 2014, : 107 - 115
  • [8] Sentiment Analysis on Tweets with Punctuations, Emoticons, and Negations
    Cureg, Miks Q.
    De La Cruz, Juan Aurel D.
    Solomon, Juan Carlos A.
    Saharkhiz, Aresh T.
    Balan, Ariel Kelly D.
    Samonte, Mary Jane C.
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEMS (ICISS 2019), 2019, : 266 - 270
  • [9] SentiReview: Sentiment Analysis based on Text and Emoticons
    Yadav, Payal
    Pandya, Dhatri
    [J]. 2017 INTERNATIONAL CONFERENCE ON INNOVATIVE MECHANISMS FOR INDUSTRY APPLICATIONS (ICIMIA), 2017, : 467 - 472
  • [10] Microblog Sentiment Analysis Model Based on Emoticons
    Pei, Shaojie
    Zhang, Lumin
    Li, Aiping
    [J]. WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2014, PT II, 2014, 8710 : 127 - 135