Annotation of a Corpus of Tweets for Sentiment Analysis

被引:1
|
作者
dos Santos, Allisfrank [1 ]
Barros Junior, Jorge Daniel [1 ]
Camargo, Heloisa de Arruda [1 ]
机构
[1] Fed Univ Sao Carlos UFSCar, Dept Comp Sci, Rodovia Washington Luis,Km 235,310-SP, BR-13565905 Sao Carlos, Brazil
关键词
Annotation; Emotion; Tweets; Corpus;
D O I
10.1007/978-3-319-99722-3_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article describes the process of creation and annotation of a tweets corpus for Sentiment Analysis at sentence level. The tweets were captured using the #masterchefbr hashtag, in a tool to acquire the public stream of tweets in real time and then annotated based on the six basic emotions (joy, surprise, fear, sadness, disgust, anger) commonly used in the literature. The neutral tag was adopted to annotate sentences where there was no expressed emotion. At the end of the process, the measure of disagreement between annotators reached a Kappa value of 0.42. Some experiments with the SVM algorithm (Support Vector Machine) have been performed with the objective of submitting the annotated corpus to a classification process, to better understand the Kappa value of the corpus. An accuracy of 52.9% has been obtained in the classification process when using both discordant and concordant text within the corpus.
引用
收藏
页码:294 / 302
页数:9
相关论文
共 50 条
  • [31] Sentiment Analysis of Live Tweets After Elections
    Baid, Palak
    Chaplot, Neelam
    [J]. EMERGING TRENDS IN EXPERT APPLICATIONS AND SECURITY, 2019, 841 : 307 - 314
  • [32] Movie sentiment analysis based on public tweets
    Blatnik, Aljaz
    Jarm, Kaja
    Meza, Marko
    [J]. ELEKTROTEHNISKI VESTNIK-ELECTROCHEMICAL REVIEW, 2014, 81 (04): : 160 - 166
  • [33] Detection of Fake Tweets Using Sentiment Analysis
    Monica C.
    Nagarathna N.
    [J]. SN Computer Science, 2020, 1 (2)
  • [34] Sentiment Analysis of English Tweets Using RapidMiner
    Tripathi, Pragya
    Vishwakarma, Santosh Kr
    Lala, Ajay
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 668 - 672
  • [35] A Fine Grain Sentiment Analysis with Semantics in Tweets
    Barba Gonzalez, Cristobal
    Garcia-Nieto, Jose
    Navas-Delgado, Ismael
    Aldana-Montes, Jose F.
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2016, 3 (06): : 22 - 28
  • [36] Sentiment analysis for the tweets that contain the word "earthquake"
    Pirnau, Mironela
    [J]. PROCEEDINGS OF THE 2018 10TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTERS AND ARTIFICIAL INTELLIGENCE (ECAI), 2018,
  • [37] Sentiment analysis of tweets on social security and medicare
    Chakravarty, Unmesh Kumar
    Arifuzzaman, Shaikh
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
  • [38] Sentiment Analysis of Tweets Including Emoji Data
    LeCompte, Travis
    Chen, Jianhua
    [J]. PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2017, : 793 - 798
  • [39] Piegas: A System for Sentiment Analysis of Tweets in Portuguese
    Grandin, P.
    Adan, J. M.
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2016, 14 (07) : 3467 - 3473