Annotation of a Corpus of Tweets for Sentiment Analysis

Cited by: 1
Authors
dos Santos, Allisfrank [1]
Barros Junior, Jorge Daniel [1]
Camargo, Heloisa de Arruda [1]
Affiliations
[1] Fed Univ Sao Carlos UFSCar, Dept Comp Sci, Rodovia Washington Luis,Km 235,310-SP, BR-13565905 Sao Carlos, Brazil
Keywords
Annotation; Emotion; Tweets; Corpus
DOI
10.1007/978-3-319-99722-3_30
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This article describes the creation and annotation of a corpus of tweets for sentence-level Sentiment Analysis. The tweets were captured under the #masterchefbr hashtag with a tool that acquires the public stream of tweets in real time, and were then annotated with the six basic emotions commonly used in the literature (joy, surprise, fear, sadness, disgust, anger). A neutral tag was adopted for sentences that express no emotion. At the end of the process, inter-annotator agreement reached a Kappa value of 0.42. To better understand this Kappa value, experiments were performed in which the annotated corpus was submitted to classification with the SVM (Support Vector Machine) algorithm. Classification reached an accuracy of 52.9% when both the discordant and the concordant texts in the corpus were used.
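As an illustration of the agreement figure reported in the abstract, the following minimal sketch computes a Kappa statistic for two annotators. It assumes Cohen's kappa; the abstract does not say which Kappa variant or how many annotators were used. The function name `cohen_kappa` and the toy labels are hypothetical and are not taken from the corpus.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same items.

    Assumption: two annotators and nominal labels. The source abstract
    does not state which Kappa variant the authors computed.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label lists must be non-empty and of equal length")

    n = len(labels_a)

    # Observed agreement: fraction of items given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Chance agreement: for each label, the product of the two annotators'
    # marginal label frequencies, summed over all labels.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    all_labels = set(count_a) | set(count_b)
    expected = sum(count_a[c] * count_b[c] for c in all_labels) / (n * n)

    if expected == 1.0:
        # Both annotators used a single identical label everywhere.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical toy annotations using the tag set named in the abstract.
annotator_1 = ["joy", "joy", "anger", "neutral", "sadness", "neutral"]
annotator_2 = ["joy", "surprise", "anger", "neutral", "neutral", "neutral"]

kappa = cohen_kappa(annotator_1, annotator_2)
print(f"kappa = {kappa:.3f}")
```

Kappa corrects raw agreement for the agreement expected by chance, so a value of 0.42 indicates agreement well above chance but far from complete.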
Pages: 294-302
Page count: 9