Annotation of a Corpus of Tweets for Sentiment Analysis

被引:1
|
作者
dos Santos, Allisfrank [1 ]
Barros Junior, Jorge Daniel [1 ]
Camargo, Heloisa de Arruda [1 ]
机构
[1] Fed Univ Sao Carlos UFSCar, Dept Comp Sci, Rodovia Washington Luis,Km 235,310-SP, BR-13565905 Sao Carlos, Brazil
关键词
Annotation; Emotion; Tweets; Corpus;
D O I
10.1007/978-3-319-99722-3_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article describes the process of creation and annotation of a tweets corpus for Sentiment Analysis at sentence level. The tweets were captured using the #masterchefbr hashtag, in a tool to acquire the public stream of tweets in real time and then annotated based on the six basic emotions (joy, surprise, fear, sadness, disgust, anger) commonly used in the literature. The neutral tag was adopted to annotate sentences where there was no expressed emotion. At the end of the process, the measure of disagreement between annotators reached a Kappa value of 0.42. Some experiments with the SVM algorithm (Support Vector Machine) have been performed with the objective of submitting the annotated corpus to a classification process, to better understand the Kappa value of the corpus. An accuracy of 52.9% has been obtained in the classification process when using both discordant and concordant text within the corpus.
引用
收藏
页码:294 / 302
页数:9
相关论文
共 50 条
  • [21] Sentiment Analysis of Tweets on Soda Taxes
    An, Ruopeng
    Yang, Yuyi
    Batcheller, Quinlan
    Zhou, Qianzi
    [J]. JOURNAL OF PUBLIC HEALTH MANAGEMENT AND PRACTICE, 2023, 29 (05): : 633 - 639
  • [22] Clustering Arabic Tweets for Sentiment Analysis
    Abuaiadah, Diab
    Rajendran, Dileep
    Jarrar, Mustafa
    [J]. 2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2017, : 449 - 456
  • [23] FinnSentiment: a Finnish social media corpus for sentiment polarity annotation
    Linden, Krister
    Jauhiainen, Tommi
    Hardwick, Sam
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2023, 57 (02) : 581 - 609
  • [24] FinnSentiment: a Finnish social media corpus for sentiment polarity annotation
    Krister Lindén
    Tommi Jauhiainen
    Sam Hardwick
    [J]. Language Resources and Evaluation, 2023, 57 : 581 - 609
  • [25] Proposal of a Sentiment Analysis Model in Tweets for Improvement of the Teaching - Learning Process in the Classroom Using a Corpus of Subjectivity
    Gutierrez Esparza, Guadalupe
    Padilla Diaz, Alejandro
    Canul-Reich, Juana
    Alejandro De-Luna, Carlos
    Ponce, Julio
    [J]. INTERNATIONAL JOURNAL OF COMBINATORIAL OPTIMIZATION PROBLEMS AND INFORMATICS, 2016, 7 (02): : 22 - 34
  • [26] Sentiment Analysis in Spanish Tweets: Some Experiments with Focus on Neutral Tweets
    Chiruzzo, Luis
    Etcheverry, Mathias
    Rosa, Aiala
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2020, (64): : 109 - 116
  • [27] A Scalable Approach for Sentiment Analysis of Turkish Tweets and Linking Tweets To News
    Kulcu, Sercan
    Dogdu, Erdogan
    [J]. 2016 IEEE TENTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2016, : 470 - 475
  • [28] A Microservices Based Architecture for the Sentiment Analysis of Tweets
    Di Martino, Beniamino
    Bombace, Vincenzo
    D'Angelo, Salvatore
    Esposito, Antonio
    [J]. ADVANCED INFORMATION NETWORKING AND APPLICATIONS, AINA-2022, VOL 3, 2022, 451 : 121 - 130
  • [29] Sentiment Analysis on Tweets for a Disease and Treatment Combination
    Meena, R.
    Bai, V. Thulasi
    Omana, J.
    [J]. COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING, 2020, 1108 : 1283 - 1293
  • [30] Arabic tweets sentiment analysis - a hybrid scheme
    Aldayel, Haifa K.
    Azmi, Aqil M.
    [J]. JOURNAL OF INFORMATION SCIENCE, 2016, 42 (06) : 782 - 797