Harnessing Twitter 'Big Data' for Automatic Emotion Identification

被引:162
|
作者
Wang, Wenbo [1 ]
Chen, Lu [1 ]
Thirunarayan, Krishnaprasad [1 ]
Sheth, Amit P. [1 ]
机构
[1] Wright State Univ, Kno E Sis Ctr, Dayton, OH 45435 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/SocialCom-PASSAT.2012.119
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
User generated content on Twitter (produced at an enormous rate of 340 million tweets per day) provides a rich source for gleaning people's emotions, which is necessary for deeper understanding of people's behaviors and actions. Extant studies on emotion identification lack comprehensive coverage of "emotional situations" because they use relatively small training datasets. To overcome this bottleneck, we have automatically created a large emotion-labeled dataset (of about 2.5 million tweets) by harnessing emotion-related hashtags available in the tweets. We have applied two different machine learning algorithms for emotion identification, to study the effectiveness of various feature combinations as well as the effect of the size of the training data on the emotion identification task. Our experiments demonstrate that a combination of unigrams, bigrams, sentiment/emotion-bearing words, and parts-of-speech information is most effective for gleaning emotions. The highest accuracy (65.57%) is achieved with a training data containing about 2 million tweets.
引用
收藏
页码:587 / 592
页数:6
相关论文
共 50 条
  • [21] Harnessing big data for identifying atrial fibrillation
    Altman, Robert K.
    Steinberg, Jonathan S.
    [J]. EUROPACE, 2019, 21 (09): : 1283 - 1283
  • [22] Harnessing the Potential of Big Data in Romanian Healthcare
    Ianculescu, Marilena
    Alexandru, Adriana
    Gheorghe-Moisii, Maria
    [J]. 2017 5TH INTERNATIONAL SYMPOSIUM ON ELECTRICAL AND ELECTRONICS ENGINEERING (ISEEE), 2017,
  • [23] Harnessing Big Data to Help Stop Diabetes
    不详
    [J]. AMERICAN JOURNAL OF MANAGED CARE, 2015, 21 : 1 - +
  • [24] Harnessing Big Data: Strategic Insights for IT Management
    Siddiqui, Asfar H.
    Swetha, V. P.
    Chowdhary, Harish
    Krishna, R. V. V.
    Muniyandy, Elangovan
    Maguluri, Lakshmana Phaneendra
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (07) : 912 - 921
  • [25] Harnessing the Power of Big Data in Biological Research
    McCulloch, Eve S.
    [J]. BIOSCIENCE, 2013, 63 (09) : 715 - 716
  • [26] Automatic identification of Irony: a Case Study on Twitter
    Tavares Alves, Yulli Dias
    Sanches, Ana Luiza
    Dalip, Daniel H.
    Silva, Ismael S.
    [J]. WEBMEDIA 2019: PROCEEDINGS OF THE 25TH BRAZILLIAN SYMPOSIUM ON MULTIMEDIA AND THE WEB, 2019, : 253 - 256
  • [27] Harnessing big data for a multifunctional theory of the firm
    Roth, Steffen
    Schwede, Peter
    Valentinov, Vladislav
    Perez-Valls, Miguel
    Kaivo-oja, Jari
    [J]. EUROPEAN MANAGEMENT JOURNAL, 2020, 38 (01) : 54 - 61
  • [28] Automatic Identification and Classification of Misogynistic Language on Twitter
    Anzovino, Maria
    Fersini, Elisabetta
    Rosso, Paolo
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2018), 2018, 10859 : 57 - 64
  • [29] Workshop 5 report: Harnessing big data
    Sanchez-Martinez, Gabriel E.
    Munizaga, Marcela
    [J]. RESEARCH IN TRANSPORTATION ECONOMICS, 2016, 59 : 236 - 241
  • [30] Analyzing Twitter Sentiments Through Big Data
    Kumar, Monu
    Bala, Anju
    [J]. PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 2628 - 2631