Harnessing Twitter 'Big Data' for Automatic Emotion Identification

Cited by: 162
Authors
Wang, Wenbo [1]
Chen, Lu [1]
Thirunarayan, Krishnaprasad [1]
Sheth, Amit P. [1]
Affiliations
[1] Wright State Univ, Kno E Sis Ctr, Dayton, OH 45435 USA
Funding
National Science Foundation (USA)
DOI
10.1109/SocialCom-PASSAT.2012.119
Chinese Library Classification (CLC) number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
User-generated content on Twitter (produced at an enormous rate of 340 million tweets per day) provides a rich source for gleaning people's emotions, which is necessary for deeper understanding of people's behaviors and actions. Extant studies on emotion identification lack comprehensive coverage of "emotional situations" because they use relatively small training datasets. To overcome this bottleneck, we have automatically created a large emotion-labeled dataset (of about 2.5 million tweets) by harnessing emotion-related hashtags available in the tweets. We have applied two different machine learning algorithms for emotion identification, to study the effectiveness of various feature combinations as well as the effect of the size of the training data on the emotion identification task. Our experiments demonstrate that a combination of unigrams, bigrams, sentiment/emotion-bearing words, and parts-of-speech information is most effective for gleaning emotions. The highest accuracy (65.57%) is achieved with a training set containing about 2 million tweets.
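The hashtag-based labeling idea in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the hashtag-to-emotion map, the rule of keeping only tweets with exactly one mapped emotion, and the function names `label_tweet` and `ngram_features` are all assumptions made for illustration. Only unigram and bigram features are shown; the lexicon and part-of-speech features mentioned in the abstract are omitted.

```python
import re

# Hypothetical hashtag-to-emotion map. The hashtag list actually used in the
# paper is not given in this record, so these entries are illustrative only.
EMOTION_HASHTAGS = {
    "happy": "joy",
    "joy": "joy",
    "sad": "sadness",
    "angry": "anger",
    "scared": "fear",
}

HASHTAG_RE = re.compile(r"#(\w+)")


def label_tweet(text):
    """Return (cleaned_text, emotion), or None if the tweet is unusable.

    Assumption: a tweet is kept only when its hashtags map to exactly one
    emotion; untagged or conflicting tweets are discarded.
    """
    tags = [t.lower() for t in HASHTAG_RE.findall(text)]
    emotions = {EMOTION_HASHTAGS[t] for t in tags if t in EMOTION_HASHTAGS}
    if len(emotions) != 1:
        return None

    # Strip the label-bearing hashtags so a classifier cannot simply read
    # the label back from the text; other hashtags are left in place.
    def drop_label_tag(match):
        return "" if match.group(1).lower() in EMOTION_HASHTAGS else match.group(0)

    cleaned = " ".join(HASHTAG_RE.sub(drop_label_tag, text).split())
    return cleaned, emotions.pop()


def ngram_features(text):
    """Return lowercase unigrams followed by bigrams joined with '_'."""
    tokens = text.lower().split()
    bigrams = [f"{a}_{b}" for a, b in zip(tokens, tokens[1:])]
    return tokens + bigrams
```

For example, `label_tweet("Got the job today #happy")` yields the text without the hashtag paired with the label `joy`, and the resulting text can be passed to `ngram_features` before training any standard classifier.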
Pages: 587-592
Page count: 6