Bridging social media via distant supervision

被引:2
|
作者
Magdy, Walid [1 ]
Sajjad, Hassan [1 ]
El-Ganainy, Tarek [1 ]
Sebastiani, Fabrizio [1 ]
机构
[1] Hamad Bin Khalifa Univ, Qatar Comp Res Inst, Doha, Qatar
关键词
Twitter; YouTube; Tweet classification; Distant supervision;
D O I
10.1007/s13278-015-0275-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Microblog classification has received a lot of attention in recent years. Different classification tasks have been investigated, most of them focusing on classifying microblogs into a small number of classes (five or less) using a training set of manually annotated tweets. Unfortunately, labelling data is tedious and expensive, and finding tweets that cover all the classes of interest is not always straightforward, especially when some of the classes do not frequently arise in practice. In this paper, we study an approach to tweet classification based on distant supervision, whereby we automatically transfer labels from one social medium to another for a single-label multi-class classification task. In particular, we apply YouTube video classes to tweets linking to these videos. This provides for free a virtually unlimited number of labelled instances that can be used as training data. The classification experiments we have run show that training a tweet classifier via these automatically labelled data achieves substantially better performance than training the same classifier with a limited amount of manually labelled data; this is advantageous, given that the automatically labelled data come at no cost. Further investigation of our approach shows its robustness when applied with different numbers of classes and across different languages.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [31] Website replica detection with distant supervision
    Carvalho, Cristiano
    de Moura, Edleno Silva
    Veloso, Adriano
    Ziviani, Nivio
    INFORMATION RETRIEVAL JOURNAL, 2018, 21 (04): : 253 - 272
  • [32] Removing Noisy Mentions for Distant Supervision
    Intxaurrondo, Ander
    Surdeanu, Mihai
    Lopez de lacalle, Oier
    Agirre, Eneko
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2013, (51): : 41 - 48
  • [33] Collective Supervision of Topic Models for Predicting Surveys with Social Media
    Benton, Adrian
    Paul, Michael J.
    Hancock, Braden
    Dredze, Mark
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2892 - 2898
  • [34] Event Extraction Using Distant Supervision
    Reschke, Kevin
    Jankowiak, Martin
    Surdeanu, Mihai
    Manning, Christopher D.
    Jurafsky, Daniel
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 4527 - 4531
  • [35] Cyberbullying Via Social Media
    Whittaker, Elizabeth
    Kowalski, Robin M.
    JOURNAL OF SCHOOL VIOLENCE, 2015, 14 (01) : 11 - 29
  • [36] Ideenmanagement via Social Media
    Benjamin Morgenstern
    Matthias Nolden
    Wirtschaftsinformatik & Management, 2013, 5 (6) : 76 - 81
  • [37] Distant Supervision for Mental Health Management in Social Media: Suicide Risk Classification System Development Study (vol 23, e26119, 2021)
    Fu, Guanghui
    Song, Changwei
    Li, Jianqiang
    Ma, Yue
    Chen, Pan
    Wang, Ruiqian
    Yang, Bing Xiang
    Huang, Zhisheng
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (09)
  • [38] Bridging Unsupervised and Supervised Depth from Focus via All-in-Focus Supervision
    Wang, Ning-Hsu
    Wang, Ren
    Liu, Yu-Lun
    Huang, Yu-Hao
    Chang, Yu-Lin
    Chen, Chia-Ping
    Jou, Kevin
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12601 - 12611
  • [39] Bonding, bridging, and linking social capital and social media use: How hyperlocal social media platforms serve as a conduit to access and activate bridging and linking ties in a time of crisis
    Courtney Page-Tan
    Natural Hazards, 2021, 105 : 2219 - 2240
  • [40] Bonding, bridging, and linking social capital and social media use: How hyperlocal social media platforms serve as a conduit to access and activate bridging and linking ties in a time of crisis
    Page-Tan, Courtney
    NATURAL HAZARDS, 2021, 105 (02) : 2219 - 2240