Bridging social media via distant supervision

被引:2
|
作者
Magdy, Walid [1 ]
Sajjad, Hassan [1 ]
El-Ganainy, Tarek [1 ]
Sebastiani, Fabrizio [1 ]
机构
[1] Hamad Bin Khalifa Univ, Qatar Comp Res Inst, Doha, Qatar
关键词
Twitter; YouTube; Tweet classification; Distant supervision;
D O I
10.1007/s13278-015-0275-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Microblog classification has received a lot of attention in recent years. Different classification tasks have been investigated, most of them focusing on classifying microblogs into a small number of classes (five or less) using a training set of manually annotated tweets. Unfortunately, labelling data is tedious and expensive, and finding tweets that cover all the classes of interest is not always straightforward, especially when some of the classes do not frequently arise in practice. In this paper, we study an approach to tweet classification based on distant supervision, whereby we automatically transfer labels from one social medium to another for a single-label multi-class classification task. In particular, we apply YouTube video classes to tweets linking to these videos. This provides for free a virtually unlimited number of labelled instances that can be used as training data. The classification experiments we have run show that training a tweet classifier via these automatically labelled data achieves substantially better performance than training the same classifier with a limited amount of manually labelled data; this is advantageous, given that the automatically labelled data come at no cost. Further investigation of our approach shows its robustness when applied with different numbers of classes and across different languages.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [41] Chemical-induced disease relation extraction via attention-based distant supervision
    Gu, Jinghang
    Sun, Fuqing
    Qian, Longhua
    Zhou, Guodong
    BMC BIOINFORMATICS, 2019, 20 (1)
  • [42] Distant Supervision for Relation Extraction via Piecewise Attention and Bag-Level Contextual Inference
    Phi, Van-Thuy
    Santoso, Joan
    Tran, Van-Hien
    Shindo, Hiroyuki
    Shimbo, Masashi
    Matsumoto, Yuji
    IEEE ACCESS, 2019, 7 : 103570 - 103582
  • [43] Chemical-induced disease relation extraction via attention-based distant supervision
    Jinghang Gu
    Fuqing Sun
    Longhua Qian
    Guodong Zhou
    BMC Bioinformatics, 20
  • [44] Joint Representation Learning of Cross-lingual Words and Entities via Attentive Distant Supervision
    Cao, Yixin
    Hou, Lei
    Li, Juanzi
    Liu, Zhiyuan
    Li, Chengjiang
    Chen, Xu
    Dong, Tiansi
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 227 - 237
  • [45] Distant supervision and knowledge transfer for domain-oriented text classification in online social networks
    Khodorchenko, Maria
    8TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE ON COMPUTATIONAL SCIENCE, YSC2019, 2019, 156 : 166 - 175
  • [46] Bridging the gap between social media and behavioral brand loyalty
    Yoshida, Masayuki
    Gordon, Brian S.
    Nakazawa, Makoto
    Shibuya, Shigeki
    Fujiwara, Naoyuki
    ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2018, 28 : 208 - 218
  • [47] Distant speech recognition:: Bridging the gaps
    McDonough, John
    Woelfel, Matthias
    2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS, 2008, : 109 - +
  • [48] Bridging the Social Media Usage Gap from Old to New: An Elderly Media Interpersonal and Social Research in Taiwan
    Lin, Shih-Hsun
    Chou, Wen Huei
    HUMAN CENTERED DESIGN (HCD), 2011, 6776 : 547 - 555
  • [49] Curriculum learning for distant supervision relation extraction
    Liu Qiongxin
    Wang Peng
    Wang Jiasheng
    Ma Jing
    JOURNAL OF WEB SEMANTICS, 2020, 61-62 (61-62):
  • [50] Distant Supervision for Relation Extraction with Type Constraint
    Ye, Yuxin
    Zhu, Zhaolong
    Ouyang, Dantong
    Cui, Xianji
    JOURNAL OF INTERNET TECHNOLOGY, 2014, 15 (07): : 1133 - 1142