On Document Representations for Detection of Biased News Articles

被引:2
|
作者
Cruz, Andre Ferreira [1 ]
Rocha, Gil [1 ]
Cardoso, Henrique Lopes [1 ]
机构
[1] Univ Porto, Fac Engn, LIACC, Porto, Portugal
基金
欧洲研究理事会;
关键词
Natural Language Processing; Deep Learning; Document Representation; Bias Detection; Hyperpartisan News;
D O I
10.1145/3341105.3374025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting bias in text is an increasingly relevant topic, given the information overload problem. Automating this task is crucial for our needs of quality news consumption. With this in mind, we explore modern deep learning approaches, including contextualized word embeddings and attention mechanisms, to compare the effects of different document representation choices. We design token-wise, sentence-wise and hierarchical document representations. Focusing on hyperpartisan news detection, we show that hierarchical attention mechanisms are able to better capture information at different levels of granularity (including intra and inter-sentence), which seems to be relevant for this task. With an accuracy of 82.5%, our best performing system is based on an ensemble of hierarchical attention networks with ELMo embeddings, achieving state-of-theart performance on the SemEval-2019 Task4 dataset.
引用
收藏
页码:892 / 899
页数:8
相关论文
共 50 条
  • [11] Image Enhanced Event Detection in News Articles
    Tong, Meihan
    Wang, Shuai
    Cao, Yixin
    Xu, Bin
    Li, Juaizi
    Hou, Lei
    Chua, Tat-Seng
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9040 - 9047
  • [12] Improving Multi-label Document Classification of Czech News Articles
    Lehecka, Jan
    Svec, Jan
    TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 307 - 315
  • [13] Learning De-biased Representations with Biased Representations
    Bahng, Hyojin
    Chun, Sanghyuk
    Yun, Sangdoo
    Choo, Jaegul
    Oh, Seong Joon
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [14] Collectively biased representations of the past: Ingroup Bias in Wikipedia articles about intergroup conflicts
    Oeberst, Aileen
    von der Beck, Ina
    Matschke, Christina
    Ihme, Toni Alexander
    Cress, Ulrike
    BRITISH JOURNAL OF SOCIAL PSYCHOLOGY, 2020, 59 (04) : 791 - 818
  • [15] Text Augmentation Techniques for Document Vector Generation from Russian News Articles
    Aminoff, Christoffer
    Romanenko, Aleksei
    Kosomaa, Onni
    Vankka, Jouko
    INFORMATION AND SOFTWARE TECHNOLOGIES, ICIST 2018, 2018, 920 : 571 - 586
  • [16] NewsEmbed: Modeling News through Pre-trained Document Representations
    Liu, Jialu
    Liu, Tianqi
    Yu, Cong
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 1076 - 1086
  • [17] Duplication Detection in News Articles Based on Big Data
    Lu, Lu
    Wang, Pengcheng
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2019, : 15 - 19
  • [18] Online Near-Duplicate Detection of News Articles
    Rodier, Simon
    Carter, Dave
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 1242 - 1249
  • [19] Automatic Detection of News Articles of Interest to Regional Communities
    Swezey, Robin M. E.
    Sano, Hiroyuki
    Shiramatsu, Shun
    Ozono, Tadachika
    Shintani, Toramatsu
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2012, 12 (06): : 99 - 106
  • [20] Detection of overlapping text in articles from Anaesthesia News
    Malik, M. J.
    Yentis, S. M.
    ANAESTHESIA, 2012, 67 : 80 - 80