On Document Representations for Detection of Biased News Articles

被引:2
|
作者
Cruz, Andre Ferreira [1 ]
Rocha, Gil [1 ]
Cardoso, Henrique Lopes [1 ]
机构
[1] Univ Porto, Fac Engn, LIACC, Porto, Portugal
基金
欧洲研究理事会;
关键词
Natural Language Processing; Deep Learning; Document Representation; Bias Detection; Hyperpartisan News;
D O I
10.1145/3341105.3374025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting bias in text is an increasingly relevant topic, given the information overload problem. Automating this task is crucial for our needs of quality news consumption. With this in mind, we explore modern deep learning approaches, including contextualized word embeddings and attention mechanisms, to compare the effects of different document representation choices. We design token-wise, sentence-wise and hierarchical document representations. Focusing on hyperpartisan news detection, we show that hierarchical attention mechanisms are able to better capture information at different levels of granularity (including intra and inter-sentence), which seems to be relevant for this task. With an accuracy of 82.5%, our best performing system is based on an ensemble of hierarchical attention networks with ELMo embeddings, achieving state-of-theart performance on the SemEval-2019 Task4 dataset.
引用
收藏
页码:892 / 899
页数:8
相关论文
共 50 条
  • [1] Document Level Sentiment Analysis from News Articles
    Shirsat, Vishal S.
    Jagdale, Rajkumar S.
    Deshmukh, S. N.
    2017 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2017,
  • [2] Using Document Embeddings for Background Linking of News Articles
    Khloponin, Pavel
    Kosseim, Leila
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2021), 2021, 12801 : 317 - 329
  • [3] Single document keyword extraction for Internet news articles
    Bracewell, David B.
    Yan, Jiajun
    Ren, Fuji
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2008, 4 (04): : 905 - 913
  • [4] Annotating and Analyzing Biased Sentences in News Articles using Crowdsourcing
    Lim, Sora
    Jatowt, Adam
    Farber, Michael
    Yoshikawa, Masatoshi
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 1478 - 1484
  • [5] Discovering Biased News Articles Leveraging Multiple Human Annotations
    Lazaridou, Konstantina
    Loeser, Alexander
    Mestre, Maria
    Naumann, Felix
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 1268 - 1277
  • [6] Fake News Detection with Generated Comments for News Articles
    Yanagi, Yuta
    Orihara, Ryohei
    Sei, Yuichi
    Tahara, Yasuyuki
    Ohsuga, Akihiko
    2020 IEEE 24TH INTERNATIONAL CONFERENCE ON INTELLIGENT ENGINEERING SYSTEMS (INES 2020), 2020, : 85 - 89
  • [7] Topic Detection and Tracking in News Articles
    Patel, Sagar
    Suthar, Sanket
    Patel, Sandip
    Patel, Nehal
    Patel, Arpita
    INFORMATION AND COMMUNICATION TECHNOLOGY FOR INTELLIGENT SYSTEMS (ICTIS 2017) - VOL 2, 2018, 84 : 420 - 426
  • [8] Event Detection from News Articles
    Sayyadi, Hassan
    Sahraei, Alireza
    Abolhassani, Hassan
    ADVANCES IN COMPUTER SCIENCE AND ENGINEERING, 2008, 6 : 981 - 984
  • [9] Characterization and Early Detection of Evergreen News Articles
    Liao, Yiming
    Wang, Shuguang
    Han, Eui-Hong
    Lee, Jongwuk
    Lee, Dongwon
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT III, 2020, 11908 : 552 - 568
  • [10] Towards an Ontology for Propaganda Detection in News Articles
    Hamilton, Kyle
    SEMANTIC WEB: ESWC 2021 SATELLITE EVENTS, 2021, 12739 : 230 - 241