On Document Representations for Detection of Biased News Articles

被引:2
|
作者
Cruz, Andre Ferreira [1 ]
Rocha, Gil [1 ]
Cardoso, Henrique Lopes [1 ]
机构
[1] Univ Porto, Fac Engn, LIACC, Porto, Portugal
基金
欧洲研究理事会;
关键词
Natural Language Processing; Deep Learning; Document Representation; Bias Detection; Hyperpartisan News;
D O I
10.1145/3341105.3374025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting bias in text is an increasingly relevant topic, given the information overload problem. Automating this task is crucial for our needs of quality news consumption. With this in mind, we explore modern deep learning approaches, including contextualized word embeddings and attention mechanisms, to compare the effects of different document representation choices. We design token-wise, sentence-wise and hierarchical document representations. Focusing on hyperpartisan news detection, we show that hierarchical attention mechanisms are able to better capture information at different levels of granularity (including intra and inter-sentence), which seems to be relevant for this task. With an accuracy of 82.5%, our best performing system is based on an ensemble of hierarchical attention networks with ELMo embeddings, achieving state-of-theart performance on the SemEval-2019 Task4 dataset.
引用
收藏
页码:892 / 899
页数:8
相关论文
共 50 条
  • [21] Multi-Document Summarization using Sentence Fusion for Indonesian News Articles
    Christie, Felicia
    Khodra, Masayu Leylia
    2016 INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS - CONCEPTS, THEORY AND APPLICATION (ICAICTA), 2016,
  • [22] Correlation Based Multi-Document Summarization for Scientific Articles and News Group
    Jayabharathy, J.
    Kanmani, S.
    Sivaranjani, N.
    PROCEEDINGS OF THE 2012 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI'12), 2012, : 1093 - 1099
  • [23] News articles similarity for automatic media bias detection in Polish news portals
    Baraniak, Katarzyna
    Sydow, Marcin
    PROCEEDINGS OF THE 2018 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2018, : 21 - 24
  • [24] Automatic trend detection: Time-biased document clustering
    Behpour, Sahar
    Mohammadi, Mohammadmahdi
    Albert, Mark V.
    Alam, Zinat S.
    Wang, Lingling
    Xiao, Ting
    KNOWLEDGE-BASED SYSTEMS, 2021, 220
  • [25] Topic-Centric Unsupervised Multi-Document Summarization of Scientific and News Articles
    Alambo, Amanuel
    Lohstroh, Cori
    Madaus, Erik
    Padhee, Swati
    Foster, Brandy
    Banerjee, Tanvi
    Thirunarayan, Krishnaprasad
    Raymer, Michael
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 591 - 596
  • [26] Building document graphs for multiple news articles summarization: An event-based approach
    Xu, Wei
    Yuan, Chunfa
    Li, Wenjie
    Wu, Mingli
    Wong, Kam-Fai
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 181 - +
  • [27] Multi-document summarization of news articles using an event-based framework
    Ou, Shiyan
    Khoo, Christopher S. G.
    Goh, Dion H.
    ASLIB PROCEEDINGS, 2006, 58 (03): : 197 - 217
  • [28] A Method for Similarity Detection in Vector Space by Summarizing News Articles
    Torun, Hakan
    Inner, A. Burak
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [29] Plagiarism detection in large sets of press agency news articles
    Kienreich, Wolfgang
    Granitzer, Michael
    Sabol, Vedran
    Klieber, Werner
    SEVENTEENTH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2006, : 181 - +
  • [30] Comprehending news articles: Updating the news
    Millis, KK
    Erdman, BJ
    POETICS, 1998, 25 (06) : 343 - 361