Detecting political biases of named entities and hashtags on Twitter

被引:2
|
作者
Xiao, Zhiping [1 ]
Zhu, Jeffrey [1 ]
Wang, Yining [1 ]
Zhou, Pei [2 ]
Lam, Wen Hong [1 ]
Porter, Mason A. [3 ,4 ]
Sun, Yizhou [1 ]
机构
[1] Univ Calif Los Angeles, Dept Comp Sci, 580 Portola Pl, Los Angeles, CA 90095 USA
[2] Univ Southern Calif, Informat Sci Inst, Marina del Rey, Los Angeles, CA 90292 USA
[3] Univ Calif Los Angeles, Dept Math, 520 Portola Pl, Los Angeles, CA 90095 USA
[4] Santa Fe Inst, 1399 Hyde Pk Rd, Santa Fe, NM 87501 USA
基金
美国国家航空航天局; 美国国家科学基金会;
关键词
Political-polarity detection; Word embeddings; Multi-task learning; Adversarial training; Data sets; SENTIMENT ANALYSIS;
D O I
10.1140/epjds/s13688-023-00386-6
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Ideological divisions in the United States have become increasingly prominent in daily communication. Accordingly, there has been much research on political polarization, including many recent efforts that take a computational perspective. By detecting political biases in a text document, one can attempt to discern and describe its polarity. Intuitively, the named entities (i.e., the nouns and the phrases that act as nouns) and hashtags in text often carry information about political views. For example, people who use the term "pro-choice" are likely to be liberal and people who use the term "pro-life" are likely to be conservative. In this paper, we seek to reveal political polarities in social-media text data and to quantify these polarities by explicitly assigning a polarity score to entities and hashtags. Although this idea is straightforward, it is difficult to perform such inference in a trustworthy quantitative way. Key challenges include the small number of known labels, the continuous spectrum of political views, and the preservation of both a polarity score and a polarity-neutral semantic meaning in an embedding vector of words. To attempt to overcome these challenges, we propose the Polarity-aware Embedding Multi-task learning (PEM) model. This model consists of (1) a self-supervised context-preservation task, (2) an attention-based tweet-level polarity-inference task, and (3) an adversarial learning task that promotes independence between an embedding's polarity component and its semantic component. Our experimental results demonstrate that our PEM model can successfully learn polarity-aware embeddings that perform well at tweet-level and account-level classification tasks. We examine a variety of applications-including a study of spatial and temporal distributions of polarities and a comparison between tweets from Twitter and posts from Parler-and we thereby demonstrate the effectiveness of our PEM model. We also discuss important limitations of our work and encourage caution when applying the PEM model to real-world scenarios.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] Detecting political biases of named entities and hashtags on Twitter
    Zhiping Xiao
    Jeffrey Zhu
    Yining Wang
    Pei Zhou
    Wen Hong Lam
    Mason A. Porter
    Yizhou Sun
    [J]. EPJ Data Science, 12
  • [2] Gatekeeping Twitter: message diffusion in political hashtags
    Bastos, Marco Toledo
    Galdini Raimundo, Rafael Luis
    Travitzki, Rodrigo
    [J]. MEDIA CULTURE & SOCIETY, 2013, 35 (02) : 260 - 270
  • [3] Interpreting Reputation Through Frequent Named Entities in Twitter
    Bennacer, Nacera
    Bugiotti, Francesca
    Hewasinghage, Moditha
    Isaj, Suela
    Quercini, Gianluca
    [J]. WEB INFORMATION SYSTEMS ENGINEERING, WISE 2017, PT I, 2017, 10569 : 49 - 56
  • [4] Detecting Candidate Named Entities in Search Queries
    Alasiry, Areej
    Levene, Mark
    Poulovassilis, Alexandra
    [J]. SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1049 - 1050
  • [5] Detecting OOV Named Entities in Conversational Speech
    Kumar, Rohit
    Prasad, Rohit
    Ananthakrishnan, Sankaranarayanan
    Vembu, Aravind Namandi
    Stallard, Dave
    Tsakalidis, Stavros
    Natarajan, Prem
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2351 - 2354
  • [6] Detecting Named Entities and Relations in German Clinical Reports
    Roller, Roland
    Rethmeier, Nils
    Thomas, Philippe
    Huebner, Marc
    Uszkoreit, Hans
    Staeck, Oliver
    Budde, Klemens
    Halleck, Fabian
    Schmidt, Danilo
    [J]. LANGUAGE TECHNOLOGIES FOR THE CHALLENGES OF THE DIGITAL AGE, GSCL 2017, 2018, 10713 : 146 - 154
  • [7] A Frequent Named Entities-Based Approach for Interpreting Reputation in Twitter
    Seghouani, Nacera Bennacer
    Bugiotti, Francesca
    Hewasinghage, Moditha
    Isaj, Suela
    Quercini, Gianluca
    [J]. DATA SCIENCE AND ENGINEERING, 2018, 3 (02) : 86 - 100
  • [8] Discovering relations among named entities by detecting community structure
    He, Tingting
    Zhao, Junzhe
    Li, Jing
    [J]. PACLIC 20: PROCEEDINGS OF THE 20TH PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, 2006, : 42 - 48
  • [9] Detecting Biomedical Named Entities in COVID-19 Texts
    Raza, Shaina
    Schwartz, Brian
    [J]. WORKSHOP ON HEALTHCARE AI AND COVID-19, VOL 184, 2022, 184 : 117 - 126
  • [10] Detecting Bots on Russian Political Twitter
    Stukal, Denis
    Sanovich, Sergey
    Bonneau, Richard
    Tucker, Joshua A.
    [J]. BIG DATA, 2017, 5 (04) : 310 - 324