Word Embeddings for the Analysis of Ideological Placement in Parliamentary Corpora

被引:74
|
作者
Rheault, Ludovic [1 ,2 ]
Cochrane, Christopher [1 ]
机构
[1] Univ Toronto, Dept Polit Sci, Toronto, ON, Canada
[2] Univ Toronto, Munk Sch Global Affairs & Publ Policy, Toronto, ON, Canada
关键词
word embeddings; parliamentary corpora; text as data; political ideology; natural language processing; POSITIONS; LANGUAGE; PARTIES; TEXT;
D O I
10.1017/pan.2019.26
中图分类号
D0 [政治学、政治理论];
学科分类号
0302 ; 030201 ;
摘要
Word embeddings, the coefficients from neural network models predicting the use of words in context, have now become inescapable in applications involving natural language processing. Despite a few studies in political science, the potential of this methodology for the analysis of political texts has yet to be fully uncovered. This paper introduces models of word embeddings augmented with political metadata and trained on large-scale parliamentary corpora from Britain, Canada, and the United States. We fit these models with indicator variables of the party affiliation of members of parliament, which we refer to as party embeddings. We illustrate how these embeddings can be used to produce scaling estimates of ideological placement and other quantities of interest for political research. To validate the methodology, we assess our results against indicators from the Comparative Manifestos Project, surveys of experts, and measures based on roll-call votes. Our findings suggest that party embeddings are successful at capturing latent concepts such as ideology, and the approach provides researchers with an integrated framework for studying political language.
引用
收藏
页码:112 / 133
页数:22
相关论文
共 50 条
  • [41] Bias in Word Embeddings
    Papakyriakopoulos, Orestis
    Hegelich, Simon
    Serrano, Juan Carlos Medina
    Marco, Fabienne
    FAT* '20: PROCEEDINGS OF THE 2020 CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, 2020, : 446 - 457
  • [42] Compressing Word Embeddings
    Andrews, Martin
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT IV, 2016, 9950 : 413 - 422
  • [43] Relational Word Embeddings
    Camacho-Collados, Jose
    Espinosa-Anke, Luis
    Schockaert, Steven
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3286 - 3296
  • [44] An Analysis of Common Word Digrams in Different Literary Romanian Corpora
    Hanu, Bogdan
    Vlad, Adriana
    Mitrea, Adrian
    Dragomir, Roxana
    2016 INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM 2016), 2016, : 43 - 46
  • [45] Biomedical Word Sense Disambiguation with Word Embeddings
    Antunes, Rui
    Matos, Sergio
    11TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, 2017, 616 : 273 - 279
  • [46] Lexical Density Analysis of Word Productions in Japanese English Using Acoustic Word Embeddings
    Ando, Shintaro
    Minematsu, Nobuaki
    Saito, Daisuke
    INTERSPEECH 2021, 2021, : 4433 - 4437
  • [47] Overcoming Poor Word Embeddings with Word Definitions
    Malon, Christopher
    10TH CONFERENCE ON LEXICAL AND COMPUTATIONAL SEMANTICS (SEM 2021), 2021, : 288 - 293
  • [48] Getting into bed with embeddings? A comparison of collocations and word embeddings for corpus-assisted discourse analysis
    Batchelor, Jordan
    APPLIED CORPUS LINGUISTICS, 2024, 4 (03):
  • [49] A Comparative Evaluation of Word Embeddings Techniques for Twitter Sentiment Analysis
    Kaibi, Ibrahim
    Nfaoui, El Habib
    Satori, Hassan
    2019 INTERNATIONAL CONFERENCE ON WIRELESS TECHNOLOGIES, EMBEDDED AND INTELLIGENT SYSTEMS (WITS), 2019,
  • [50] More than Bags of Words: Sentiment Analysis with Word Embeddings
    Rudkowsky, Elena
    Haselmayer, Martin
    Wastian, Matthias
    Jenny, Marcelo
    Emrich, Stefan
    Sedlmair, Michael
    COMMUNICATION METHODS AND MEASURES, 2018, 12 (2-3) : 140 - 157