Word Embeddings for the Analysis of Ideological Placement in Parliamentary Corpora

被引：74

作者：

Rheault, Ludovic ^{[1
,2
]}

Cochrane, Christopher ^{[1
]}

机构：

[1] Univ Toronto, Dept Polit Sci, Toronto, ON, Canada

[2] Univ Toronto, Munk Sch Global Affairs & Publ Policy, Toronto, ON, Canada

来源：

POLITICAL ANALYSIS | 2020年 / 28卷 / 01期

关键词：

word embeddings; parliamentary corpora; text as data; political ideology; natural language processing; POSITIONS; LANGUAGE; PARTIES; TEXT;

D O I：

10.1017/pan.2019.26

中图分类号：

D0 [政治学、政治理论];

学科分类号：

0302 ; 030201 ;

摘要：

Word embeddings, the coefficients from neural network models predicting the use of words in context, have now become inescapable in applications involving natural language processing. Despite a few studies in political science, the potential of this methodology for the analysis of political texts has yet to be fully uncovered. This paper introduces models of word embeddings augmented with political metadata and trained on large-scale parliamentary corpora from Britain, Canada, and the United States. We fit these models with indicator variables of the party affiliation of members of parliament, which we refer to as party embeddings. We illustrate how these embeddings can be used to produce scaling estimates of ideological placement and other quantities of interest for political research. To validate the methodology, we assess our results against indicators from the Comparative Manifestos Project, surveys of experts, and measures based on roll-call votes. Our findings suggest that party embeddings are successful at capturing latent concepts such as ideology, and the approach provides researchers with an integrated framework for studying political language.

引用

页码：112 / 133

页数：22

共 50 条

[41] Bias in Word Embeddings
Papakyriakopoulos, Orestis
Hegelich, Simon
Serrano, Juan Carlos Medina
Marco, Fabienne
FAT* '20: PROCEEDINGS OF THE 2020 CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, 2020, : 446 - 457
[42] Compressing Word Embeddings
Andrews, Martin
NEURAL INFORMATION PROCESSING, ICONIP 2016, PT IV, 2016, 9950 : 413 - 422
[43] Relational Word Embeddings
Camacho-Collados, Jose
Espinosa-Anke, Luis
Schockaert, Steven
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3286 - 3296
[44] An Analysis of Common Word Digrams in Different Literary Romanian Corpora
Hanu, Bogdan
Vlad, Adriana
Mitrea, Adrian
Dragomir, Roxana
2016 INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM 2016), 2016, : 43 - 46
[45] Biomedical Word Sense Disambiguation with Word Embeddings
Antunes, Rui
Matos, Sergio
11TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, 2017, 616 : 273 - 279
[46] Lexical Density Analysis of Word Productions in Japanese English Using Acoustic Word Embeddings
Ando, Shintaro
Minematsu, Nobuaki
Saito, Daisuke
INTERSPEECH 2021, 2021, : 4433 - 4437
[47] Overcoming Poor Word Embeddings with Word Definitions
Malon, Christopher
10TH CONFERENCE ON LEXICAL AND COMPUTATIONAL SEMANTICS (SEM 2021), 2021, : 288 - 293
[48] Getting into bed with embeddings? A comparison of collocations and word embeddings for corpus-assisted discourse analysis
Batchelor, Jordan
APPLIED CORPUS LINGUISTICS, 2024, 4 (03):
[49] A Comparative Evaluation of Word Embeddings Techniques for Twitter Sentiment Analysis
Kaibi, Ibrahim
Nfaoui, El Habib
Satori, Hassan
2019 INTERNATIONAL CONFERENCE ON WIRELESS TECHNOLOGIES, EMBEDDED AND INTELLIGENT SYSTEMS (WITS), 2019,
[50] More than Bags of Words: Sentiment Analysis with Word Embeddings
Rudkowsky, Elena
Haselmayer, Martin
Wastian, Matthias
Jenny, Marcelo
Emrich, Stefan
Sedlmair, Michael
COMMUNICATION METHODS AND MEASURES, 2018, 12 (2-3) : 140 - 157

← 1 2 3 4 5 →