INFLUENCING ELECTIONS WITH STATISTICS: TARGETING VOTERS WITH LOGISTIC REGRESSION TREES

被引:4
|
作者
Rusch, Thomas [1 ]
Lee, Ilro [3 ]
Hornik, Kurt [2 ]
Jank, Wolfgang [4 ]
Zeileis, Achim [5 ]
机构
[1] WU Vienna Univ Econ & Business, Ctr Empir Res Methods, A-1090 Vienna, Austria
[2] WU Vienna Univ Econ & Business, Inst Stat & Math, Dept Finance Accounting & Stat, A-1090 Vienna, Austria
[3] Univ New S Wales, Australian Sch Business, Sch Management, Sydney, NSW 2052, Australia
[4] Univ S Florida, Dept Informat Syst & Decis Sci, Coll Business, Tampa, FL 33620 USA
[5] Univ Innsbruck, Dept Stat, Fac Econ & Stat, A-6020 Innsbruck, Austria
来源
ANNALS OF APPLIED STATISTICS | 2013年 / 7卷 / 03期
关键词
Campaigning; classification tree; get-out-the-vote; model tree; political marketing; voter identification; voter segmentation; voter profile; microtargeting; PARTY MOBILIZATION; TURNOUT; FIELD; PARTISAN; PARTICIPATION; CAMPAIGNS; INFERENCE; MODEL;
D O I
10.1214/13-AOAS648
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In political campaigning substantial resources are spent on voter mobilization, that is, on identifying and influencing as many people as possible to vote. Campaigns use statistical tools for deciding whom to target ("microtargeting"). In this paper we describe a nonpartisan campaign that aims at increasing overall turnout using the example of the 2004 US presidential election. Based on a real data set of 19,634 eligible voters from Ohio, we introduce a modern statistical framework well suited for carrying out the main tasks of voter targeting in a single sweep: predicting an individual's turnout (or support) likelihood for a particular cause, party or candidate as well as data-driven voter segmentation. Our framework, which we refer to as LORET (for LOgistic REgression Trees), contains standard methods such as logistic regression and classification trees as special cases and allows for a synthesis of both techniques. For our case study, we explore various LORET models with different regressors in the logistic model components and different partitioning variables in the tree components; we analyze them in terms of their predictive accuracy and compare the effect of using the full set of available variables against using only a limited amount of information. We find that augmenting a standard set of variables (such as age and voting history) with additional predictor variables (such as the household composition in terms of party affiliation) clearly improves predictive accuracy. We also find that LORET models based on tree induction beat the unpartitioned models. Furthermore, we illustrate how voter segmentation arises from our framework and discuss the resulting profiles from a targeting point of view.
引用
收藏
页码:1612 / 1639
页数:28
相关论文
共 50 条
  • [31] A comparison of regression trees, logistic regression, generalized additive models, and multivariate adaptive regression splines for predicting AMI mortality
    Austin, Peter C.
    STATISTICS IN MEDICINE, 2007, 26 (15) : 2937 - 2957
  • [32] Prediction of outcome in rtPA treated stroke patients: Comparison of logistic regression and classification and regression trees models
    Mutlu, Gurkan
    Mary, Jean-Yues
    Rosso, Charlotte
    Deltour, Sandrine
    Crozier, Sophie
    Leger, Anne
    Pires, Christine
    Samson, Yves
    NEUROLOGY, 2008, 70 (11) : A357 - A357
  • [33] Where do voters appear at ballots? Factors influencing the geographical distribution of voter turnout in Slovak parliamentary elections
    Kevicky, Dominik
    Danek, Petr
    GEOGRAFICKY CASOPIS-GEOGRAPHICAL JOURNAL, 2020, 72 (01): : 5 - 25
  • [34] Analysis 3: Who made the call? Classification based on logistic regression and trees
    Farruggia, J
    Macdonald, PDM
    Viveros-Aguilera, R
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2000, 28 (01): : 197 - 205
  • [35] Application of classification trees and logistic regression to determine factors responsible for lamb mortality
    Piwczynski, Dariusz
    Sitkowska, Beata
    Wisniewska, Ewa
    SMALL RUMINANT RESEARCH, 2012, 103 (2-3) : 225 - 231
  • [36] Comparison of four expert elicitation methods: For Bayesian logistic regression and classification trees
    O'Leary, R. A.
    Mengersen, K.
    Murray, J. V.
    Choy, S. Low
    18TH WORLD IMACS CONGRESS AND MODSIM09 INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION: INTERFACING MODELLING AND SIMULATION WITH MATHEMATICAL AND COMPUTATIONAL SCIENCES, 2009, : 4276 - 4282
  • [37] PREDICTION OF CANNABIS AND COCAINE USE IN ADOLESCENCE USING DECISION TREES AND LOGISTIC REGRESSION
    Gervilla, Elena
    Palmer, Alfonso
    EUROPEAN JOURNAL OF PSYCHOLOGY APPLIED TO LEGAL CONTEXT, 2010, 2 (01): : 19 - 35
  • [38] A comparison of two tools for analyzing linguistic data: logistic regression and decision trees
    Eddington, David
    ITALIAN JOURNAL OF LINGUISTICS, 2010, 22 (02): : 265 - 286
  • [39] Phi-divergence statistics for testing linear hypotheses in logistic regression models
    Menendez, Maria Luisa
    Pardo, Julio Angel
    Pardo, Leandro
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2008, 37 (04) : 494 - 507
  • [40] Advanced Statistics: Multiple Logistic Regression, Cox Proportional Hazards, and Propensity Scores
    Cioci, Alessia C.
    Cioci, Anthony L.
    Mantero, Alejandro M. A.
    Parreco, Joshua P.
    Yeh, D. Dante
    Rattan, Rishi
    SURGICAL INFECTIONS, 2021, 22 (06) : 604 - 610