Causality in statistics and data science education

被引:0
|
作者
Kevin Cummiskey
Karsten Lübke
机构
[1] United States Military Academy,Department of Mathematical Sciences
[2] FOM University of Applied Sciences,ifes Institute for Empirical Research & Statistics
关键词
Statistics education research; Data Science; Causality; Bias and Confounding; A22; C18; C55; C80; C90;
D O I
10.1007/s11943-022-00311-9
中图分类号
学科分类号
摘要
Statisticians and data scientists transform raw data into understanding and insight. Ideally, these insights empower people to act and make better decisions. However, data is often misleading especially when trying to draw conclusions about causality (for example, Simpson’s paradox). Therefore, developing causal thinking in undergraduate statistics and data science programs is important. However, there is very little guidance in the education literature about what topics and learning outcomes, specific to causality, are most important. In this paper, we propose a causality curriculum for undergraduate statistics and data science programs. Students should be able to think causally, which is defined as a broad pattern of thinking that enables individuals to appropriately assess claims of causality based upon statistical evidence. They should understand how the data generating process affects their conclusions and how to incorporate knowledge from subject matter experts in areas of application. Important topics in causality for the undergraduate curriculum include the potential outcomes framework and counterfactuals, measures of association versus causal effects, confounding, causal diagrams, and methods for estimating causal effects.
引用
收藏
页码:277 / 286
页数:9
相关论文
共 50 条