The illusion of data validity: Why numbers about people are likely wrong

Cited by: 0

Authors:

Jansen B.J. [1]

Salminen J. [2]

Jung S.-G. [1]

Almerekhi H. [1,3]

Affiliations:

[1] Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha

[2] School of Marketing and Communication, University of Vaasa, Vaasa

[3] College of Science and Engineering, Hamad Bin Khalifa University, Doha

Source:

Data and Information Management | 2022 / Vol. 6 / Issue 04

Keywords:

Measurement; People data; Quantitative paradigm; Statistics

DOI:

10.1016/j.dim.2022.100020

Abstract:

This reflection article addresses a difficulty faced by scholars and practitioners working with numbers about people, which is that those who study people want numerical data about these people. Unfortunately, time and time again, this numerical data about people is wrong. Addressing the potential causes of this wrongness, we present examples of analyzing people numbers, i.e., numbers derived from digital data by or about people, and discuss the comforting illusion of data validity. We first lay a foundation by highlighting potential inaccuracies in collecting people data, such as selection bias. Then, we discuss inaccuracies in analyzing people data, such as the flaw of averages, followed by a discussion of errors that are made when trying to make sense of people data through techniques such as posterior labeling. Finally, we discuss a root cause of people data often being wrong – the conceptual conundrum of thinking the numbers are counts when they are actually measures. Practical solutions to address this illusion of data validity are proposed. The implications for theories derived from people data are also highlighted, namely that these people theories are generally wrong as they are often derived from people numbers that are wrong. © 2022 Wuhan University
