Controlling for Selection Bias in Social Media Indicators through Official Statistics: a Proposal

被引:10
|
作者
Iacus, Stefano M. [1 ]
Porro, Giuseppe [2 ]
Salini, Silvia [1 ]
Siletti, Elena [1 ]
机构
[1] Univ Milan, Dept Econ Management & Quantitat Methods, Via Conservatorio 7, I-20122 Milan, Italy
[2] Univ Insubria, Dept Law Econ & Culture, Via St Abbondio 12, I-22100 Como, Italy
关键词
Well-being; big data; sentiment analysis; small area estimation; weighting; SMALL-AREA ESTIMATION; BIG DATA; TIME-SERIES; FUTURE; INCOME;
D O I
10.2478/JOS-2020-0017
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
With the increase of social media usage, a huge new source of data has become available. Despite the enthusiasm linked to this revolution, one of the main outstanding criticisms in using these data is selection bias. Indeed, the reference population is unknown. Nevertheless, many studies show evidence that these data constitute a valuable source because they are more timely and possess higher space granularity. We propose to adjust statistics based on Twitter data by anchoring them to reliable official statistics through a weighted, space-time, small area estimation model. As a by-product, the proposed method also stabilizes the social media indicators, which is a welcome property required for official statistics. The method can be adapted anytime official statistics exists at the proper level of granularity and for which social media usage within the population is known. As an example, we adjust a subjective wellbeing indicator of "working conditions" in Italy, and combine it with relevant official statistics. The weights depend on broadband coverage and the Twitter rate at province level, while the analysis is performed at regional level. The resulting statistics are then compared with survey statistics on the "quality of job" at macro-economic regional level, showing evidence of similar paths.
引用
收藏
页码:315 / 338
页数:24
相关论文
共 25 条
  • [1] The Use of Official Statistics in Self-Selection Bias Modeling
    Dalla Valle, Luciana
    JOURNAL OF OFFICIAL STATISTICS, 2016, 32 (04) : 887 - 905
  • [2] The Use of Social Media for Communication In Official Statistics at European Level
    Glavan, Ionela-Roxana
    Mirica, Andreea
    Firtescu, Bogdan Narcis
    ROMANIAN STATISTICAL REVIEW, 2016, (04) : 37 - 48
  • [3] Review and proposal of indicators (Key Performance Indicators) for Library and social media
    Gonzalez-Fernandez-Villavicencio, Nieves
    Menendez Novoa, Jose Luis
    Seoane Garcia, Catuxa
    San Millan Fernandez, Maria Elvira
    REVISTA ESPANOLA DE DOCUMENTACION CIENTIFICA, 2013, 36 (01):
  • [4] Social media as a data source for official statistics; the Dutch Consumer Confidence Index
    van den Brakel, Jan
    Sohler, Emily
    Daas, Piet
    Buelens, Bart
    SURVEY METHODOLOGY, 2017, 43 (02) : 183 - 210
  • [5] Reconstruction of Media Social Representations Using Indicators of Text Statistics (Based on Media Discourse on the Pandemic)
    Radina, Nadezhda K.
    SOCIAL PSYCHOLOGY AND SOCIETY, 2024, 15 (01) : 76 - 91
  • [6] Soft Data and Public Policy: Can Social Media Offer Alternatives to Official Statistics in Urban Policymaking?
    Severo, Marta
    Feredj, Amel
    Romele, Alberto
    POLICY AND INTERNET, 2016, 8 (03): : 354 - 372
  • [7] Social circular economy indicators: Selection through fuzzy delphi method
    Padilla-Rivera, Alejandro
    do Carmo, Breno Barros Telles
    Arcese, Gabriella
    Merveille, Nicolas
    SUSTAINABLE PRODUCTION AND CONSUMPTION, 2021, 26 : 101 - 110
  • [8] Indicators on Social Media Islamic Information Credibility through an Expert Agreement
    Ab Kadir, Kairulanuar
    Ashaari, Noraidah Sahari
    Judi, Hairulliza Mohamad
    JURNAL KOMUNIKASI-MALAYSIAN JOURNAL OF COMMUNICATION, 2019, 35 (02) : 499 - 522
  • [9] Safety Assessment and Selection Bias: Who Uses Social Media to Communicate About Medications?
    DiSantostefano, Rachael L.
    Painter, Jeffery L.
    Thomas, Michele
    Powell, Greg
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2015, 24 : 547 - 548
  • [10] Addressing Selection Bias in Event Studies with General-Purpose Social Media Panels
    Zhang, Han
    Hill, Shawndra
    Rothschild, David
    ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2018, 10 (01):