Estimating County-Level Overdose Rates Using Opioid-Related Twitter Data: Interdisciplinary Infodemiology Study

被引:2
|
作者
Cuomo, Raphael [1 ,2 ,6 ]
Purushothaman, Vidya [2 ,3 ]
Calac, Alec J. [1 ,2 ]
McMann, Tiana [2 ,3 ,4 ,5 ]
Li, Zhuoran [2 ,3 ,5 ]
Mackey, Tim [2 ,3 ,4 ,5 ]
机构
[1] Univ Calif San Diego, Sch Med, La Jolla, CA USA
[2] Global Hlth Policy & Data Inst, San Diego, CA USA
[3] San Diego Supercomp Ctr, San Diego, CA USA
[4] Univ Calif San Diego, Dept Anthropol, La Jolla, CA USA
[5] S 3 Res, San Diego, CA USA
[6] Univ Calif San Diego, Sch Med, 9500 Gilman Dr, La Jolla, CA 92093 USA
关键词
overdose; mortality; geospatial analysis; social media; drug overuse; substance use; social media data; mortality estimates; real-time data; public health data; demographic variables; county-level; POISON CONTROL CENTERS; DATA SYSTEM;
D O I
10.2196/42162
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: There were an estimated 100,306 drug overdose deaths between April 2020 and April 2021, a three-quarter increase from the prior 12-month period. There is an approximate 6-month reporting lag for provisional counts of drug overdose deaths from the National Vital Statistics System, and the highest level of geospatial resolution is at the state level. By contrast, public social media data are available close to real-time and are often accessible with precise coordinates.Objective: The purpose of this study is to assess whether county-level overdose mortality burden could be estimated using opioid-related Twitter data.Methods: International Classification of Diseases (ICD) codes for poisoning or exposure to overdose at the county level were obtained from CDC WONDER. Demographics were collected from the American Community Survey. The Twitter Application Programming Interface was used to obtain tweets that contained any of the 36 terms with drug names. An unsupervised classification approach was used for clustering tweets. Population-normalized variables and polynomial population-normalized variables were produced. Furthermore, z scores of the Getis Ord Gi clustering statistic were produced, and both these scores and their polynomial counterparts were explored in regression modeling of county-level overdose mortality burden. A series of linear regression models were used for predictive modeling to explore the interpretability of the analytical output.Results: Modeling overdose mortality with normalized demographic variables alone explained only 7.4% of the variability in county-level overdose mortality, whereas this was approximately doubled by the use of specific demographic and Twitter data covariates based on a backward selection approach. The highest adjusted R2 and lowest AIC (Akaike Info Criterion) were obtained for the model with normalized demographic variables, normalized z scores from geospatial analyses, and normalized topic counts (adjusted R2=0.133, AIC=8546.8). The z scores of the Getis Ord Gi statistic appeared to have improved utility over population-normalization alone. In this model, median age, female population, and tweets about web-based drug sales were positively associated with opioid mortality. Asian race and Hispanic ethnicity were significantly negatively associated with county-level burdens of overdose mortality.Conclusions: Social media data, when transformed using certain statistical approaches, may add utility to the goal of producing closer to real-time county-level estimates of overdose mortality. Prediction of opioid-related outcomes can be advanced to inform prevention and treatment decisions. This interdisciplinary approach can facilitate evidence-based funding decisions for various substance use disorder prevention and treatment programs.
引用
下载
收藏
页数:12
相关论文
共 31 条
  • [21] Using multi-sourced big data to correlate sleep deprivation and road traffic noise: A US county-level ecological study
    Tong, Huan
    Warren, Joshua L.
    Kang, Jian
    Li, Mingxiao
    ENVIRONMENTAL RESEARCH, 2023, 220
  • [23] Opioid-related overdose deaths among people experiencing homelessness, 2017 to 2021: A population-based analysis using coroner and health administrative data from Ontario, Canada
    Booth, Richard G.
    Shariff, Salimah Z.
    Carter, Brooke
    Hwang, Stephen W.
    Orkin, Aaron M.
    Forchuk, Cheryl
    Gomes, Tara
    ADDICTION, 2024, 119 (02) : 334 - 344
  • [24] Opioid-related diagnoses and HIV, HCV and mental disorders: using Pennsylvania hospitalisation data to assess community-level relationships over space and time
    Sumetsky, Natalie
    Burke, Jessica G.
    Mair, Christina
    JOURNAL OF EPIDEMIOLOGY AND COMMUNITY HEALTH, 2019, 73 (10) : 935 - 940
  • [25] State-level variation in distribution of oxycodone and opioid-related deaths from 2000 to 2021: an ecological study of ARCOS and CDC WONDER data in the USA
    Solgama, Jay P.
    Liu, Edward
    Davis, Mellar
    Graham, Jove
    McCall, Kenneth L.
    Piper, Brian J.
    BMJ OPEN, 2024, 14 (03):
  • [26] Optimal county-level crop yield prediction using MODIS-based variables and weather data: A comparative study on machine learning models
    Ju, Sungha
    Lim, Hyoungjoon
    Ma, Jong Won
    Kim, Soohyun
    Lee, Kyungdo
    Zhao, Shuhe
    Heo, Joon
    AGRICULTURAL AND FOREST METEOROLOGY, 2021, 307
  • [27] US county-level estimation for maternal and infant health-related behavior indicators using pregnancy risk assessment monitoring system data, 2016–2018
    Yan Wang
    Heather Tevendale
    Hua Lu
    Shanna Cox
    Susan A. Carlson
    Rui Li
    Holly Shulman
    Brian Morrow
    Philip A. Hastings
    Wanda D. Barfield
    Population Health Metrics, 20
  • [28] US county-level estimation for maternal and infant health-related behavior indicators using pregnancy risk assessment monitoring system data, 2016-2018
    Wang, Yan
    Tevendale, Heather
    Lu, Hua
    Cox, Shanna
    Carlson, Susan A.
    Li, Rui
    Shulman, Holly
    Morrow, Brian
    Hastings, Philip A.
    Barfield, Wanda D.
    POPULATION HEALTH METRICS, 2022, 20 (01)
  • [29] A county-level study of the effects of state-mandated COVID-19 lockdowns on urban and rural restaurant visits using consumers' cell phone geo-location data
    Banerjee, Tannista
    Nayak, Arnab
    Zhao, HaiYue
    JOURNAL OF PUBLIC HEALTH-HEIDELBERG, 2023, 31 (02): : 249 - 258
  • [30] A county-level study of the effects of state-mandated COVID-19 lockdowns on urban and rural restaurant visits using consumers’ cell phone geo-location data
    Tannista Banerjee
    Arnab Nayak
    HaiYue Zhao
    Journal of Public Health, 2023, 31 : 249 - 258