Towards Using Social Media to Identify Individuals at Risk for Preventable Chronic Illness

被引:0
|
作者
Bell, Dane [1 ]
Fried, Daniel [1 ]
Huangfu, Luwen [1 ]
Surdeanu, Mihai [1 ]
Kobourov, Stephen [1 ]
机构
[1] Univ Arizona, Tucson, AZ 85721 USA
关键词
machine learning; obesity detection; social media; NETWORKING; OBESITY;
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
We describe a strategy for the acquisition of training data necessary to build a social-media-driven early detection system for individuals at risk for (preventable) type 2 diabetes mellitus (T2DM). The strategy uses a game-like quiz with data and questions acquired semi-automatically from Twitter. The questions are designed to inspire participant engagement and collect relevant data to train a public-health model applied to individuals. Prior systems designed to use social media such as Twitter to predict obesity (a risk factor for T2DM) operate on entire communities such as states, counties, or cities, based on statistics gathered by government agencies. Because there is considerable variation among individuals within these groups, training data on the individual level would be more effective, but this data is difficult to acquire. The approach proposed here aims to address this issue. Our strategy has two steps. First, we trained a random forest classifier on data gathered from (public) Twitter statuses and state-level statistics with state-of-the-art accuracy. We then converted this classifier into a 20-questions-style quiz and made it available online. In doing so, we achieved high engagement with individuals that took the quiz, while also building a training set of voluntarily supplied individual-level data for future classification.
引用
下载
收藏
页码:2957 / 2964
页数:8
相关论文
共 50 条
  • [31] Towards Identifying Collaborative Learning Groups Using Social Media
    Softic, S.
    INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2012, 7 : 15 - 21
  • [32] Community building and knowledge sharing by individuals with disabilities using social media
    Sweet, Kayla S.
    LeBlanc, Jennifer K.
    Stough, Laura M.
    Sweany, Noelle W.
    JOURNAL OF COMPUTER ASSISTED LEARNING, 2020, 36 (01) : 1 - 11
  • [33] A comparison of five surveys that identify individuals at risk for airflow obstruction and chronic obstructive pulmonary disease
    Sogbetun, Folarin
    Eschenbacher, William L.
    Welge, Jeffrey A.
    Panos, Ralph J.
    RESPIRATORY MEDICINE, 2016, 120 : 1 - 9
  • [34] Social Media Markers to Identify Fathers at Risk of Postpartum Depression: A Machine Learning Approach
    Shatte, Adrian B. R.
    Hutchinson, Delyse M.
    Fuller-Tyszkiewicz, Matthew
    Teague, Samantha J.
    CYBERPSYCHOLOGY BEHAVIOR AND SOCIAL NETWORKING, 2020, 23 (09) : 611 - 618
  • [35] Using Network Flows to Identify Users Sharing Extremist Content on Social Media
    Wei, Yifang
    Singh, Lisa
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2017, PT I, 2017, 10234 : 330 - 342
  • [36] Using Common Enemy Graphs to Identify Communities of Coordinated Social Media Activity
    Overbey, Lucas A.
    Ek, Bryan
    Pinzhoffer, Kevin
    Williams, Bryan
    SOCIAL, CULTURAL, AND BEHAVIORAL MODELING, SBP-BRIMS 2019, 2019, 11549 : 92 - 102
  • [37] Using social media to identify recreational bluefish angling in the Mediterranean and Black Sea
    Eryasar, Ahmet Raif
    Saygu, Ismet
    MARINE POLICY, 2022, 135
  • [38] Deployment of social nets in multilayer model to identify key individuals using majority voting
    Fozia Noor
    Asadullah Shah
    Mohammad Usman Akram
    Shoab Ahmad Khan
    Knowledge and Information Systems, 2019, 58 : 113 - 137
  • [39] Deployment of social nets in multilayer model to identify key individuals using majority voting
    Noor, Fozia
    Shah, Asadullah
    Akram, Mohammad Usman
    Khan, Shoab Ahmad
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 58 (01) : 113 - 137
  • [40] Using Social Media Photos to Identify Tourism Preferences in Smart Tourism Destination
    Figueredo, Mickael
    Cacho, Nelio
    Thome, Antonio
    Cacho, Andrea
    Lopes, Frederico
    Araujo, Maria
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 4068 - 4073