Inferring Smoking Status from User Generated Content in an Online Cessation Community

被引:2
|
作者
Amato, Michael S. [1 ]
Papandonatos, George D. [2 ]
Cha, Sarah [1 ]
Wang, Xi [3 ]
Zhao, Kang [4 ]
Cohn, Amy M. [5 ,6 ]
Pearson, Jennifer L. [7 ,8 ]
Graham, Amanda L. [1 ,6 ]
机构
[1] Truth Initiat, Schroeder Inst Tobacco Res & Policy Studies, 900 G St NW,Fourth Floor, Washington, DC 20001 USA
[2] Brown Univ, Ctr Stat Sci, Providence, RI 02912 USA
[3] Cent Univ Finance & Econ, Sch Informat, Beijing, Peoples R China
[4] Univ Iowa, Dept Management Sci, Iowa City, IA 52242 USA
[5] Battelle Mem Inst, Arlington, VA USA
[6] Georgetown Univ, Med Ctr, Dept Oncol, Washington, DC 20007 USA
[7] Univ Nevada, Sch Community Hlth Sci, Reno, NV 89557 USA
[8] Johns Hopkins Bloomberg Sch Publ Hlth, Dept Hlth Behav & Soc, Baltimore, MD USA
基金
美国国家卫生研究院;
关键词
SOCIAL SUPPORT; INTERNET; INTERVENTIONS; AGREEMENT; TRIALS; EX;
D O I
10.1093/ntr/nty014
中图分类号
R194 [卫生标准、卫生检查、医药管理];
学科分类号
摘要
Introduction User generated content (UGC) is a valuable but underutilized source of information about individuals who participate in online cessation interventions. This study represents a first effort to passively detect smoking status among members of an online cessation program using UGC. Methods Secondary data analysis was performed on data from 826 participants in a web-based smoking cessation randomized trial that included an online community. Domain experts from the online community reviewed each post and comment written by participants and attempted to infer the author's smoking status at the time it was written. Inferences from UGC were validated by comparison with self-reported 30-day point prevalence abstinence (PPA). Following validation, the impact of this method was evaluated across all individuals and time points in the study period. Results Of the 826 participants in the analytic sample, 719 had written at least one post from which content inference was possible. Among participants for whom unambiguous smoking status was inferred during the 30 days preceding their 3-month follow-up survey, concordance with self-report was almost perfect (kappa = 0.94). Posts indicating abstinence tended to be written shortly after enrollment (median = 14 days). Conclusions Passive inference of smoking status from UGC in online cessation communities is possible and highly reliable for smokers who actively produce content. These results lay the groundwork for further development of observational research tools and intervention innovations. Implications A proof-of-concept methodology for inferring smoking status from user generated content in online cessation communities is presented and validated. Content inference of smoking status makes a key cessation variable available for use in observational designs. This method provides a powerful tool for researchers interested in online cessation interventions and establishes a foundation for larger scale application via machine learning.
引用
收藏
页码:205 / 211
页数:7
相关论文
共 50 条
  • [1] Mining user-generated content in an online smoking cessation community to identify smoking status: A machine learning approach
    Wang, Xi
    Zhao, Kang
    Cha, Sarah
    Amato, Michael S.
    Cohn, Amy M.
    Pearson, Jennifer L.
    Papandonatos, George D.
    Graham, Amanda L.
    [J]. DECISION SUPPORT SYSTEMS, 2019, 116 : 26 - 34
  • [2] Inferring brand proximities from user-generated content
    Dwyer P.
    [J]. Journal of Brand Management, 2012, 19 (6) : 467 - 483
  • [3] Latent Users in an Online User-Generated Content Community
    Velasquez, Alcides
    Wash, Rick
    Lampe, Cliff
    Bjornrud, Tor
    [J]. COMPUTER SUPPORTED COOPERATIVE WORK-THE JOURNAL OF COLLABORATIVE COMPUTING AND WORK PRACTICES, 2014, 23 (01): : 21 - 50
  • [4] Latent Users in an Online User-Generated Content Community
    Alcides Velasquez
    Rick Wash
    Cliff Lampe
    Tor Bjornrud
    [J]. Computer Supported Cooperative Work (CSCW), 2014, 23 : 21 - 50
  • [5] Social Network Analysis of an Online Smoking Cessation Community to Identify Users' Smoking Status
    Shah, Adnan Muhammad
    Yan, Xiangbin
    Qayyum, Abdul
    [J]. HEALTHCARE INFORMATICS RESEARCH, 2021, 27 (02) : 116 - 126
  • [6] Inferring User Gender from User Generated Visual Content on a Deep Semantic Space
    Semedo, David
    Magalhaes, Joao
    Martins, Flavio
    [J]. 2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 1800 - 1804
  • [7] Exploring features of social support in a Chinese online smoking cessation community: A multidimensional content analysis of user interaction data
    Qian, Yuxing
    Gui, Wenxuan
    Ma, Feicheng
    Dong, Qingxing
    [J]. HEALTH INFORMATICS JOURNAL, 2021, 27 (02)
  • [8] Editorial: Online User Behavior and User-Generated Content
    Saura, Jose Ramon
    Dwivedi, Yogesh K.
    Palacios-Marques, Daniel
    [J]. FRONTIERS IN PSYCHOLOGY, 2022, 13
  • [9] The Community Network Game Project: Enriching Online Gamers Experience with User Generated Content
    Ahmad, Shakeel
    Bouras, Christos
    Hamzaoui, Raouf
    Papazois, Andreas
    Perelman, Erez
    Shani, Alex
    Simon, Gwendal
    Tsichritzis, George
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON CREATIVE CONTENT TECHNOLOGIES (CONTENT 2010), 2010, : 13 - 18
  • [10] Inferring bladder cancer research prioritization from patient-generated online content
    Mossanen, Matthew
    Chu, Alice
    Smith, Angela B.
    Gore, John L.
    [J]. WORLD JOURNAL OF UROLOGY, 2019, 37 (06) : 1145 - 1150