Inferring Smoking Status from User Generated Content in an Online Cessation Community

被引:2
|
作者
Amato, Michael S. [1 ]
Papandonatos, George D. [2 ]
Cha, Sarah [1 ]
Wang, Xi [3 ]
Zhao, Kang [4 ]
Cohn, Amy M. [5 ,6 ]
Pearson, Jennifer L. [7 ,8 ]
Graham, Amanda L. [1 ,6 ]
机构
[1] Truth Initiat, Schroeder Inst Tobacco Res & Policy Studies, 900 G St NW,Fourth Floor, Washington, DC 20001 USA
[2] Brown Univ, Ctr Stat Sci, Providence, RI 02912 USA
[3] Cent Univ Finance & Econ, Sch Informat, Beijing, Peoples R China
[4] Univ Iowa, Dept Management Sci, Iowa City, IA 52242 USA
[5] Battelle Mem Inst, Arlington, VA USA
[6] Georgetown Univ, Med Ctr, Dept Oncol, Washington, DC 20007 USA
[7] Univ Nevada, Sch Community Hlth Sci, Reno, NV 89557 USA
[8] Johns Hopkins Bloomberg Sch Publ Hlth, Dept Hlth Behav & Soc, Baltimore, MD USA
基金
美国国家卫生研究院;
关键词
SOCIAL SUPPORT; INTERNET; INTERVENTIONS; AGREEMENT; TRIALS; EX;
D O I
10.1093/ntr/nty014
中图分类号
R194 [卫生标准、卫生检查、医药管理];
学科分类号
摘要
Introduction User generated content (UGC) is a valuable but underutilized source of information about individuals who participate in online cessation interventions. This study represents a first effort to passively detect smoking status among members of an online cessation program using UGC. Methods Secondary data analysis was performed on data from 826 participants in a web-based smoking cessation randomized trial that included an online community. Domain experts from the online community reviewed each post and comment written by participants and attempted to infer the author's smoking status at the time it was written. Inferences from UGC were validated by comparison with self-reported 30-day point prevalence abstinence (PPA). Following validation, the impact of this method was evaluated across all individuals and time points in the study period. Results Of the 826 participants in the analytic sample, 719 had written at least one post from which content inference was possible. Among participants for whom unambiguous smoking status was inferred during the 30 days preceding their 3-month follow-up survey, concordance with self-report was almost perfect (kappa = 0.94). Posts indicating abstinence tended to be written shortly after enrollment (median = 14 days). Conclusions Passive inference of smoking status from UGC in online cessation communities is possible and highly reliable for smokers who actively produce content. These results lay the groundwork for further development of observational research tools and intervention innovations. Implications A proof-of-concept methodology for inferring smoking status from user generated content in online cessation communities is presented and validated. Content inference of smoking status makes a key cessation variable available for use in observational designs. This method provides a powerful tool for researchers interested in online cessation interventions and establishes a foundation for larger scale application via machine learning.
引用
收藏
页码:205 / 211
页数:7
相关论文
共 50 条
  • [21] An Online Placement Mechanism for Efficient Delivery of User Generated Content
    Safavi, Mohammadhassan
    Bastani, Saeed
    Landfeldt, Bjorn
    [J]. 2017 IEEE 22ND INTERNATIONAL WORKSHOP ON COMPUTER AIDED MODELING AND DESIGN OF COMMUNICATION LINKS AND NETWORKS (CAMAD), 2017,
  • [22] Moderated Online Communities and Quality of User-Generated Content
    Chen, Jianqing
    Xu, Hong
    Whinston, Andrew B.
    [J]. JOURNAL OF MANAGEMENT INFORMATION SYSTEMS, 2011, 28 (02) : 237 - 268
  • [23] Exploring the Usefulness of User-Generated Content for Business Intelligence in Innovation: Empirical Evidence From an Online Open Innovation Community
    Daradkeh, Mohammad Kamel
    [J]. INTERNATIONAL JOURNAL OF ENTERPRISE INFORMATION SYSTEMS, 2021, 17 (02) : 44 - 70
  • [24] A Multirelational Social Network Analysis of an Online Health Community for Smoking Cessation
    Zhao, Kang
    Wang, Xi
    Cha, Sarah
    Cohn, Amy M.
    Papandonatos, George D.
    Amato, Michael S.
    Pearson, Jennifer L.
    Graham, Amanda L.
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2016, 18 (08)
  • [25] Foreword to the special issue on mining actionable insights from online user generated content
    Armentano, Marcelo G.
    Bagheri, Ebrahim
    Kiseleva, Julia
    Takes, Frank W.
    [J]. INFORMATION RETRIEVAL JOURNAL, 2020, 23 (05): : 473 - 474
  • [26] Social Ties and User-Generated Content: Evidence from an Online Social Network
    Shriver, Scott K.
    Nair, Harikesh S.
    Hofstetter, Reto
    [J]. MANAGEMENT SCIENCE, 2013, 59 (06) : 1425 - 1443
  • [27] Foreword to the special issue on mining actionable insights from online user generated content
    Marcelo G. Armentano
    Ebrahim Bagheri
    Julia Kiseleva
    Frank W. Takes
    [J]. Information Retrieval Journal, 2020, 23 : 473 - 474
  • [28] "Popularity Effect" in User-Generated Content: Evidence from Online Product Reviews
    Goes, Paulo B.
    Lin, Mingfeng
    Yeung, Ching-man Au
    [J]. INFORMATION SYSTEMS RESEARCH, 2014, 25 (02) : 222 - 238
  • [29] Using Content and Network Analysis to Understand the Social Support Exchange Patterns and User Behaviors of an Online Smoking Cessation Intervention Program
    Zhang, Mi
    Yang, Christopher C.
    [J]. JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2015, 66 (03) : 564 - 575
  • [30] The institutionalization of YouTube: From user-generated content to professionally generated content
    Kim, Jin
    [J]. MEDIA CULTURE & SOCIETY, 2012, 34 (01) : 53 - 67