Ensuring High-Quality Private Data for Responsible Data Science: Vision and Challenges

Cited by: 16
Authors
Srivastava, Divesh [1]
Scannapieco, Monica [2]
Redman, Thomas C. [3]
Affiliations
[1] AT&T Labs Res, Room 4C202B,1 AT&T Way, Bedminster, NJ 07921 USA
[2] Italian Natl Inst Stat, Via C Balbo 16, I-00184 Rome, Italy
[3] Data Qual Solut, 12 Monmouth Ave, Rumson, NJ 07760 USA
Keywords
Responsible data science; data trust; private data; quality of private data
DOI
10.1145/3287168
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
High-quality data is critical for effective data science. As the use of data science has grown, so too have concerns that individuals' rights to privacy will be violated. This has led to the development of data protection regulations around the globe and the use of sophisticated anonymization techniques to protect privacy. Such measures make it more challenging for the data scientist to understand the data, exacerbating issues of data quality. Responsible data science aims to develop useful insights from the data while fully embracing these considerations. We pose the high-level problem in this article, "How can a data scientist develop the needed trust that private data has high quality?" We then identify a series of challenges for various data-centric communities and outline research questions for data quality and privacy researchers, which would need to be addressed to effectively answer the problem posed in this article.
Pages: 1 / 9
Number of pages: 9