Workshop on Human-in-the-loop Data Curation

被引:1
|
作者
Demartini, Gianluca [1 ]
Yang, Jie [2 ]
Sadiq, Shazia [1 ]
机构
[1] Univ Queensland, Brisbane, Qld, Australia
[2] Delft Univ Technol, Delft, Netherlands
关键词
D O I
10.1145/3511808.3557498
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although data quality is a long-standing and enduring problem, it has recently received a resurgence of attention due to the fast proliferation of data analytics, machine learning, and decision-support applications built upon the wide-scale availability and accessibility of (big) data. The success of such applications heavily relies on not only the quantity, but also the quality of data. Data curation, which may include annotation, cleaning, transformation, integration, etc., is a critical step to provide adequate assurances on the quality of analytics and machine learning results. Such data preparation activities are recognised as time and resource intensive for data scientists as data often comes with a number of challenges that need to be tackled before it can be used in practice. Data re-purposing and the resulting distance between design and use intentions of the data, is a fundamental issue behind many of these challenges. These challenges include a variety of data issues such as noise and outliers, incompleteness, representativeness or biases, heterogeneity of format or semantics, etc. Mishandling these challenges can lead to negative and sometimes damaging effects, especially in critical domains like healthcare, transport, and finance. An observable distinct feature of data quality in these contexts is the increasingly important role played by humans, being often the source of data generation and the active players in data curation. This workshop will provide an opportunity to explore the interdisciplinary overlap between manual, automated, and hybrid human-machine methods of data curation.
引用
收藏
页码:5161 / 5162
页数:2
相关论文
共 50 条
  • [1] Eighth Workshop on Human-In-the-Loop Data Analytics (HILDA)
    Fekete, Jean-Daniel
    Rong, Kexin
    Omidvar-Tehrani, Behrooz
    Shraga, Roee
    [J]. COMPANION OF THE 2024 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, SIGMOD-COMPANION 2024, 2024, : 657 - 658
  • [2] International Workshop on Human-In-the-Loop Data Analytics (HILDA)
    Battle, Leilani
    Chaudhuri, Surajit
    Nandi, Arnab
    [J]. SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, : 2072 - 2072
  • [3] Towards a human-in-the-loop curation: A qualitative perspective
    Adorjan, Alejandro
    Vargas-Solar, Genoveva
    Motz, Regina
    [J]. 2022 IEEE/ACS 19TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2022,
  • [4] HILDA'22: The SIGMOD 2022 Workshop on Human-in-the-Loop Data Analytics
    Abouzied, Azza
    Moritz, Dominik
    Cafarella, Michael J.
    [J]. PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 2552 - 2553
  • [5] Human-in-the-loop Data Integration
    Li, Guoliang
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (12): : 2006 - 2017
  • [6] Enhancing Human-in-the-Loop Ontology Curation Results through Task Design
    Tsaneva, Stefani
    Sabou, Marta
    [J]. ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2024, 16 (01):
  • [7] Human-in-the-Loop Data Integration System
    Sun, Ji
    Li, Guo-Liang
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (03): : 654 - 668
  • [8] Human-in-the-Loop Machine Learning Curation of Problems That Bother Parkinson Disease Patients
    Marras, Connie
    Arbatti, Lakshmi
    Amara, Amy
    Anderson, Karen
    Bale, Claire
    Chahine, Lana
    Eberly, Shirley
    Hosamath, Abhishek
    Kinel, Daniel
    Mantri, Sneha
    Mathur, Soania
    Oakes, David
    Purks, Jennifer
    Weintraub, Daniel
    Shoulson, Ira
    [J]. ANNALS OF NEUROLOGY, 2022, 92 : S116 - S116
  • [9] Human-in-the-Loop Data Analysis: A Personal Perspective
    Doan, AnHai
    [J]. HILDA'18: PROCEEDINGS OF THE WORKSHOP ON HUMAN-IN-THE-LOOP DATA ANALYTICS, 2018,
  • [10] Identification of Explainable Structures in Data with a Human-in-the-Loop
    Thrun, Michael C.
    [J]. KUNSTLICHE INTELLIGENZ, 2022, 36 (3-4): : 297 - 301