Human-supervised clustering of multidimensional data using crowdsourcing

被引:1
|
作者
Butyaev, Alexander [1 ]
Drogaris, Chrisostomos [1 ]
Tremblay-Savard, Olivier [2 ]
Waldispuehl, Jerome [1 ]
机构
[1] McGill Univ, Sch Comp Sci, Montreal, PQ, Canada
[2] Univ Manitoba, Dept Comp Sci, Winnipeg, Manitoba, Canada
来源
ROYAL SOCIETY OPEN SCIENCE | 2022年 / 9卷 / 05期
基金
加拿大健康研究院;
关键词
data clustering; human-computing; crowdsourcing; games; MECHANICAL TURK;
D O I
10.1098/rsos.211189
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Clustering is a central task in many data analysis applications. However, there is no universally accepted metric to decide the occurrence of clusters. Ultimately, we have to resort to a consensus between experts. The problem is amplified with high-dimensional datasets where classical distances become uninformative and the ability of humans to fully apprehend the distribution of the data is challenged. In this paper, we design a mobile human-computing game as a tool to query human perception for the multidimensional data clustering problem. We propose two clustering algorithms that partially or entirely rely on aggregated human answers and report the results of two experiments conducted on synthetic and real-world datasets. We show that our methods perform on par or better than the most popular automated clustering algorithms. Our results suggest that hybrid systems leveraging annotations of partial datasets collected through crowdsourcing platforms can be an efficient strategy to capture the collective wisdom for solving abstract computational problems.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Quantum Supervised Clustering Algorithm for Big Data
    Bishwas, Arit Kumar
    Mani, Ashish
    Palade, Vasile
    2018 3RD INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [42] Semi-supervised clustering for complicated data
    Huang, Tian-Qiang
    Yu, Yang-Qiang
    Qin, Xiao-Lin
    Kongzhi yu Juece/Control and Decision, 2010, 25 (01): : 14 - 19
  • [43] A Semi-supervised Clustering for Incomplete Data
    Goel, Sonia
    Tushir, Meena
    APPLICATIONS OF ARTIFICIAL INTELLIGENCE TECHNIQUES IN ENGINEERING, SIGMA 2018, VOL 1, 2019, 698 : 323 - 331
  • [44] Multidimensional Reputation Evaluation Model for Crowdsourcing Participants Based on Big Data
    Huang, Yanrong
    Chen, Min
    2019 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE BIG DATA AND INTELLIGENT SYSTEMS (HPBD&IS), 2019, : 41 - 46
  • [45] Combining supervised and unsupervised learning for data clustering
    Corsini, Paolo
    Lazzerini, Beatrice
    Marcelloni, Francesco
    NEURAL COMPUTING & APPLICATIONS, 2006, 15 (3-4): : 289 - 297
  • [46] Supervised hierarchical clustering using CART
    Hancock, TP
    Coomans, DH
    Everingham, YL
    MODSIM 2003: INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION, VOLS 1-4: VOL 1: NATURAL SYSTEMS, PT 1; VOL 2: NATURAL SYSTEMS, PT 2; VOL 3: SOCIO-ECONOMIC SYSTEMS; VOL 4: GENERAL SYSTEMS, 2003, : 1880 - 1885
  • [47] Using supervised clustering to enhance classifiers
    Eick, CF
    Zeidat, N
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2005, 3488 : 248 - 256
  • [48] Generating Alerts to Assist With Task Assignments in Human-Supervised Multi-Robot Teams Operating in Challenging Environments
    Al-Hussaini, Sarah
    Gregory, Jason M.
    Guan, Yuxiang
    Gupta, Satyandra K.
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 11245 - 11252
  • [49] Detection of Sparsity in Multidimensional Data Using Network Degree Distribution and Improved Supervised Learning with Correction of Data Weighting
    Ueno, Shinya
    Sakai, Osamu
    COMPLEX NETWORKS AND THEIR APPLICATIONS XI, COMPLEX NETWORKS 2022, VOL 1, 2023, 1077 : 390 - 401
  • [50] Human-Supervised Automation Test Cell to Accelerate Personal Protective Equipment Manufacturing During the COVID-19 Pandemic
    Shaham, Michael H.
    Skopin, Matthew
    Hochsztein, Hillel
    Mabulu, Katiso
    Milburn, Lee
    Tukpah, James
    Tunik, Aleksandr
    Winn, James
    Zolotas, Mark
    Erdogmus, Deniz
    Padir, Taskin
    2022 IEEE INTERNATIONAL SYMPOSIUM ON TECHNOLOGIES FOR HOMELAND SECURITY (HST), 2022,