Human-supervised clustering of multidimensional data using crowdsourcing

被引:1
|
作者
Butyaev, Alexander [1 ]
Drogaris, Chrisostomos [1 ]
Tremblay-Savard, Olivier [2 ]
Waldispuehl, Jerome [1 ]
机构
[1] McGill Univ, Sch Comp Sci, Montreal, PQ, Canada
[2] Univ Manitoba, Dept Comp Sci, Winnipeg, Manitoba, Canada
来源
ROYAL SOCIETY OPEN SCIENCE | 2022年 / 9卷 / 05期
基金
加拿大健康研究院;
关键词
data clustering; human-computing; crowdsourcing; games; MECHANICAL TURK;
D O I
10.1098/rsos.211189
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Clustering is a central task in many data analysis applications. However, there is no universally accepted metric to decide the occurrence of clusters. Ultimately, we have to resort to a consensus between experts. The problem is amplified with high-dimensional datasets where classical distances become uninformative and the ability of humans to fully apprehend the distribution of the data is challenged. In this paper, we design a mobile human-computing game as a tool to query human perception for the multidimensional data clustering problem. We propose two clustering algorithms that partially or entirely rely on aggregated human answers and report the results of two experiments conducted on synthetic and real-world datasets. We show that our methods perform on par or better than the most popular automated clustering algorithms. Our results suggest that hybrid systems leveraging annotations of partial datasets collected through crowdsourcing platforms can be an efficient strategy to capture the collective wisdom for solving abstract computational problems.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Crowdsourcing based terminal positioning using multidimensional data clustering and interpolation
    Boujnah, Noureddine
    Korbel, Piotr
    PROCEEDINGS OF THE 2016 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2016, 8 : 961 - 967
  • [2] Human-supervised multiple mobile robot system
    Nakamura, A
    Ota, J
    Arai, T
    IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 2002, 18 (05): : 728 - 743
  • [3] Human-supervised data science framework for city governments: A design science approach
    Hagen, Loni
    Patel, Mihir
    Luna-Reyes, Luis
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2023, 74 (08) : 923 - 936
  • [4] Human-Supervised Control of the ATLAS Humanoid Robot for Traversing Doors
    Banerjee, Nandan
    Long, Xianchao
    Du, Ruixiang
    Polido, Felipe
    Feng, Siyuan
    Atkeson, Christopher G.
    Gennert, Michael
    Padir, Taskin
    2015 IEEE-RAS 15TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2015, : 722 - 729
  • [5] Mining multidimensional data using clustering techniques
    Pagani, Marco
    Bordogna, Gloria
    Valle, Massimiliano
    DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 382 - +
  • [6] On Trust-aware Assistance-seeking in Human-Supervised Autonomy
    Mangalindan, Dong Hae
    Rovira, Ericka
    Srivastava, Vaibhav
    2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 3901 - 3906
  • [7] A Holistic Approach to Human-Supervised Humanoid Robot Operations in Extreme Environments
    Wonsick, Murphy
    Long, Philip
    Onol, Aykut Ozgun
    Wang, Maozhen
    Padir, Taskin
    FRONTIERS IN ROBOTICS AND AI, 2021, 8
  • [8] Fast data association using multidimensional assignment with clustering
    Chummun, MR
    Kirubarajan, T
    Pattipati, KR
    Bar-Shalom, Y
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2001, 37 (03) : 898 - 913
  • [9] Integration of multidimensional archaeogeophysical data using supervised and unsupervised classification
    Ernenwein, Eileen G.
    NEAR SURFACE GEOPHYSICS, 2009, 7 (03) : 147 - 158
  • [10] Reliability of human-supervised formant-trajectory measurement for forensic voice comparison
    Zhang, Cuiling
    Morrison, Geoffrey Stewart
    Ochoa, Felipe
    Enzinger, Ewald
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 133 (01): : EL54 - EL60