Flock: Hybrid Crowd-Machine Learning Classifiers

被引:74
|
作者
Cheng, Justin [1 ]
Bernstein, Michael S. [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
关键词
Crowdsourcing; interactive machine learning;
D O I
10.1145/2675133.2675214
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present hybrid crowd-machine learning classifiers: classification models that start with a written description of a learning goal, use the crowd to suggest predictive features and label data, and then weigh these features using machine learning to produce models that are accurate and use human understandable features. These hybrid classifiers enable fast prototyping of machine learning models that can improve on both algorithm performance and human judgment, and accomplish tasks where automated feature extraction is not yet feasible. Flock, an interactive machine learning platform, instantiates this approach. To generate informative features, Flock asks the crowd to compare paired examples, an approach inspired by analogical encoding. The crowd's efforts can be focused on specific subsets of the input space where machine-extracted features are not predictive, or instead used to partition the input space and improve algorithm performance in subregions of the space. An evaluation on six prediction tasks, ranging from detecting deception to differentiating impressionist artists, demonstrated that aggregating crowd features improves upon both asking the crowd for a direct prediction and off-the-shelf machine learning features by over 10%. Further, hybrid systems that use both crowd-nominated and machine-extracted features can outperform those that use either in isolation.
引用
收藏
页码:600 / 611
页数:12
相关论文
共 50 条
  • [31] Machine learning classifiers and fMRI: A tutorial overview
    Pereira, Francisco
    Mitchell, Tom
    Botvinick, Matthew
    [J]. NEUROIMAGE, 2009, 45 (01) : S199 - S209
  • [32] Machine Learning Classifiers for Android Malware Analysis
    Urcuqui Lopez, Christian Camilo
    Navarro Cadavid, Andres
    [J]. 2016 IEEE COLOMBIAN CONFERENCE ON COMMUNICATIONS AND COMPUTING (COLCOM), 2016,
  • [33] Multi-objective learning of hybrid classifiers
    Piltaver, Rok
    Lustrek, Mitja
    Zupancic, Jernej
    Dzeroski, Saso
    Gams, Matjaz
    [J]. 21ST EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2014), 2014, 263 : 717 - 722
  • [34] A hybrid approach to design efficient learning classifiers
    Sarkar, Bikash Kanti
    Sana, Shib Sankar
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2009, 58 (01) : 65 - 73
  • [35] Hybrid Parallel Classifiers for Semantic Subspace Learning
    Tripathi, Nandita
    Oakes, Michael
    Wermter, Stefan
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2011, PT II, 2011, 6792 : 64 - +
  • [36] Physics-infused Machine Learning for Crowd Simulation
    Zhang, Guozhen
    Yu, Zihan
    Jin, Depeng
    Li, Yong
    [J]. PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2439 - 2449
  • [37] Crowd abnormal behavior detection based on machine learning
    Zhang, Dongping
    Peng, Huailiang
    Haibin, Yu
    Lu, Yafei
    [J]. Information Technology Journal, 2013, 12 (06) : 1199 - 1205
  • [38] Machine learning multi-classifiers for peptide classification
    Nanni, Loris
    Lumini, Alessandra
    [J]. NEURAL COMPUTING & APPLICATIONS, 2009, 18 (02): : 185 - 192
  • [39] Testing and validating machine learning classifiers by metamorphic testing
    Xie, Xiaoyuan
    Ho, Joshua W. K.
    Murphy, Christian
    Kaiser, Gail
    Xu, Baowen
    Chen, Tsong Yueh
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2011, 84 (04) : 544 - 558
  • [40] Selecting Appropriate Machine Learning Classifiers for DGA Diagnosis
    Aizpurua, Jose Ignacio
    Catterson, Victoria M.
    Stewart, Brian G.
    McArthur, Stephen D. J.
    Lambert, Brandon
    Ampofo, Bismark
    Pereira, Gavin
    Cross, James G.
    [J]. 2017 IEEE CONFERENCE ON ELECTRICAL INSULATION AND DIELECTRIC PHENOMENON (CEIDP), 2017, : 153 - 156