Leveraging Attributes and Crowdsourcing for Join

被引:0
|
作者
Feng, Jianhong [1 ]
Feng, Jianhua [1 ]
Hu, Huiqi [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci, Beijing 100084, Peoples R China
来源
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Join operation is usually hard to achieve high quality with machine alone. We adopt crowdsourcing to improve the quality of join. Depending on the number of generated pairs, the overall cost can be expensive for hiring workers to do the verification. We propose a hybrid approach to generate pairs by leveraging attributes, which combines category, sorting and clustering techniques, called CSCER. We also propose an adaptive attribute-selection strategy to efficiently generate pairs based on attributes. Experiments on a real crowdsourcing platform using real datasets indicate that our approaches save the overall cost compared to existing methods and achieve high quality of join results.
引用
收藏
页码:448 / 452
页数:5
相关论文
共 50 条
  • [1] Leveraging Online Populations for Crowdsourcing
    Chi, Ed H.
    Bernstein, Michael S.
    [J]. IEEE INTERNET COMPUTING, 2012, 16 (05) : 10 - 12
  • [2] Leveraging Crowdsourcing for the Thematic Annotation of the Qur'an
    Basharat, Amna
    Arpinar, I. Budak
    Rasheed, Khaled
    [J]. PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16 COMPANION), 2016, : 13 - 14
  • [3] Leveraging Peer Communication to Enhance Crowdsourcing
    Tang, Wei
    Ho, Chien-Ju
    Yin, Ming
    [J]. WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 1794 - 1805
  • [4] Leveraging Crowdsourcing to Facilitate the Discovery of New Medicines
    Norman, Thea C.
    Bountra, Chas
    Edwards, Aled M.
    Yamamoto, Keith R.
    Friend, Stephen H.
    [J]. SCIENCE TRANSLATIONAL MEDICINE, 2011, 3 (88)
  • [5] Leveraging crowdsourcing to accelerate global health solutions
    Davis, Sage
    Button-Simons, Katrina
    Bensellak, Taoufik
    Ahsen, Eren Mehmet
    Checkley, Lisa
    Foster, Gabriel J.
    Su, Xinzhuan
    Moussa, Ahmed
    Mapiye, Darlington
    Khoo, Sok Kean
    Nosten, Francois
    Anderson, Timothy J. C.
    Vendrely, Katelyn
    Bletz, Julie
    Yu, Thomas
    Panji, Sumir
    Ghouila, Amel
    Mulder, Nicola
    Norman, Thea
    Kern, Steven
    Meyer, Pablo
    Stolovitzky, Gustavo
    Ferdig, Michael T.
    Siwo, Geoffrey H.
    [J]. NATURE BIOTECHNOLOGY, 2019, 37 (08) : 848 - 850
  • [6] Leveraging crowdsourcing to accelerate global health solutions
    Sage Davis
    Katrina Button-Simons
    Taoufik Bensellak
    Eren Mehmet Ahsen
    Lisa Checkley
    Gabriel J. Foster
    Xinzhuan Su
    Ahmed Moussa
    Darlington Mapiye
    Sok Kean Khoo
    Francois Nosten
    Timothy J. C. Anderson
    Katelyn Vendrely
    Julie Bletz
    Thomas Yu
    Sumir Panji
    Amel Ghouila
    Nicola Mulder
    Thea Norman
    Steven Kern
    Pablo Meyer
    Gustavo Stolovitzky
    Michael T. Ferdig
    Geoffrey H. Siwo
    [J]. Nature Biotechnology, 2019, 37 : 848 - 850
  • [7] Leveraging non-expert crowdsourcing workers for improper task detection in crowdsourcing marketplaces
    Baba, Yukino
    Kashima, Hisashi
    Kinoshita, Kei
    Yamaguchi, Goushi
    Akiyoshi, Yosuke
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (06) : 2678 - 2687
  • [8] Effectively Leveraging Attributes for Visual Similarity
    Mishra, Samarth
    Zhang, Zhongping
    Shen, Yuan
    Kumar, Ranjitha
    Saligrama, Venkatesh
    Plummer, Bryan A.
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 995 - 1004
  • [9] Effectively Leveraging Attributes for Visual Similarity
    Mishra, Samarth
    Zhang, Zhongping
    Shen, Yuan
    Kumar, Ranjitha
    Saligrama, Venkatesh
    Plummer, Bryan
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3899 - 3904
  • [10] The Spectral Game: leveraging Open Data and crowdsourcing for education
    Bradley, Jean-Claude
    Lancashire, Robert J.
    Lang, Andrew S. I. D.
    Williams, Antony J.
    [J]. JOURNAL OF CHEMINFORMATICS, 2009, 1