Leveraging Attributes and Crowdsourcing for Join

被引:0
|
作者
Feng, Jianhong [1 ]
Feng, Jianhua [1 ]
Hu, Huiqi [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci, Beijing 100084, Peoples R China
来源
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Join operation is usually hard to achieve high quality with machine alone. We adopt crowdsourcing to improve the quality of join. Depending on the number of generated pairs, the overall cost can be expensive for hiring workers to do the verification. We propose a hybrid approach to generate pairs by leveraging attributes, which combines category, sorting and clustering techniques, called CSCER. We also propose an adaptive attribute-selection strategy to efficiently generate pairs based on attributes. Experiments on a real crowdsourcing platform using real datasets indicate that our approaches save the overall cost compared to existing methods and achieve high quality of join results.
引用
收藏
页码:448 / 452
页数:5
相关论文
共 50 条
  • [41] Crowdsourcing based API Search via Leveraging Twitter Lists Information
    Liang, Tingting
    Chen, Liang
    Ying, Haochao
    Zheng, Zibin
    Wu, Jian
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 1540 - 1547
  • [42] Leveraging Neighbor Attributes for Classification in Sparsely Labeled Networks
    McDowell, Luke K.
    Aha, David W.
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2016, 11 (01)
  • [43] Leveraging deletion neighborhoods and trie for efficient string similarity search and join
    Cui, Jia
    Meng, Dan
    Chen, Zhong-Tao
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8870
  • [44] Leveraging spatial join for robust tuple extraction from web pages
    Han, Wook-Shin
    Kwak, Wooseong
    Yu, Hwanjo
    Lee, Jeong-Hoon
    Kim, Min-Soo
    [J]. INFORMATION SCIENCES, 2014, 261 : 132 - 148
  • [46] Leveraging Deletion Neighborhoods and Trie for Efficient String Similarity Search and Join
    Cui, Jia
    Meng, Dan
    Chen, Zhong-Tao
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, AIRS 2014, 2014, 8870 : 1 - 13
  • [47] A Typology of Crowd Configurations Based on Crowd Attributes and Their Impacts on Crowdsourcing Outcomes
    He, Hee Rui
    [J]. IEEE ACCESS, 2022, 10 : 88178 - 88190
  • [48] Aligning the crowdsourcing type with the problem attributes to improve solution search efficacy
    Gurca, Andrei
    Bagherzadeh, Mehdi
    Velayati, Rezvan
    [J]. TECHNOVATION, 2023, 119
  • [49] Quality Attributes Analysis in a Crowdsourcing-based Emergency Management System
    Amorim, Ana Maria
    Boechat, Glaucya
    Novais, Renato
    Vieira, Vaninha
    Villela, Karina
    [J]. ICEIS: PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS - VOL 2, 2017, : 501 - 509
  • [50] Leveraging Crowdsourcing Data For Deep Active Learning An Application: Learning Intents in Alexa
    Yang, Jie
    Drake, Thomas
    Damianou, Andreas
    Maarek, Yoelle
    [J]. WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, : 23 - 32