Crowd-sourcing Web Knowledge for Metadata Extraction

被引:0
|
作者
Wu, Zhaohui [1 ]
Huang, Wenyi [2 ]
Liang, Chen [2 ]
Giles, C. Lee [1 ,2 ]
机构
[1] Penn State Univ, Comp Sci & Engn, University Pk, PA 16802 USA
[2] Penn State Univ, Informat Sci & Technol, University Pk, PA 16802 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We explore a new metadata extraction framework without human annotators with the ground truth harvested from Web. A new training sample is selected based on not only the uncertainty and representativeness in the unlabeled pool, but also on its availability and credibility in Web knowledge bases. We construct a dataset of 4329 books with valid metadata and evaluate our approach using 5 Web book databases as oracles. Empirical results demonstrate its effectiveness and efficiency.
引用
收藏
页码:141 / 144
页数:4
相关论文
共 50 条
  • [41] Program Boosting: Program Synthesis via Crowd-Sourcing
    Cochran, Robert A.
    D'Antoni, Loris
    Livshits, Benjamin
    Molnar, David
    Veanes, Margus
    [J]. ACM SIGPLAN NOTICES, 2015, 50 (01) : 677 - 688
  • [42] The GEP: Crowd-Sourcing Big Data Analysis with Undergraduates
    Elgin, Sarah C. R.
    Hauser, Charles
    Holzen, Teresa M.
    Jones, Christopher
    Kleinschmit, Adam
    Leatherman, Judith
    [J]. TRENDS IN GENETICS, 2017, 33 (02) : 81 - 85
  • [43] Crowd-sourcing Home Energy Efficiency Measurement System
    Son, Young-Sung
    Han, Hyonyung
    Jo, Jun
    Park, Jun-Hee
    [J]. 2015 INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC), 2015, : 1272 - 1275
  • [44] Collaboration Trumps Homophily in Urban Mobile Crowd-sourcing
    Kandappu, Thivya
    Misra, Archan
    Tandriansyah, Randy
    [J]. CSCW'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, 2017, : 902 - 915
  • [45] CROWD-SOURCING PARENTAL PREFERENCE ASSESSMENTS FOR VESICOURETERAL REFLUX
    Dionise, Zachary
    Garcia-Roig, Michael
    Kirsch, Andrew
    Routh, Jonathan
    [J]. JOURNAL OF UROLOGY, 2018, 199 (04): : E589 - E590
  • [46] Crowd-sourcing and author submission as alternatives to professional curation
    Karp, Peter D.
    [J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016,
  • [47] LingoBee: Engaging Mobile Language Learners Through Crowd-Sourcing
    Petersen, Sobah Abbas
    Procter-Legg, Emma
    Cacchione, Annamaria
    [J]. INTERNATIONAL JOURNAL OF MOBILE AND BLENDED LEARNING, 2014, 6 (02) : 58 - 73
  • [48] Learning motion primitives and annotative texts from crowd-sourcing
    Takano W.
    [J]. ROBOMECH Journal, 2 (1):
  • [49] Teaching global health using crowd-sourcing with Missing Maps
    Schwerdtle, Patricia
    Herfort, Benjamin
    [J]. NURSE EDUCATION TODAY, 2018, 60 : 1 - 2
  • [50] Pronunciation Learning for Named-Entities through Crowd-Sourcing
    Rutherford, Attapol T.
    Peng, Fuchun
    Beaufays, Francoise
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1448 - 1452