Crowd-sourcing Web Knowledge for Metadata Extraction

被引:0
|
作者
Wu, Zhaohui [1 ]
Huang, Wenyi [2 ]
Liang, Chen [2 ]
Giles, C. Lee [1 ,2 ]
机构
[1] Penn State Univ, Comp Sci & Engn, University Pk, PA 16802 USA
[2] Penn State Univ, Informat Sci & Technol, University Pk, PA 16802 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We explore a new metadata extraction framework without human annotators with the ground truth harvested from Web. A new training sample is selected based on not only the uncertainty and representativeness in the unlabeled pool, but also on its availability and credibility in Web knowledge bases. We construct a dataset of 4329 books with valid metadata and evaluate our approach using 5 Web book databases as oracles. Empirical results demonstrate its effectiveness and efficiency.
引用
收藏
页码:141 / 144
页数:4
相关论文
共 50 条
  • [1] JabberWocky: Crowd-Sourcing Metadata for Files
    Bhagwan, Varun
    Maltzahn, Carlos
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING, 2009, : 513 - +
  • [2] Crowd-Sourcing Creation
    Brunick, Paul
    [J]. FILM COMMENT, 2011, 47 (04) : 42 - 45
  • [3] Software CROWD-Sourcing
    Naik, Nitin
    [J]. 2017 11TH INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN INFORMATION SCIENCE (RCIS), 2017, : 463 - 464
  • [4] Integration of Computational and Crowd-Sourcing Methods for Ontology Extraction
    Lin, Huairen
    Davis, Joseph
    Zhou, Ying
    [J]. 2009 FIFTH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRID (SKG 2009), 2009, : 306 - 309
  • [5] Crowd-Sourcing Drug Discovery
    Bagla, Pallava
    [J]. SCIENCE, 2012, 335 (6071) : 909 - 909
  • [6] Crowd-Sourcing for Smart Cities
    Chowdhury, Srinjoy Nag
    Dhawan, Saniya
    Agnihotri, Akshay
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2016, : 360 - 365
  • [7] REMOTE SENSING AND CROWD-SOURCING
    Guida, Raffaella
    Brett, Peter T. B.
    Khan, Salman S.
    [J]. 2013 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2013, : 3942 - 3945
  • [8] Crowd-sourcing: Strength in numbers
    Philip Ball
    [J]. Nature, 2014, 506 : 422 - 423
  • [9] Crowd-sourcing prosodic annotation
    Cole, Jennifer
    Mahrt, Timothy
    Roy, Joseph
    [J]. COMPUTER SPEECH AND LANGUAGE, 2017, 45 : 300 - 325
  • [10] Crowd-Sourcing the Sounds of Places with a Web-Based Evolutionary Algorithm
    Brownlee, Alexander E., I
    Kim, Suk-Jun
    Wang, Szu-Han
    Chan, Stella
    Lawson, Jamie A.
    [J]. PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION), 2019, : 131 - 132