Crowd-sourcing Web Knowledge for Metadata Extraction

Citations: 0
Authors
Wu, Zhaohui [1]
Huang, Wenyi [2]
Liang, Chen [2]
Giles, C. Lee [1, 2]
Affiliations
[1] Penn State Univ, Comp Sci & Engn, University Pk, PA 16802 USA
[2] Penn State Univ, Informat Sci & Technol, University Pk, PA 16802 USA
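Keywords
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract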
We explore a new metadata extraction framework that requires no human annotators, with the ground truth harvested from the Web. A new training sample is selected based not only on its uncertainty and representativeness in the unlabeled pool, but also on its availability and credibility in Web knowledge bases. We construct a dataset of 4329 books with valid metadata and evaluate our approach using 5 Web book databases as oracles. Empirical results demonstrate its effectiveness and efficiency.
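The abstract names four selection criteria but does not say how they are combined. The sketch below is one hypothetical way to rank candidates, assuming each criterion is already a score in [0, 1] and that the four scores are simply multiplied. The `Candidate` class, the field names, the product rule, and the example values are all illustrative assumptions, not the authors' formulation.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """An unlabeled sample with four hypothetical scores, each in [0, 1]."""
    name: str
    uncertainty: float         # how unsure the current model is about this sample
    representativeness: float  # how typical the sample is of the unlabeled pool
    availability: float        # assumed: share of Web oracles returning a record
    credibility: float         # assumed: agreement among the returned records


def selection_score(c: Candidate) -> float:
    # Assumed combination rule: a plain product, so a sample that no
    # Web source can label (availability == 0) is never selected.
    return c.uncertainty * c.representativeness * c.availability * c.credibility


def select_next(pool: list[Candidate]) -> Candidate:
    # Pick the candidate with the highest combined score.
    return max(pool, key=selection_score)


# Made-up values for illustration only.
pool = [
    Candidate("book_a", 0.9, 0.8, 0.0, 0.0),  # informative, but no Web record
    Candidate("book_b", 0.6, 0.7, 0.8, 0.9),
    Candidate("book_c", 0.3, 0.9, 1.0, 1.0),  # easy to label, less informative
]
print(select_next(pool).name)
```

With a product rule, the most uncertain sample is skipped when no Web knowledge base can supply its ground truth, which is the trade-off the abstract describes between informativeness and oracle availability.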
Pages: 141-144
Page count: 4