Relation Extraction from Chinese News Web Documents Based on Weakly Supervised Learning

被引:0
|
作者
Qiu, Jing [1 ]
Liao, Lejian [1 ]
Li, Peng [1 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci, Beijing Lab Intelligent Informat Technol, Beijing 100081, Peoples R China
关键词
Relation extraction; Kernel method; Machine learning; Weakly supervised;
D O I
10.1109/INCOS.2009.14
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Extracting instances of a given target relation from a given Web page corpus seems to be the basic work to exploit nearly endless source of knowledge which provided by the World Wide Web. Supervised learning requires a large amount of labeled data, but the data labeling process can be expensive and time consuming. In this paper we present a kernel-based weakly supervised machine learning algorithm for relation extraction. It takes a small set of target relations as input. The goal is to automatically extract arbitrary binary relations from Web documents in the domain of football games. Bootstrapping is used to improve the performance of the system. We also compare the performances on different input example sizes. Experimental results show the effectiveness and benefits of our approach.
引用
收藏
页码:219 / 225
页数:7
相关论文
共 50 条
  • [1] Attribute extraction of chinese online encyclopedia based on weakly supervised learning
    Jia, Zhen
    Yang, Yan
    He, Da-Ke
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2014, 43 (05): : 758 - 763
  • [2] Unsupervised Relation Extraction from Web Documents
    Eichler, Kathrin
    Hemsen, Holmer
    Neumann, Guenter
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1674 - 1679
  • [3] A Semi-automated Entity Relation Extraction Mechanism with Weakly Supervised Learning for Chinese Medical Webpages
    Liu, Zhao
    Tong, Jian
    Gu, Jinguang
    Liu, Kai
    Hu, Bo
    SMART HEALTH, ICSH 2016, 2017, 10219 : 44 - 56
  • [4] Towards Tabular Data Extraction From Richly-Structured Documents Using Supervised and Weakly-Supervised Learning
    Chowdhury, Arnab Ghosh
    ben Ahmed, Martin
    Atzmueller, Martin
    2022 IEEE 27TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2022,
  • [5] Automatic keyphrase extraction from chinese news documents
    Wang, HF
    Li, SJ
    Yu, SW
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 2, PROCEEDINGS, 2005, 3614 : 648 - 657
  • [6] A Feature-Based Approach for Relation Extraction from Thai News Documents
    Tonatep, Nattapong
    Theeramunkong, Thanaruk
    INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2009, 5477 : 149 - 154
  • [7] Weakly supervised spatial relation extraction from radiology reports
    Datta, Surabhi
    Roberts, Kirk
    JAMIA OPEN, 2023, 6 (02)
  • [8] Relation extraction with weakly supervised learning based on process-structure-property-performance reciprocity
    Onishi, Takeshi
    Kadohira, Takuya
    Watanabe, Ikumu
    SCIENCE AND TECHNOLOGY OF ADVANCED MATERIALS, 2018, 19 (01) : 649 - 659
  • [9] Self-supervised relation extraction from the Web
    Rozenfeld, Benjamin
    Feldman, Ronen
    KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 17 (01) : 17 - 33
  • [10] Self-supervised relation extraction from the Web
    Benjamin Rozenfeld
    Ronen Feldman
    Knowledge and Information Systems, 2008, 17 : 17 - 33