An Open Relation Extraction System for Web Text Information

Cited by: 2
Authors
Li, Huagang [1]
Liu, Bo [1]
Affiliations
[1] Natl Univ Def Technol, Coll Comp Sci & Technol, Changsha 410073, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2022, Volume 12, Issue 11
Keywords
open relation extraction; few-shot learning; knowledge extraction; tBERT; K-BERT;
DOI
10.3390/app12115718
Chinese Library Classification
O6 [Chemistry];
Discipline classification code
0703 ;
Abstract
Web texts typically undergo the open-ended growth of new relations. Traditional relation extraction methods lack automatic annotation and perform poorly on new relation extraction tasks. We propose an open-domain relation extraction system (ORES) based on distant supervision and few-shot learning to solve this problem. More specifically, we utilize tBERT to design instance selector 1, implementing automatic labeling in the data mining component. Meanwhile, we design example selector 2 based on K-BERT in the new relation extraction component. The real-time data management component outputs new relational data. Experiments show that ORES can filter out higher quality and diverse instances for better new relation learning. It achieves significant improvement compared to Neural Snowball with fewer seed sentences.
Pages: 19