Adapting Open Information Extraction to Domain-Specific Relations

被引:38
|
作者
Soderland, Stephen [1 ]
Roof, Brendan [1 ]
Qin, Bo
Xu, Shi [1 ]
Mausam
Etzioni, Oren [1 ]
机构
[1] Univ Washington, Turing Ctr, Seattle, WA 98195 USA
基金
美国国家科学基金会;
关键词
D O I
10.1609/aimag.v31i3.2305
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Information extraction (IE) can identify a set of relations from free text to support question answering (QA). Until recently, IE systems were domain specific and needed a combination of manual engineering and supervised learning to adapt to each target domain. A new paradigm, Open IE, operates on large text corpora without any manual tagging of relations, and indeed without any prespecified relations. Due to its open-domain and open-relation nature, Open IE is purely textual and is unable to relate the surface forms to an ontology, if known in advance. We explore the steps needed to adapt Open IE to a domain-specific ontology and demonstrate our approach of mapping domainindependent tuples to an ontology using domains from the DARPA Machine Reading Project. Our system achieves precision over 0.90 from as few as eight training examples for an NFL-scoring domain. Copyright © 2010.
引用
收藏
页码:93 / 102
页数:10
相关论文
共 50 条
  • [1] Domain-specific information extraction structures
    Lyons, S
    Smith, D
    [J]. 13TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2002, : 80 - 84
  • [2] Prioritization of Domain-Specific Web Information Extraction
    Huang, Jian
    Yu, Cong
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 1327 - 1333
  • [3] Potential and Pitfalls of Domain-Specific Information Extraction at Web Scale
    Rheinlaender, Astrid
    Lehmann, Mario
    Kunkel, Anja
    Meier, Joerg
    Leser, Ulf
    [J]. SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 759 - 771
  • [4] Information Extraction of Domain-specific Business Documents with Limited Data
    Minh-Tien Nguyen
    Le Thai Linh
    Dung Tien Le
    Nguyen Hong Son
    Do Hoang Thai Duong
    Bui Cong Minh
    Akira Shojiguchi
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [5] Domain-specific keyphrase extraction
    Frank, E
    Paynter, GW
    Witten, IH
    Gutwin, C
    Nevill-Manning, CG
    [J]. IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, 1999, : 668 - 673
  • [6] Domain-Specific Paraphrase Extraction
    Pavlick, Ellie
    Ganitkevitch, Juri
    Chan, Tsz Ping
    Yao, Xuchen
    Van Durme, Benjamin
    Callison-Burch, Chris
    [J]. PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 57 - 62
  • [7] Extraction and Semantic Representation of Domain-Specific Relations in Spanish Labour Law
    Revenko, Artem
    Martin-Chozas, Patricia
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2022, (69): : 105 - 116
  • [8] The Open Domain-Specific Architecture
    Drucker, Kevin
    Jani, Dharmesh
    Agarwal, Ishwar
    Miller, Gary
    Mittal, Millind
    Wang, Robert
    Vinnakota, Bapiraju
    [J]. 2020 IEEE SYMPOSIUM ON HIGH-PERFORMANCE INTERCONNECTS (HOTI 2020), 2020, : 25 - 32
  • [9] The Open Domain-Specific Architecture
    Vinnakota, Bapi
    Agarwal, Ishwar
    Drucker, Kevin
    Jani, Dharmesh
    Miller, Gary
    Mittal, Millind
    Wang, Robert
    [J]. IEEE MICRO, 2021, 41 (01) : 30 - 36
  • [10] Scalable Document Image Information Extraction with Application to Domain-Specific Analysis
    Zheng, Yingbin
    Kong, Shuchen
    Zhu, Wanshan
    Ye, Hao
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 5108 - 5115