Incremental information extraction using tree-based context representations

被引:0
|
作者
Siefkes, C [1 ]
机构
[1] Free Univ Berlin, Berlin Brandenburg Grad Sch Distributed Informat, Database & Informat Syst Grp, D-14195 Berlin, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The purpose of information extraction (IE) is to find desired pieces of information in natural language texts and store them in a form that is suitable for automatic processing. Providing annotated training data to adapt a trainable IE system to a new domain requires a considerable amount of work. To address this, we explore incremental learning. Here training documents are annotated sequentially by a user and immediately incorporated into the extraction model. Thus the system can support the user by proposing extractions based on the current extraction model, reducing the workload of the user over time. We introduce an approach to modeling IE as a token classification task that allows incremental training. To provide sufficient information to the token classifiers, we use rich, tree-based context representations of each token as feature vectors. These representations make use of the heuristically deduced document structure in addition to linguistic and semantic information. We consider the resulting feature vectors as ordered and combine proximate features into more expressive joint features, called "Orthogonal Sparse Bigrams" (OSB). Our results indicate that this setup makes it possible to employ IE in an incremental fashion without a serious performance penalty.
引用
收藏
页码:510 / 521
页数:12
相关论文
共 50 条
  • [1] Boosted incremental tree-based imputation of missing data
    Siciliano, Roberta
    Aria, Massimo
    D'Ambrosio, Antonio
    DATA ANALYSIS, CLASSIFICATION AND THE FORWARD SEARCH, 2006, : 271 - +
  • [2] Context Tree-Based Image Contour Coding Using a Geometric Prior
    Zheng, Amin
    Cheung, Gene
    Florencio, Dinei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (02) : 574 - 589
  • [3] Morphological Filtering in Shape Spaces: Applications using Tree-Based Image Representations
    Xu, Yongchao
    Geraud, Thierry
    Najman, Laurent
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 485 - 488
  • [4] A Tree-Based Context Model for Object Recognition
    Choi, Myung Jin
    Torralba, Antonio
    Willsky, Alan S.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (02) : 240 - 252
  • [5] Robust tree-based incremental imputation method for data fusion
    D'Ambrosio, Antonio
    Aria, Massimo
    Siciliano, Roberta
    ADVANCES IN INTELLIGENT DATA ANALYSIS VII, PROCEEDINGS, 2007, 4723 : 174 - +
  • [6] Incremental fuzzy decision tree-based network forensic system
    Liu, ZQ
    Feng, DG
    COMPUTATIONAL INTELLIGENCE AND SECURITY, PT 2, PROCEEDINGS, 2005, 3802 : 995 - 1002
  • [7] Incremental Tree-Based Inference with Dependent Normalized Random Measures
    Lee, Juho
    Choi, Seungjin
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 33, 2014, 33 : 558 - 566
  • [8] Incremental Tree-Based Missing Data Imputation with Lexicographic Ordering
    Claudio Conversano
    Roberta Siciliano
    Journal of Classification, 2009, 26 : 361 - 379
  • [9] Incremental Tree-Based Missing Data Imputation with Lexicographic Ordering
    Conversano, Claudio
    Siciliano, Roberta
    JOURNAL OF CLASSIFICATION, 2009, 26 (03) : 361 - 379
  • [10] A framework of a tree-based Grid information service
    Chen, Y
    Li, Y
    Gong, ZX
    Zhu, QM
    2005 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING , VOL 2, PROCEEDINGS, 2005, : 255 - 256