Temporal knowledge extraction from large-scale text corpus

被引:8
|
作者
Liu, Yu [1 ]
Hua, Wen [1 ]
Zhou, Xiaofang [1 ]
机构
[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld, Australia
关键词
Temporal knowledge harvesting; Temporal patterns; Temporal facts; Knowledge base; BASE;
D O I
10.1007/s11280-020-00836-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Knowledge, in practice, is time-variant and many relations are only valid for a certain period of time. This phenomenon highlights the importance of harvesting temporal-aware knowledge, i.e., the relational facts coupled with their valid temporal interval. Inspired by pattern-based information extraction systems, we resort to temporal patterns to extract time-aware knowledge from free text. However, pattern design is extremely laborious and time consuming even for a single relation, and free text is usually ambiguous which makes temporal instance extraction extremely difficult. Therefore, in this work, we study the problem of temporal knowledge extraction with two steps: (1) temporal pattern extraction by automatically analysing a large-scale text corpus with a small number of seed temporal facts, (2) temporal instance extraction by applying the identified temporal patterns. For pattern extraction, we introduce various techniques, including corpus annotation, pattern generation, scoring and clustering, to improve both accuracy and coverage of the extracted patterns. For instance extraction, we propose a double-check strategy to improve the accuracy and a set of node-extension rules to improve the coverage. We conduct extensive experiments on real world datasets and compared with state-of-the-art systems. Experimental results verify the effectiveness of our proposed methods for temporal knowledge harvesting.
引用
收藏
页码:135 / 156
页数:22
相关论文
共 50 条
  • [1] Temporal knowledge extraction from large-scale text corpus
    Yu Liu
    Wen Hua
    Xiaofang Zhou
    [J]. World Wide Web, 2021, 24 : 135 - 156
  • [2] Extracting Temporal Patterns from Large-Scale Text Corpus
    Liu, Yu
    Hua, Wen
    Zhou, Xiaofang
    [J]. DATABASES THEORY AND APPLICATIONS (ADC 2019), 2019, 11393 : 17 - 30
  • [3] Large-Scale Extraction and Use of Knowledge from Text
    Clark, Peter
    Harrison, Phil
    [J]. K-CAP'09: PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON KNOWLEDGE CAPTURE, 2009, : 153 - 160
  • [4] Causal Knowledge Extraction through Large-Scale Text Mining
    Hassanzadeh, Oktie
    Bhattacharjya, Debarun
    Feblowitz, Mark
    Srinivas, Kavitha
    Perrone, Michael
    Sohrabi, Shirin
    Katz, Michael
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13610 - 13611
  • [5] Automatic label curation from large-scale text corpus
    Avasthi, Sandhya
    Chauhan, Ritu
    [J]. ENGINEERING RESEARCH EXPRESS, 2024, 6 (01):
  • [6] iTextMine: integrated text-mining system for large-scale knowledge extraction from the literature
    Ren, Jia
    Li, Gang
    Ross, Karen
    Arighi, Cecilia
    McGarvey, Peter
    Rao, Shruti
    Cowart, Julie
    Madhavan, Subha
    Vijay-Shanker, K.
    Wu, Cathy H.
    [J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2018,
  • [7] Simple Large-scale Relation Extraction from Unstructured Text
    Christodoulopoulos, Christos
    Mittal, Arpit
    [J]. PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 215 - 222
  • [8] Mining Large-scale Event Knowledge from Web Text
    Cao, Ya-nan
    Zhang, Peng
    Guo, Jing
    Guo, Li
    [J]. 2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2014, 29 : 478 - 487
  • [9] Feature Extraction for Large-Scale Text Collections
    Gallagher, Luke
    Mallia, Antonio
    Culpepper, J. Shane
    Suel, Torsten
    Cambazoglu, B. Barla
    [J]. CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 3015 - 3022
  • [10] Creating A Large-Scale Financial News Corpus for Relation Extraction
    Wu, Haoyu
    Lei, Qing
    Zhang, Xinyue
    Luo, Zhengqian
    [J]. 2020 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2020), 2020, : 259 - 263