Creating Knowledge Base from Automatically Extracted Information

被引:0
|
作者
Nachyla, Beata [1 ]
机构
[1] Warsaw Univ Technol, Inst Comp Sci, PL-00665 Warsaw, Poland
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article we present a self-learning method for discovering the domain specific knowledge contained in a set of text documents. The method assumes that contents of the input documents have tagged domain-relevant information. The information is tagged with labels from a prespecified set. The method counts the co-occurrences of various sequences of the labels in a sentence and represents them in form of a data structure called a Prefix Label Tree. In order to extract knowledge from a given document, we use a hierarchical clustering method to group the labels contained within the document's content. In order to calculate similarity of clusters during the clustering process, we also propose a measure called the Relation Possibility (RP).
引用
收藏
页码:608 / 617
页数:10
相关论文
共 50 条
  • [41] Using a Knowledge Base to Automatically Annotate Speech Corpora and to Identify Sociolinguistic Variation
    Wu, Yaru
    Suchanek, Fabian
    Vasilescu, Ioana
    Lamel, Lori
    Adda-Decker, Martine
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1054 - 1060
  • [42] TestDossier: A Dataset of Tested Values Automatically Extracted from Test Execution
    Hora, Andre
    2024 IEEE/ACM 21ST INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES, MSR, 2024, : 299 - 303
  • [43] Towards Classification of Loop Idioms Automatically Extracted from Legacy Systems
    Okada, Joji
    Ishio, Takashi
    Sakata, Yuji
    Inoue, Katsuro
    2019 IEEE 13TH INTERNATIONAL WORKSHOP ON SOFTWARE CLONES (IWSC '19), 2019, : 34 - 35
  • [44] Mapping urban fingerprints of odonyms automatically extracted from French novels
    Moncla, Ludovic
    Gaio, Mauro
    Joliveau, Thierry
    Le Lay, Yves-Francois
    Boeglin, Noemie
    Mazagol, Pierre-Olivier
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2019, 33 (12) : 2477 - 2497
  • [45] Boosting Knowledge Base Automatically via Few-Shot Relation Classification
    Pang, Ning
    Tan, Zhen
    Xu, Hao
    Xiao, Weidong
    FRONTIERS IN NEUROROBOTICS, 2020, 14
  • [46] ConNeKTion: A Tool for Handling Conceptual Graphs Automatically Extracted from Text
    Leuzzi, Fabio
    Ferilli, Stefano
    Rotella, Fulvio
    BRIDGING BETWEEN CULTURAL HERITAGE INSTITUTIONS, 2014, 385 : 93 - 104
  • [47] Creating Intelligent Linking for Information Threading in Knowledge Networks
    Nair, T. R. Gopalakrishnan
    Malhotra, Meenakshi
    2011 ANNUAL IEEE INDIA CONFERENCE (INDICON-2011): ENGINEERING SUSTAINABLE SOLUTIONS, 2011,
  • [48] Extracting information automatically from biological literature
    Blaschke, C
    Hoffmann, R
    Oliveros, JC
    Valencia, A
    COMPARATIVE AND FUNCTIONAL GENOMICS, 2001, 2 (05): : 310 - 313
  • [49] Automatically Enriching a Thesaurus with Information from Dictionaries
    Oliveira, Hugo Goncalo
    Gomes, Paulo
    PROGRESS IN ARTIFICIAL INTELLIGENCE-BOOK, 2011, 7026 : 462 - 475
  • [50] Learning Tool for Applying Static Vulnerability Analysis of Office Documents Based On Automatically Extracted Information from the Associated Macro Code
    Radescu, Radu
    Rosu, Malina Andreea
    PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON VIRTUAL LEARNING (ICVL-2020), 2020, : 417 - 423