Eksairesis: A Domain-adaptable System for Ontology Building from Unstructured Text

被引:0
|
作者
Kermanidis, K. L. [1 ]
Thanopoulos, A. [2 ]
Maragoudakis, M. [3 ]
Fakotakis, N. [2 ]
机构
[1] Ionian Univ, Dept Informat, 7 Pl Tsirigoti, Corfu 49100, Greece
[2] Univ Patras, Dept Elect & Comp Engn, Wire Commun Lab, Rion 26500, Greece
[3] Univ Aegean, Dept Informat & Commun Syst Engn, Samos, Greece
关键词
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
This paper describes Eksairesis, a system for learning economic domain knowledge automatically from Modern Greek text. The knowledge is in the form of economic terms and the semantic relations that govern them. The entire process in based on the use of minimal language-dependent tools, no external linguistic resources, and merely free, unstructured text. The methodology is thereby easily portable to other domains and other languages. The text is pre-processed with basic morphological annotation, and semantic (named and other) entities are identified using supervised learning techniques. Statistical filtering, i.e. corpora comparison is used to extract domain terms and supervised learning is again employed to detect the semantic relations between pairs of terms. Advanced classification schemata, ensemble learning, and one-sided sampling, are experimented with in order to deal with the noise in the data, which is unavoidable due to the low pre-processing level and the lack of sophisticated resources. An average f-score of 68,5% over all the classes is achieved when learning semantic relations. Bearing in mind the use of minimal resources and the highly automated nature of the process, classification performance is very promising, compared to results reported in previous work.
引用
收藏
页码:565 / 572
页数:8
相关论文
共 50 条
  • [1] Automatic Ontology Learning from Domain-specific Short Unstructured Text Data
    Xu, Yiming
    Rajpathak, Dnyanesh
    Gibbs, Ian
    Klabjan, Diego
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KMIS), VOL 3, 2020, : 29 - 39
  • [2] Arabic ontology extraction model from unstructured text
    Saber, Yasser Mohamed
    Abdel-Galil, Hala
    Belal, Mohamed Abd El -Fatah
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 6066 - 6076
  • [3] Linea: Building Timelines from Unstructured Text
    Etiene, Tiago
    Pagliosa, Paulo
    Nonato, Luis Gustavo
    [J]. 2015 28TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES, 2015, : 234 - 241
  • [4] ProxMetrics: modular proxemic similarity toolkit to generate domain-adaptable indicators from social media
    Masson, Maxime
    Roose, Philippe
    Sallaberry, Christian
    Bessagnet, Marie-Noelle
    Le Parc Lacayrelle, Annig
    Agerri, Rodrigo
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
  • [5] A Multidomain Layered Approach in Development of Industrial Ontology to Support Domain Identification for Unstructured Text
    Kumaravel, Rajbabu
    Selvaraj, Sudha
    Mala, C.
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (09) : 4033 - 4044
  • [6] Unsupervised Domain Ontology Learning from Text
    Venu, Sree Harissh
    Mohan, Vignesh
    Urkalan, Kodaikkaavirinaadan
    Geetha, T., V
    [J]. MINING INTELLIGENCE AND KNOWLEDGE EXPLORATION (MIKE 2016), 2017, 10089 : 132 - 143
  • [7] A Review for Domain Ontology Construction from Text
    Ren, Fei-Liang
    Shen, Ji-Kun
    Sun, Bin-Bin
    Zhu, Jing-Bo
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (03): : 654 - 676
  • [8] A crowdsourcing approach to building a legal ontology from text
    Getman, Anatoly
    Karasiuk, Volodymyr
    [J]. ARTIFICIAL INTELLIGENCE AND LAW, 2014, 22 (03) : 313 - 335
  • [9] Text onto miner - A semi automated ontology building system
    Gawrysiak, Piotr
    Protaziuk, Grzegorz
    Rybinski, Henryk
    Delteil, Alexandre
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2008, 4994 : 563 - +
  • [10] An Ontology-Based Text Mining Method to Develop D-Matrix from Unstructured Text
    Rajpathak, Dnyanesh G.
    Singh, Satnam
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2014, 44 (07): : 966 - 977