Semi-automatic Dictionary Curation for Domain-specific Ontologies

被引:0
|
作者
Kulkarni, Ashish [1 ]
Gavankar, Chetana [1 ]
Ramakrishnan, Ganesh [1 ]
Raghavan, Sriram [2 ]
机构
[1] Indian Inst Technol, Bombay, Maharashtra, India
[2] IBM Res Lab, New Delhi, India
关键词
INFORMATION EXTRACTION; WEB;
D O I
10.1109/ICTAI.2013.112
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Within the broad area of information extraction, we study the problem of effective dictionary curation in an enterprise setting. Equipped with an ontology, representative of the domain of an enterprise, our approach populates the attributes of leaf nodes of the ontology with instances extracted from the enterprise corpus. For an attribute of interest, given a few seed examples or indicative features for the attribute, we first obtain a ranked list of 'list pages' potentially containing additional dictionary terms. Our ranking model ranks pages from the enterprise corpus based on their 'fist' content using several visual and lexical features. We gather users' judgement of the result pages and the model continuously learns from this feedback. We compare different techniques of dictionary curation using rule based extractors and visual features of pages. Based on rule writing exercise, we show the benefit of dictionaries for leaf node attributes, in writing rule based extractors for higher level nodes in an ontology. We have implemented a dictionary curation system based on these ideas. Experimental analysis using academic domain ontology and universities corpora, reveal (in the context of enterprise analytics) (i) the merit of dictionary support in rule based information extraction (ii) the viability and effectiveness of an interactive approach for dictionary creation.
引用
收藏
页码:727 / 734
页数:8
相关论文
共 50 条
  • [1] ParAgent: A domain-specific semi-automatic parallelization tool
    Mitra, S
    Kothari, SC
    Cho, J
    Krishnaswamy, A
    [J]. HIGH PERFORMANCE COMPUTING - HIPC 2000, PROCEEDINGS, 2001, 1970 : 141 - 148
  • [2] Semi-automatic derivation of specific-domain ontologies for the semantic web
    Pulido, J. R. G.
    Arechiga, M. A.
    Acosta, R.
    Reyes, P. D.
    Legrand, S.
    [J]. MICAI 2006: FIFTH MEXICAN INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, : 253 - 261
  • [3] Semi-automatic extraction of multiword terms from domain-specific corpora
    Pajic, Vesna
    Stankovic, Stasa Vujicic
    Stankovic, Ranka
    Pajic, Milos
    [J]. ELECTRONIC LIBRARY, 2018, 36 (03): : 550 - 567
  • [4] On automatic modeling and use of domain-specific ontologies
    Andreasen, T
    Bulskov, H
    Knappe, R
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2005, 3488 : 74 - 82
  • [5] Semi-Automatic Approaches for Exploiting Shifter Patterns in Domain-Specific Sentiment Analysis
    Brazdil, Pavel
    Muhammad, Shamsuddeen H.
    Oliveira, Fatima
    Cordeiro, Joao
    Silva, Fatima
    Silvano, Purificacao
    Leal, Antonio
    [J]. MATHEMATICS, 2022, 10 (18)
  • [6] Semi-automatic construction of topic ontologies
    Fortuna, Blaz
    Mladenic, Dunja
    Grobelnik, Marko
    [J]. SEMANTICS, WEB AND MINING, 2006, 4289 : 121 - 131
  • [7] Semi-automatic hardware design using ontologies
    Hu, H
    Liu, DY
    Du, XY
    [J]. 2004 8TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1-3, 2004, : 792 - 797
  • [8] Rapid synthesis of domain-specific web search engines based on semi-automatic training-example generation
    Nabeshima, Hidetomo
    Miyagawa, Reiko
    Suzuki, Yuki
    Iwanuma, Koji
    [J]. 2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 769 - +
  • [9] An Automatic Approach for Domain-specific Dictionary Expansion Based on Web Mining
    Sun, Yueheng
    Ni, Weijie
    Men, Rui
    [J]. 2009 SECOND INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING: KAM 2009, VOL 2, 2009, : 96 - 99
  • [10] Using Ontologies in the Domain Analysis of Domain-Specific Languages
    Tairas, Robert
    Mernik, Marjan
    Gray, Jeff
    [J]. MODELS IN SOFTWARE ENGINEERING, 2009, 5421 : 332 - +