Learning relational facts from the web: A tolerance rough set approach

被引:9
|
作者
Sengoz, Cenker [1 ]
Ramanna, Sheela [1 ]
机构
[1] Univ Winnipeg, Dept Appl Comp Sci, Winnipeg, MB R3B 2E9, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Tolerance rough sets; Granular methodologies; Web mining; Semi-supervised learning;
D O I
10.1016/j.patrec.2014.12.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A key issue when mining web information is the labeling problem: data are abundant on the web but is unlabeled. In this paper, we address this problem by proposing (i) a granular model that structures categorical noun phrase instances as well as semantically related noun phrase pairs from a given corpus representing unstructured web pages with a tolerance form of rough sets, (ii) a semi-supervised Tolerant Pattern Learning (TPL) algorithm that labels categorical instances as well as relations. This work is an extension of the TPL algorithm presented in our earlier paper. Our model treats noun phrases, which are described as sets of their co-occurring contextual patterns. We use the ontological information from the Never Ending Language Learner (Nell) system. We compared the performance of our algorithm with Coupled Bayesian Sets (CBS) and Coupled Pattern Learner (CPL) algorithms for categorical and relational extractions, respectively. Experimental results suggest that TPL can achieve comparable performance with CBS and CPL in terms of precision. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:130 / 137
页数:8
相关论文
共 50 条
  • [1] LEARNING IN RELATIONAL DATABASES - A ROUGH SET APPROACH
    HU, XH
    CERCONE, N
    [J]. COMPUTATIONAL INTELLIGENCE, 1995, 11 (02) : 323 - 338
  • [2] Categorizing relational facts from the web with fuzzy rough sets
    Aditya Bharadwaj
    Sheela Ramanna
    [J]. Knowledge and Information Systems, 2019, 61 : 1695 - 1713
  • [3] Categorizing relational facts from the web with fuzzy rough sets
    Bharadwaj, Aditya
    Ramanna, Sheela
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 61 (03) : 1695 - 1713
  • [4] A tolerance rough set approach to clustering web search results
    Ngo, CL
    Nguyen, HS
    [J]. KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2004, PROCEEDINGS, 2004, 3202 : 515 - 517
  • [5] ON LEARNING - A ROUGH SET APPROACH
    PAWLAK, Z
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 1985, 208 : 197 - 227
  • [6] Research on statistical relational learning and rough set in SRL
    Chen, Fei
    [J]. GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 227 - 230
  • [7] Research on Statistical Relational Learning and Rough Set in SRL
    Fei Chen
    [J]. 南昌工程学院学报, 2006, (02) : 92 - 96
  • [8] A ROUGH SET APPROACH FOR WEB USAGE MINING
    Salem, Abdel-Badeeh M.
    Arafat, Shaimaa
    Khalifa, Wael H.
    [J]. MENDEL 2008, 2008, : 281 - 286
  • [9] Web query automatic expansion based on tolerance rough set
    Yi, GX
    Hu, HP
    [J]. 2005 JOINT INTERNATIONAL CONFERENCE ON AUTONOMIC AND AUTONOMOUS SYSTEMS AND INTERNATIONAL CONFERENCE ON NETWORKING AND SERVICES (ICAS/ICNS), 2005, : 488 - 492
  • [10] Tolerance Rough Set Model and its Applications in Web Intelligence
    Hung Son Nguyen
    [J]. 2013 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY - WORKSHOPS (WI-IAT), VOL 3, 2013, : 237 - 244