Categorizing relational facts from the web with fuzzy rough sets

被引:5
|
作者
Bharadwaj, Aditya [1 ]
Ramanna, Sheela [1 ]
机构
[1] Univ Winnipeg, Dept Appl Comp Sci, Winnipeg, MB, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Text categorization; Relational facts; Semi-supervised learning; Fuzzy rough sets; Web mining;
D O I
10.1007/s10115-018-1250-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Significant advances have been made in automatically constructing knowledge bases of relational facts derived from web corpora. These relational facts are linguistic in nature and are represented as ordered pairs of nouns (Winnipeg, Canada) belonging to a category (City_Country). One major problem is that these facts are abundant but mostly unlabeled. Hence, semi-supervised learning approaches have been successful in building knowledge bases where a small number of labeled examples are used as seed (training) instances and a large number of unlabeled instances are learnt in an iterative fashion. In this paper, we propose a novel fuzzy rough set-based semi-supervised learning algorithm (FRL) for categorizing relational facts derived from a given corpus. The proposed FRL algorithm is compared with a tolerance rough set-based learner (TPL) and the coupled pattern learner (CPL). The same ontology derived from a subset of corpus from never ending language learner system was used in all of the experiments. This paper has demonstrated that the proposed FRL outperforms both TPL and CPL in terms of precision. The paper also addresses the concept drift problem by using mutual exclusion constraints. The contributions of this paper are: (i) introduction of a formal fuzzy rough model for relations, (ii) a semi-supervised learning algorithm, (iii) experimental comparison with other machine learning algorithms: TPL and CPL, and (iv) a novel application of fuzzy rough sets.
引用
收藏
页码:1695 / 1713
页数:19
相关论文
共 50 条
  • [1] Categorizing relational facts from the web with fuzzy rough sets
    Aditya Bharadwaj
    Sheela Ramanna
    [J]. Knowledge and Information Systems, 2019, 61 : 1695 - 1713
  • [2] Learning relational facts from the web: A tolerance rough set approach
    Sengoz, Cenker
    Ramanna, Sheela
    [J]. PATTERN RECOGNITION LETTERS, 2015, 67 : 130 - 137
  • [3] ROUGH FUZZY-SETS AND FUZZY ROUGH SETS
    DUBOIS, D
    PRADE, H
    [J]. INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 1990, 17 (2-3) : 191 - 209
  • [4] Extended rough fuzzy sets for Web search agent
    Rojanavasu, P
    Pinngern, O
    [J]. ITI 2003: PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2003, : 403 - 407
  • [5] Soft rough fuzzy sets and soft fuzzy rough sets
    Meng, Dan
    Zhang, Xiaohong
    Qin, Keyun
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2011, 62 (12) : 4635 - 4645
  • [6] Rough sets and relational learning
    Milton, RS
    Maheswari, VU
    Siromoney, A
    [J]. TRANSACTIONS ON ROUGH SETS I, 2004, 3100 : 321 - 337
  • [7] Relational data and rough sets
    Stepaniuk, Jaroslaw
    [J]. FUNDAMENTA INFORMATICAE, 2007, 79 (3-4) : 525 - 539
  • [8] Detection of Web site visitors based on fuzzy rough sets
    Javad Hamidzadeh
    Mahdieh Zabihimayvan
    Reza Sadeghi
    [J]. Soft Computing, 2018, 22 : 2175 - 2188
  • [9] Detection of Web site visitors based on fuzzy rough sets
    Hamidzadeh, Javad
    Zabihimayvan, Mahdieh
    Sadeghi, Reza
    [J]. SOFT COMPUTING, 2018, 22 (07) : 2175 - 2188
  • [10] Axiomatic systems for rough sets and fuzzy rough sets
    Liu, Guilong
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2008, 48 (03) : 857 - 867