Hierarchical text categorization using fuzzy relational thesaurus

被引:0
|
作者
Tikk, D [1 ]
Yang, JD
Bang, SL
机构
[1] Budapest Univ Technol & Econ, Dept Telecommun & Media Informat, H-1117 Budapest, Hungary
[2] Hungarian Lab, Intelligent Integrated Syst Japanese, H-1111 Budapest, Hungary
[3] Natl Univ, Dept Comp Sci, Chonju 561756, South Korea
关键词
text mining; knowledge base management; multi-level categorization; hieraxchical text categorization;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Text categorization is the classification to assign a text document to an appropriate category in a predefined set of categories. We present a new approach for the text categorization by means of Fuzzy Relational Thesaurus (FRT). FRT is a multilevel category system that stores and maintains adaptive local dictionary for each category. The goal of our approach is twofold; to develop a reliable text categorization method on a certain subject domain, and to expand the initial FRT by automatically added terms, thereby obtaining an incrementally defined knowledge base of the domain. We implemented the categorization algorithm and compared it with some other hierarchical classifiers. Experimental results have been shown that our algorithm outperforms its rivals on all document corpora investigated.
引用
收藏
页码:583 / 600
页数:18
相关论文
共 50 条
  • [21] Hierarchical Persian Text Categorization in Absence of Labeled Data
    Masoudian, Soheila
    Derhami, Vali
    Zarifzadeh, Sajjad
    2019 27TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2019), 2019, : 1951 - 1955
  • [22] A neural network model for hierarchical multilingual text categorization
    Chau, RN
    Yeh, CS
    Smith, KA
    ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 2, PROCEEDINGS, 2005, 3497 : 238 - 245
  • [23] Boosting multi-label hierarchical text categorization
    Esuli, Andrea
    Fagni, Tiziano
    Sebastiani, Fabrizio
    INFORMATION RETRIEVAL, 2008, 11 (04): : 287 - 313
  • [24] Boosting multi-label hierarchical text categorization
    Andrea Esuli
    Tiziano Fagni
    Fabrizio Sebastiani
    Information Retrieval, 2008, 11 : 287 - 313
  • [25] An algorithms of document categorization using fuzzy relations and hierarchical structure between documents
    Han, SW
    Eun, HJ
    Kim, YS
    László, TK
    WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XVII, PROCEEDINGS: CYBERNETICS AND INFORMATICS: CONCEPTS AND APPLICATIONS (PT II), 2001, : 362 - 368
  • [26] Using WordNet for text categorization
    Elberrichi, Zakaria
    Rahmoun, Abdelattif
    Bentaalah, Mohamed Amine
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2008, 5 (01) : 16 - 24
  • [27] Hierarchical taxonomy preparation for text categorization using consistent bipartite spectral graph copartitioning
    Gao, B
    Liu, TY
    Feng, G
    Qin, T
    Cheng, QS
    Ma, WY
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (09) : 1263 - 1273
  • [28] Using SVMs for text categorization
    Dumais, S
    IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1998, 13 (04): : 21 - 23
  • [29] Using Thesaurus to Improve Multiclass Text Classification
    Maghsoodi, Nooshin
    Homayounpour, Mohammad Mehdi
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PT II, 2011, 6609 : 244 - 253
  • [30] Multilabel Text Categorization Based on Fuzzy Relevance Clustering
    Lee, Shie-Jue
    Jiang, Jung-Yi
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2014, 22 (06) : 1457 - 1471