Hierarchical text categorization using fuzzy relational thesaurus

Authors
Tikk, D [1]
Yang, JD
Bang, SL
Affiliations
[1] Budapest Univ Technol & Econ, Dept Telecommun & Media Informat, H-1117 Budapest, Hungary
[2] Hungarian Lab, Intelligent Integrated Syst Japanese, H-1111 Budapest, Hungary
[3] Natl Univ, Dept Comp Sci, Chonju 561756, South Korea
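Keywords
text mining; knowledge base management; multi-level categorization; hierarchical text categorization
DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812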
Abstract
Text categorization is the task of assigning a text document to an appropriate category from a predefined set of categories. We present a new approach to text categorization by means of a Fuzzy Relational Thesaurus (FRT). An FRT is a multilevel category system that stores and maintains an adaptive local dictionary for each category. The goal of our approach is twofold: to develop a reliable text categorization method for a given subject domain, and to expand the initial FRT with automatically added terms, thereby obtaining an incrementally defined knowledge base of the domain. We implemented the categorization algorithm and compared it with several other hierarchical classifiers. Experimental results show that our algorithm outperforms its rivals on all document corpora investigated.
Pages: 583-600
Number of pages: 18