Rough Set Based Approach to Text Classification

被引:14
|
作者
Zhang, Libiao [1 ]
Li, Yuefeng [1 ]
Sun, Chao [1 ]
Nadee, Wanvimol [1 ]
机构
[1] Queensland Univ Technol, Fac Sci & Engn, Sch Elect Engn & Comp Sci, Brisbane, Qld 4001, Australia
关键词
Machine Learning; Text Classification; Feature Selection; Rough Set; Decision Making; CATEGORIZATION; SUPPORT;
D O I
10.1109/WI-IAT.2013.190
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Textual document set has become an important and rapidly growing information source in the web. Text classification is one of the crucial technologies for information organisation and management. Text classification has become more and more important and attracted wide attention of researchers from different research fields. In this paper, many feature selection methods, the implement algorithms and applications of text classification are introduced firstly. However, because there are much noise in the knowledge extracted by current data-mining techniques for text classification, it leads to much uncertainty in the process of text classification which is produced from both the knowledge extraction and knowledge usage, therefore, more innovative techniques and methods are needed to improve the performance of text classification. It has been a critical step with great challenge to further improve the process of knowledge extraction and effectively utilization of the extracted knowledge. Rough Set decision making approach is proposed to use Rough Set decision techniques to more precisely classify the textual documents which are difficult to separate by the classic text classification methods. The purpose of this paper is to give an overview of existing text classification technologies, to demonstrate the Rough Set concepts and the decision making approach based on Rough Set theory for building more reliable and effective text classification framework with higher precision, to set up an innovative evaluation metric named CEI which is very effective for the performance assessment of the similar research, and to propose a promising research direction for addressing the challenging problems in text classification, text mining and other relative fields.
引用
收藏
页码:245 / 252
页数:8
相关论文
共 50 条
  • [1] A rough set-based approach to text classification
    Chouchoulas, A
    Shen, Q
    [J]. NEW DIRECTIONS IN ROUGH SETS, DATA MINING, AND GRANULAR-SOFT COMPUTING, 1999, 1711 : 118 - 127
  • [2] Partition for the rough set-based text classification
    Bao, YG
    Asai, D
    Du, XY
    Ishii, N
    [J]. ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2003, 2762 : 181 - 188
  • [3] Rough set based hybrid algorithm for text classification
    Miao, Duoqian
    Duan, Qiguo
    Zhang, Hongyu
    Jiao, Na
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (05) : 9168 - 9174
  • [4] A rough set based hybrid approach for classification
    Hussein, Ahmed Saad
    Li, Tianrui
    Jaber, Noora Sabah
    Yohannese, Chubato Wondaferaw
    [J]. DATA SCIENCE AND KNOWLEDGE ENGINEERING FOR SENSING DECISION SUPPORT, 2018, 11 : 683 - 690
  • [5] Rule generation based on rough set theory for text classification
    Bi, YX
    Anderson, T
    McClean, S
    [J]. RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XVII, 2001, : 157 - 170
  • [6] An effective rough set-based method for text classification
    Bao, YG
    Asai, D
    Du, XY
    Yamada, K
    Ishii, N
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, 2003, 2690 : 545 - 552
  • [7] Rough Set Based Feature Selection Approach for Text Mining
    Sailaja, N. Venkata
    Sree, L. Padma
    Mangathayaru, N.
    [J]. PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2016, : 40 - 45
  • [8] A novel rough set approach for classification
    Li-Juan, Zhang
    Zhou-Jun, Li
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, 2006, : 349 - +
  • [9] A Rule-Based Classification Algorithm: A Rough Set Approach
    Liao, Chia-Chi
    Hsu, Kuo-Wei
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND CYBERNETICS (CYBERNETICSCOM), 2012, : 1 - 5
  • [10] A Rough-Set-Based Approach for Classification and Rule Induction
    L. P. Khoo
    S. B. Tor
    L. Y. Zhai
    [J]. The International Journal of Advanced Manufacturing Technology, 1999, 15 : 438 - 444