An effective rough set-based method for text classification

被引:0
|
作者
Bao, YG [1 ]
Asai, D
Du, XY
Yamada, K
Ishii, N
机构
[1] Nagoya Inst Technol, Dept Intelligence & Comp Sci, Nagoya, Aichi 4668555, Japan
[2] Renmin Univ China, Sch Informat, Beijing 100872, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A central problem in good text classification for IF/IR is the high dimensionality of the data. To cope with this problem, we propose a technique using Rough Sets theory to alleviate this situation. Given corpora of documents and a training set of examples of classified documents, the technique locates a minimal set of co-ordinate keywords to distinguish between classes of documents, reducing the dimensionality of the keyword vectors. Besides, we generate several reduct bases for the classification of new object, hoping that the combination of answers of the multiple reduct bases result in better performance. To get the tidy and effective rules, we use the value reduction as the final rules. This paper describes the proposed technique and provides experimental results.
引用
收藏
页码:545 / 552
页数:8
相关论文
共 50 条
  • [41] A study on rough set-based collaborative filtering
    Zhang, W
    Liu, L
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, VOLS 1 AND 2, 2004, : 640 - 644
  • [42] Rough fuzzy set-based image compression
    Petrosino, Alfredo
    Ferone, Alessio
    [J]. FUZZY SETS AND SYSTEMS, 2009, 160 (10) : 1485 - 1506
  • [43] AN EFFECTIVE NEUTROSOPHIC SET-BASED PREPROCESSING METHOD FOR FACE RECOGNITION
    Faraji, Mohammad Reza
    Qi, Xiaojun
    [J]. ELECTRONIC PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2013,
  • [44] Optimistic Multi-granulation Rough Set-Based Classification for Neonatal Jaundice Diagnosis
    Kumar, S. Senthil
    Inbarani, H. Hannah
    Azar, Ahmad Taher
    Own, Hala S.
    Balas, Valentina Emilia
    Olariu, Teodora
    [J]. SOFT COMPUTING APPLICATIONS, (SOFA 2014), VOL 1, 2016, 356 : 307 - 317
  • [45] Rough set-based conflict analysis model and method over two universes
    Sun, Bingzhen
    Ma, Weimin
    Zhao, Haiyan
    [J]. INFORMATION SCIENCES, 2016, 372 : 111 - 125
  • [46] An assessment method for the impact of missing data in the rough set-based decision fusion
    Han, Shan
    Jin, Xiaoning
    Li, Jianxun
    [J]. INTELLIGENT DATA ANALYSIS, 2016, 20 (06) : 1267 - 1284
  • [47] Rough set-based entropy measure with weighted density outlier detection method
    Sangeetha, Tamilarasu
    Mary, Amalanathan Geetha
    [J]. OPEN COMPUTER SCIENCE, 2022, 12 (01): : 123 - 133
  • [48] A fuzzy rough set-based feature selection method using representative instances
    Zhang, Xiao
    Mei, Changlin
    Chen, Degang
    Yang, Yanyan
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 151 : 216 - 229
  • [49] Topic Word Set-Based Text Clustering
    Ghazifard, Amir Mehdi
    Shams, Mohammadreza
    Shamaee, Zeinab
    [J]. 2013 7TH INTERNATIONAL CONFERENCE ON E-COMMERCE IN DEVELOPING COUNTRIES: WITH FOCUS ON E-SECURITY (ECDC), 2013,
  • [50] A Framework on Rough Set-Based Partitioning Attribute Selection
    Herawan, Tutut
    Deris, Mustafa Mat
    [J]. EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2009, 5755 : 91 - 100