An effective rough set-based method for text classification

被引:0
|
作者
Bao, YG [1 ]
Asai, D
Du, XY
Yamada, K
Ishii, N
机构
[1] Nagoya Inst Technol, Dept Intelligence & Comp Sci, Nagoya, Aichi 4668555, Japan
[2] Renmin Univ China, Sch Informat, Beijing 100872, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A central problem in good text classification for IF/IR is the high dimensionality of the data. To cope with this problem, we propose a technique using Rough Sets theory to alleviate this situation. Given corpora of documents and a training set of examples of classified documents, the technique locates a minimal set of co-ordinate keywords to distinguish between classes of documents, reducing the dimensionality of the keyword vectors. Besides, we generate several reduct bases for the classification of new object, hoping that the combination of answers of the multiple reduct bases result in better performance. To get the tidy and effective rules, we use the value reduction as the final rules. This paper describes the proposed technique and provides experimental results.
引用
收藏
页码:545 / 552
页数:8
相关论文
共 50 条
  • [21] A rough set-based approach to handling uncertainty in geographic data classification
    Jankowski, Piotr
    [J]. GEOGRAPHIC UNCERTAINTY IN ENVIRONMENTAL SECURITY, 2007, : 75 - 87
  • [22] Rough Set-Based Decision Tool for Classification of Cotton Yarn Neps
    Das S.
    Ghosh A.
    [J]. Journal of The Institution of Engineers (India): Series E, 2021, 102 (01) : 1 - 10
  • [23] Research on the rough set-based method of mining uncertainty knowledge
    Huang, TY
    Lu, T
    Cheng, Y
    [J]. PROCEEDINGS OF THE 2001 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING, VOLS I AND II, 2001, : 3 - 8
  • [24] A rough set-based multiple criteria linear programming approach for classification
    Zhang, Zhiwang
    Shi, Yong
    Zhang, Peng
    Gao, Guangxia
    [J]. COMPUTATIONAL SCIENCE - ICCS 2008, PT 2, 2008, 5102 : 476 - +
  • [25] A prediction method for the cellphone replacement: Rough set-based analysis
    Deng, Weibin
    Wang, Flaijin
    Dai, Shimin
    [J]. JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2018, 21 (04): : 881 - 893
  • [26] A rough set-based CBR approach for feature and document reduction in text categorization
    Li, Y
    Shiu, SCK
    Pal, SK
    Liu, JNK
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 2438 - 2443
  • [27] A rough set-based fuzzy clustering
    Zhao, YQ
    Zhou, XZ
    Tang, GZ
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2005, 3689 : 401 - 409
  • [28] On the Definability of a Set and Rough Set-Based Rule Generation
    Sakai, Hiroshi
    Wu, Mao
    Yamaguchi, Naoto
    [J]. 2014 IIAI 3RD INTERNATIONAL CONFERENCE ON ADVANCED APPLIED INFORMATICS (IIAI-AAI 2014), 2014, : 122 - 125
  • [29] Rough Set-Based Classification of EEG Signals Related to Real and Imagery Motion
    Szczuko, Piotr
    [J]. 2016 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2016, : 34 - 39
  • [30] Rule generation based on rough set theory for text classification
    Bi, YX
    Anderson, T
    McClean, S
    [J]. RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XVII, 2001, : 157 - 170