Rough Set Based Feature Selection Approach for Text Mining

被引:0
|
作者
Sailaja, N. Venkata [1 ]
Sree, L. Padma [2 ]
Mangathayaru, N. [3 ]
机构
[1] Vignana Jyothi Inst Engn & Technol, Dept CSE, Hyderabad, Andhra Pradesh, India
[2] Vignana Jyothi Inst Engn & Technol, Dept ECE, Hyderabad, Andhra Pradesh, India
[3] Vignana Jyothi Inst Engn & Technol, Dept IT, Hyderabad, Andhra Pradesh, India
关键词
Rough sets; Feature Selection; Information system; Reduct; Discernibility; Lower and Upper approximations;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text can be thought as the combination of characters. In the environment where the size of unstructured text data is hugely more, to process such data by computers is a challenging task. To extract meaningful and useful patterns from the text, some pre-processing methods and algorithms are required. Feature selection or Reduct generation intends to determine a smallest attributes subset which can represent the same knowledge as the original features(attributes) represented it. Rough set theory (RST) is such a mathematical tool, which can be used with tremendous success. Here, In the paper, we proposed a Rough set based approach for feature selection in the Text data set, which fulfil the aim of Text mining. We have taken different sample Text case documents (like biography text data, sample research articles of various domains, news articles from some sources) as input, these files can be in the form of .txt, .pdf etc. or any other format. We have also presented complexity analysis of our proposed algorithm and experimental results on a sample text data sets.
引用
收藏
页码:40 / 45
页数:6
相关论文
共 50 条
  • [1] Rough set based feature selection for web usage mining
    Inbarani, H. Hannah
    Thangavel, K.
    Pethalakshmi, A.
    ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL I, PROCEEDINGS, 2007, : 33 - +
  • [2] A rough set approach to feature selection based on power set tree
    Chen, Yumin
    Miao, Duoqian
    Wang, Ruizhi
    Wu, Keshou
    KNOWLEDGE-BASED SYSTEMS, 2011, 24 (02) : 275 - 281
  • [3] Heuristic-based feature selection for rough set approach
    Stanczyk, U.
    Zielosko, B.
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2020, 125 : 187 - 202
  • [4] A Rough Set Based Feature Selection Approach using Random Feature Vectors
    Raza, Muhammad Summair
    Qamar, Usman
    PROCEEDINGS OF 14TH INTERNATIONAL CONFERENCE ON FRONTIERS OF INFORMATION TECHNOLOGY PROCEEDINGS - FIT 2016, 2016, : 229 - 234
  • [5] Text Feature Extraction Based on Rough Set
    Cheng, Yiyuan
    Zhang, Ruiling
    Wang, Xiufeng
    Chen, Qiushuang
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 310 - 314
  • [6] A rough set approach to feature selection based on ant colony optimization
    Chen, Yumin
    Miao, Duoqian
    Wang, Ruizhi
    PATTERN RECOGNITION LETTERS, 2010, 31 (03) : 226 - 233
  • [7] A Rough Set Approach to Feature Selection Based on Relative Decision Entropy
    Zhou, Lin
    Jiang, Feng
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, 2011, 6954 : 110 - 119
  • [8] An approach for selective ensemble feature selection based on rough set theory
    Yang, Yong
    Wang, Guoyin
    He, Kun
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, PROCEEDINGS, 2007, 4481 : 518 - +
  • [9] An Approach to Feature Selection Based on Ant Colony Optimization and Rough Set
    Wu, Junyun
    Qiu, Taorong
    Wang, Lu
    Huang, Haiquan
    INTELLIGENT COMPUTING AND INFORMATION SCIENCE, PT I, 2011, 134 (0I): : 466 - 471
  • [10] A rough set approach to feature selection based on scatter search metaheuristic
    Jue Wang
    Qi Zhang
    Hedar Abdel-Rahman
    M. Ibrahim Abdel-Monem
    Journal of Systems Science and Complexity, 2014, 27 : 157 - 168