Improved feature selection approach TFIDF in text mining

Cited by: 0
Authors
Jing, LP [1]
Huang, HK [1]
Shi, HB [1]
Affiliation
[1] No Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
Keywords
text mining; TFIDF; evaluation function; VSM; feature selection
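DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract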
This paper describes a feature selection method, TFIDF. Using it, we process the source data and build a vector space model (VSM), which provides a convenient data structure for text categorization. We measure the precision of the method from the categorization results. Based on these empirical results, we analyze the method's advantages and disadvantages and present a new TFIDF-based feature selection approach to improve its accuracy.
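For orientation, the baseline that the abstract refers to can be sketched as follows. This is a minimal illustration of textbook TF-IDF weighting (term frequency times log(N / document frequency)) and a simple top-k feature selection, assuming whitespace tokenization; the function names `tfidf_vectors` and `select_features` are hypothetical, and the sketch does not reproduce the improved approach the paper proposes, whose details are not given in this record.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Build sparse VSM vectors with weight = tf * log(N / df).

    Assumption: documents are tokenized by lowercasing and splitting on
    whitespace; the paper's actual preprocessing is not stated here.
    """
    n_docs = len(docs)
    tokenized = [doc.lower().split() for doc in docs]
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return sorted(df), vectors


def select_features(vectors, k):
    """Keep the k terms with the highest maximum TF-IDF weight.

    Ties are broken alphabetically so the result is deterministic.
    """
    best = {}
    for vec in vectors:
        for term, weight in vec.items():
            best[term] = max(best.get(term, 0.0), weight)
    return sorted(best, key=lambda t: (-best[t], t))[:k]


if __name__ == "__main__":
    docs = ["the cat sat", "the dog barked", "the cat purred"]
    vocab, vectors = tfidf_vectors(docs)
    print(select_features(vectors, 4))
```

A term that occurs in every document (here "the") receives weight zero and is never selected ahead of rarer terms, which is the behavior that makes TF-IDF usable as a feature selection criterion.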
Pages: 944-946
Number of pages: 3