Improved feature selection approach TFIDF in text mining

被引:0
|
作者
Jing, LP [1 ]
Huang, HK [1 ]
Shi, HB [1 ]
机构
[1] No Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
关键词
text mining; TFIDF; evaluation function; VSM; feature selection;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes one Feature Selection method (TFIDF). With it, we process the data resource and set up the VSM model in order to provide a convenient data structure for text categorization. We calculate the precision of this method with the help of categorization results. According to the empirical results, we analyze its advantages and disadvantages and present a new TFIDF-based feature selection approach to improve its accuracy.
引用
收藏
页码:944 / 946
页数:3
相关论文
共 50 条
  • [1] A Text Feature Selection Algorithm Based on Improved TFIDF
    Chengcheng Yang
    Xingshi He
    [J]. PROCEEDINGS OF THE 2008 CHINESE CONFERENCE ON PATTERN RECOGNITION (CCPR 2008), 2008, : 416 - 419
  • [2] A Feature Selection Method based on Improved TFIDF
    Wei Yong-qing
    Liu Pei-yu
    Zhu Zhen-fang
    [J]. 2008 3RD INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND APPLICATIONS, VOLS 1 AND 2, 2008, : 94 - +
  • [3] Improvement of Text Feature Selection Method based on TFIDF
    Qu, Shouning
    Wang, Sujuan
    Zou, Yan
    [J]. 2008 INTERNATIONAL SEMINAR ON FUTURE INFORMATION TECHNOLOGY AND MANAGEMENT ENGINEERING, PROCEEDINGS, 2008, : 79 - 81
  • [4] A TEXT FEATURE SELECTION METHOD USING TFIDF BASED ON ENTROPY
    Song, Jiang
    Xu, Min
    Fan, Chuyi
    [J]. COMPUTATIONAL INTELLIGENCE: FOUNDATIONS AND APPLICATIONS: PROCEEDINGS OF THE 9TH INTERNATIONAL FLINS CONFERENCE, 2010, 4 : 962 - 967
  • [5] An improved TFIDF feature selection algorithm based on information entropy
    Zhou Yantao
    Tang Jianbo
    Wang Jiaqin
    [J]. PROCEEDINGS OF THE 26TH CHINESE CONTROL CONFERENCE, VOL 5, 2007, : 312 - +
  • [6] Rough Set Based Feature Selection Approach for Text Mining
    Sailaja, N. Venkata
    Sree, L. Padma
    Mangathayaru, N.
    [J]. PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2016, : 40 - 45
  • [7] Feature selection methods for event detection in Twitter: a text mining approach
    Hossny, Ahmad Hany
    Mitchell, Lewis
    Lothian, Nick
    Osborne, Grant
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2020, 10 (01)
  • [8] An improved TFIDF Algorithm in text classification
    Xu, Dongdong
    Wu, Shaobo
    [J]. MATERIAL SCIENCE, CIVIL ENGINEERING AND ARCHITECTURE SCIENCE, MECHANICAL ENGINEERING AND MANUFACTURING TECHNOLOGY II, 2014, 651-653 : 2258 - 2261
  • [9] Feature selection methods for event detection in Twitter: a text mining approach
    Ahmad Hany Hossny
    Lewis Mitchell
    Nick Lothian
    Grant Osborne
    [J]. Social Network Analysis and Mining, 2020, 10
  • [10] Feature Selection and Feature Weight Estimate in Web Text Mining
    Pei, Zhili
    Qi, Jianhong
    Zhang, Xinhong
    Zhou, Yuxin
    Bai, Mingyu
    Wang, Qinghu
    Liu, Lisha
    Fan, Xiaojing
    Jiang, Mingyang
    [J]. 2ND INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY FOR EDUCATION (ICTE 2015), 2015, : 316 - 320