Text Categorization Method Based on Improved Mutual Information and Characteristic Weights Evaluation Algorithms

被引:1
|
作者
Pei, Zhili [1 ,2 ]
Shi, Xiaohu [1 ]
Marchese, Maurizio [3 ]
Liang, Yanchun [1 ,3 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Key Lab Symbol Comp & Knowledge Engn, Minist Educ, Changchun, Peoples R China
[2] Natl Univ Inner Mongolia, Coll Math & Comp Sci, Tongliao 028043, Peoples R China
[3] Univ Trent, Dept Informat & Commun Technol, I-38050 Trento, Italy
基金
美国国家科学基金会;
关键词
D O I
10.1109/FSKD.2007.559
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The improvement of text categorization by statistical methods can be performed from two main directions, namely the feature selection and the evaluation of characteristic weights. In this paper, we propose an enhanced text categorization method based on a modified mutual information algorithm and evaluation algorithm of characteristic weights which improves both aspects. The proposed method is applied to the benchmark test set Reuters-21578 Top10 to examine its effectiveness. Numerical results show that the precision, the recall and the value of F1 of the proposed method are all superior tothose of existing conventional methods.
引用
收藏
页码:87 / +
页数:2
相关论文
共 50 条
  • [1] An enhanced text categorization method based on improved text frequency approach and mutual information algorithm
    Pei Zhili
    Shi Xiaohu
    Marchese, Maurizio
    Liang Yanchun
    [J]. PROGRESS IN NATURAL SCIENCE-MATERIALS INTERNATIONAL, 2007, 17 (12) : 1494 - 1500
  • [2] An enhanced text categorization method based on improved text frequency approach and mutual information algorithm
    Maurizio Marchese
    [J]. Progress in Natural Science:Materials International, 2007, (12) : 1494 - 1500
  • [3] An Improved Feature Selection for Categorization Based on Mutual Information
    Liu, Haifeng
    Su, Zhan
    Yao, Zeqing
    Liu, Shousheng
    [J]. WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, 5854 : 80 - 87
  • [4] Automatic Chinese Text Categorization System Based on Mutual Information
    Lu, Zhimao
    Shi, Hong
    Zhang, Qi
    Yuan, Chaoyue
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-7, CONFERENCE PROCEEDINGS, 2009, : 4986 - 4990
  • [5] Improved Mutual Information Method For Text Feature Selection
    Ding Xiaoming
    Tang Yan
    [J]. PROCEEDINGS OF THE 2013 8TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2013), 2013, : 163 - 166
  • [6] Study on mutual information-based feature selection for text categorization
    Xu, Yan
    Jones, Gareth
    Li, Jintao
    Wang, Bin
    Sun, Chunming
    [J]. Journal of Computational Information Systems, 2007, 3 (03): : 1007 - 1012
  • [7] The Improvement Research of Mutual Information Algorithm for Text Categorization
    Kai, Lu
    Li, Chen
    [J]. KNOWLEDGE ENGINEERING AND MANAGEMENT , ISKE 2013, 2014, 278 : 225 - 232
  • [8] Improved Information Gain-based Feature Selection for Text Categorization
    Gao, Zhe
    Xu, Yajing
    Meng, Fanyu
    Qi, Feng
    Lin, Zhiqing
    [J]. 2014 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, VEHICULAR TECHNOLOGY, INFORMATION THEORY AND AEROSPACE & ELECTRONIC SYSTEMS (VITAE), 2014,
  • [9] Improved Text Matching by Enhancing Mutual Information
    Liu, Yang
    Rong, Wenge
    Xiong, Zhang
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5269 - 5276
  • [10] Feature selection algorithm for text classification based on improved mutual information
    丛帅
    张积宾
    徐志明
    王宇颖
    [J]. Journal of Harbin Institute of Technology(New series), 2011, (03) : 144 - 148