CHINESE TEXT CATEGORIZATION STUDY BASED ON FEATURE WEIGHT LEARNING

被引:1
|
作者
Zhan, Yan [1 ]
Chen, Hao [1 ]
Zhang, Su-Fang [2 ]
Zheng, Mei [3 ]
机构
[1] Hebei Univ, Coll Math & Comp Sci, Key Lab Machine Learning & Computat Intelligence, Baoding 071002, Peoples R China
[2] Hebei Informat Engn Sch, Teaching & Res Sect Math, Baoding 071000, Peoples R China
[3] Yanshan Univ, Coll Int Educ, Qinhuangdao 066004, Peoples R China
关键词
Text Categorization; Feature weight; K-NN;
D O I
10.1109/ICMLC.2009.5212257
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text Categorization(TC) is an important component in many information organization and information management tasks. Two key issues in TC are feature coding and classifier design. The Euclidean distance is usually chosen as the similarity measure in K-nearest neighbor classification algorithm. All the features of each vector have different functions in describing samples. So we can decide different function of every feature by using feature weight learning. In this paper Text Categorization via K-nearest neighbor algorithm based on feature weight learning is described. The numerical experiments prove the validity of this learning algorithm.
引用
收藏
页码:1723 / +
页数:2
相关论文
共 50 条
  • [1] A comparative study on feature weight in text categorization
    Deng, ZH
    Tang, SW
    Yang, DQ
    Zhang, M
    Li, LY
    Xie, KQ
    [J]. ADVANCED WEB TECHNOLOGIES AND APPLICATIONS, 2004, 3007 : 588 - 597
  • [2] A study on feature weighting in Chinese text categorization
    Xue, DJ
    Sun, MS
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PROCEEDINGS, 2003, 2588 : 592 - 601
  • [3] A Novel Feature Weight Algorithm for Text Categorization
    Shang, Wenqian
    Dong, Hongbin
    Zhu, Haibin
    Wang, Yongbin
    [J]. IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2008, : 269 - 275
  • [4] Novel feature selection algorithm for Chinese text categorization based on CHI
    Cai Zhenliang
    Wang Jian
    Liu Jiqiang
    [J]. PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 1035 - 1039
  • [5] Feature selection, perceptron learning, and a usability case study for text categorization
    Ng, HT
    Goh, WB
    Low, KL
    [J]. PROCEEDINGS OF THE 20TH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1997, : 67 - 73
  • [6] Temporal-based Feature Selection and Transfer Learning for Text Categorization
    Fukumoto, Fumiyo
    Suzuki, Yoshimi
    [J]. 2015 7TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (IC3K), 2015, : 17 - 26
  • [7] An Improved Feature Weighting Strategy in Chinese Text Categorization
    Song, Jia
    Qin, Sijun
    Zhang, Pengzhou
    [J]. PROCEEDINGS OF THE 2015 6TH INTERNATIONAL CONFERENCE ON MANUFACTURING SCIENCE AND ENGINEERING, 2016, 32 : 202 - 208
  • [8] Learning effective features for Chinese text categorization
    Luo, DS
    Wang, XH
    Wu, XH
    Chi, HS
    [J]. PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 608 - 613
  • [9] Improving Chinese text categorization by outlier learning
    Wang, XH
    Luo, DS
    Wu, XH
    Chi, HS
    [J]. PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 602 - 607
  • [10] Chinese Short Text Categorization Based on Semi-Supervised Learning
    Ma, Jie
    Xiong, Zhong-Yang
    Zhang, Yu-Fang
    Wang, Liu-Qian
    Xie, Jiang
    [J]. 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND MECHANICAL AUTOMATION (CSMA 2017), 2017, : 45 - 54