Automatic identification of Chinese weblogger's interests based on text classification

被引:3
|
作者
Ni, Xiaochuan [1 ]
Wu, Xiaoyuan [1 ]
Yu, Yong [1 ]
机构
[1] Shanghai Jiao Tong Univ, 800 Dongchuan Rd, Shanghai 200240, Peoples R China
关键词
D O I
10.1109/WI.2006.47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Chinese weblogs have been expanded in an incredible speed in recent years. There is plentiful personal information in weblogs. In this paper, we propose a text classification based approach to automatically identify the interests of a weblogger. To solve the problems arising out of classifying weblog documents, the technique of heterogeneous classifiers combination is used here. We also use hierarchical classification technique to identify much specific interests. Experiments show that our interest identification approach has a high accuracy and, for most webloggers in our experiments, their interests implied in the contents of blogs could be well identified by using this approach.
引用
收藏
页码:247 / +
页数:2
相关论文
共 50 条
  • [1] The Research of Chinese Text Automatic Classification Based on Multiple
    Zhang, Shengli
    [J]. INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 1543 - 1548
  • [2] A Study on Automatic Chinese Text Classification
    Luo, Xi
    Ohyama, Wataru
    Wakabayashi, Tetsushi
    Kimura, Fumitaka
    [J]. 11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 920 - 924
  • [3] Automatic Chinese Text Classification Based on NSVMDT-KNN
    Xu, QiNan
    Liu, Zhijng
    [J]. FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 410 - 414
  • [4] Research of Chinese-text automatic classification based on SVM
    Coll. of Management, Univ. of Shanghai Science and Technology, Shanghai 200093, China
    [J]. Xi Tong Cheng Yu Dian Zi Ji Shu/Syst Eng Electron, 2007, 3 (475-478):
  • [5] Automatic Text Classification Method Based on Zipf's Law
    Yatsko, V. A.
    [J]. AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS, 2015, 49 (03) : 83 - 88
  • [6] Chinese text classification without automatic word segmentation
    Liu, Wei
    Allison, Ben
    Guthrie, David
    Guthrie, Louise
    [J]. ALPIT 2007: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, 2007, : 45 - +
  • [7] A combined weight method in automatic classification of Chinese text
    Liao, SS
    Jiang, MH
    [J]. PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 625 - 630
  • [8] Blogger's Interest Mining Based on Chinese Text Classification
    Yang, Suhua
    Yan, Jianzhuo
    Gao, Chen
    Tan, Guohua
    [J]. NONLINEAR MATHEMATICS FOR UNCERTAINTY AND ITS APPLICATIONS, 2011, 100 : 611 - 618
  • [9] Automatic Chinese Text Classification Using Character-based and Word-based Approach
    Luo, Xi
    Ohyama, Wataru
    Wakabayashi, Tetsushi
    Kimura, Fumitaka
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 329 - 333
  • [10] A NEW FEATURE SELECTION METHOD BASED ON CONCEPT EXTRACTION IN AUTOMATIC CHINESE TEXT CLASSIFICATION
    Liao, Shasha
    Jiang, Minghu
    [J]. NEW MATHEMATICS AND NATURAL COMPUTATION, 2007, 3 (03) : 331 - 347