Automatic identification of Chinese weblogger's interests based on text classification

Cited by: 3
Authors
Ni, Xiaochuan [1]
Wu, Xiaoyuan [1]
Yu, Yong [1]
Affiliation
[1] Shanghai Jiao Tong Univ, 800 Dongchuan Rd, Shanghai 200240, Peoples R China
DOI
10.1109/WI.2006.47
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Chinese weblogs have grown at a remarkable pace in recent years, and they contain a wealth of personal information. In this paper, we propose a text-classification-based approach to automatically identify the interests of a weblogger. To address the problems that arise when classifying weblog documents, we combine heterogeneous classifiers. We also use hierarchical classification to identify more specific interests. Experiments show that our approach achieves high accuracy and that, for most webloggers in our experiments, the interests implied by their blog content are identified well.
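The abstract names two techniques, combining heterogeneous classifiers and classifying hierarchically, without giving details. The following is only a minimal sketch of how the two ideas can fit together, not the authors' method: the taxonomy, the keyword lists, the two toy scoring functions, and the score-averaging rule are all hypothetical, and whitespace tokenization stands in for the word segmentation that real Chinese text would need.

```python
from collections import Counter

# Hypothetical two-level interest taxonomy (illustrative, not from the paper).
TAXONOMY = {
    "sports": {"football": ["goal", "league"], "basketball": ["dunk", "nba"]},
    "technology": {"programming": ["code", "python"], "gadgets": ["phone", "camera"]},
}

def keyword_scores(tokens, label_keywords):
    """Toy classifier A: fraction of tokens that match each label's keywords."""
    counts = Counter(tokens)
    total = max(len(tokens), 1)
    return {label: sum(counts[k] for k in kws) / total
            for label, kws in label_keywords.items()}

def presence_scores(tokens, label_keywords):
    """Toy classifier B: fraction of each label's keywords seen at least once."""
    seen = set(tokens)
    return {label: sum(k in seen for k in kws) / len(kws)
            for label, kws in label_keywords.items()}

def combine(tokens, label_keywords, classifiers=(keyword_scores, presence_scores)):
    """Average the scores of the different classifiers and pick the best label."""
    totals = Counter()
    for clf in classifiers:
        for label, score in clf(tokens, label_keywords).items():
            totals[label] += score / len(classifiers)
    return max(totals, key=totals.get)

def identify_interest(text):
    """Hierarchical step: choose a top-level interest, then a sub-interest under it."""
    tokens = text.lower().split()  # assumption: stand-in for Chinese word segmentation
    top_keywords = {top: [k for kws in subs.values() for k in kws]
                    for top, subs in TAXONOMY.items()}
    top = combine(tokens, top_keywords)
    sub = combine(tokens, TAXONOMY[top])
    return top, sub

print(identify_interest("I wrote python code all night"))
# ('technology', 'programming')
```

Restricting the second decision to the children of the chosen top-level category is what makes the sketch hierarchical; a real system would replace the toy scorers with trained classifiers.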
Pages: 247 / +
Number of pages: 2