Text categorization based on combination of modified back propagation neural network and latent semantic analysis

被引:27
|
作者
Wang, Wei [2 ]
Yu, Bo [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian 710049, Peoples R China
[2] Sichuan Univ, Inst Image & Informat, Sch Elect & Informat, Chengdu 610065, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2009年 / 18卷 / 08期
关键词
Text categorization; Latent semantic analysis; Singular value decomposition; Back propagation neural network; Modified back propagation neural network;
D O I
10.1007/s00521-008-0193-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposed a new text categorization model based on the combination of modified back propagation neural network (MBPNN) and latent semantic analysis (LSA). The traditional back propagation neural network (BPNN) has slow training speed and is easy to trap into a local minimum, and it will lead to a poor performance and efficiency. In this paper, we propose the MBPNN to accelerate the training speed of BPNN and improve the categorization accuracy. LSA can overcome the problems caused by using statistically derived conceptual indices instead of individual words. It constructs a conceptual vector space in which each term or document is represented as a vector in the space. It not only greatly reduces the dimension but also discovers the important associative relationship between terms. We test our categorization model on 20-newsgroup corpus and reuter-21578 corpus, experimental results show that the MBPNN is much faster than the traditional BPNN. It also enhances the performance of the traditional BPNN. And the application of LSA for our system can lead to dramatic dimensionality reduction while achieving good classification results.
引用
收藏
页码:875 / 881
页数:7
相关论文
共 50 条
  • [1] Text categorization based on combination of modified back propagation neural network and latent semantic analysis
    Wei Wang
    Bo Yu
    Neural Computing and Applications, 2009, 18 : 875 - 881
  • [2] Latent semantic analysis for text categorization using neural network
    Yu, Bo
    Xu, Zong-ben
    Li, Cheng-hua
    KNOWLEDGE-BASED SYSTEMS, 2008, 21 (08) : 900 - 904
  • [3] Web text categorization based on latent semantic analysis
    Wang Jianfeng
    Yuan Jinsha
    ICCSE'2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION: ADVANCED COMPUTER TECHNOLOGY, NEW EDUCATION, 2006, : 826 - 828
  • [4] An Application of Latent Semantic Analysis for Text Categorization
    Kou, G.
    Peng, Y.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2015, 10 (03) : 357 - 369
  • [5] Local and Global Latent Semantic Analysis for Text Categorization
    Ghanem, Khadoudja
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2014, 4 (03) : 1 - 13
  • [6] A novel algorithm for text categorization using improved back-propagation neural network
    Li, Cheng Hua
    Park, Soon Cheol
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4223 : 452 - 460
  • [7] Robust discriminant analysis of latent semantic feature for text categorization
    Hu, Jiani
    Deng, Weihong
    Guo, Jun
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4223 : 400 - 409
  • [8] Semantic Clustering and Convolutional Neural Network for Short Text Categorization
    Wang, Peng
    Xu, Jiaming
    Xu, Bo
    Liu, Cheng-Lin
    Zhang, Heng
    Wang, Fangyuan
    Hao, Hongwei
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 352 - 357
  • [9] Local Latent Semantic Analysis Based on Support Vector Machine for Imbalanced Text Categorization
    Wan, Yuan
    Tong, Hengqing
    Deng, Yanfang
    2010 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION (PACIIA2010), VOL III, 2010, : 168 - 171
  • [10] Local Latent Semantic Analysis Based on Support Vector Machine for Imbalanced Text Categorization
    Wan, Yuan
    Tong, Hengqing
    Deng, Yanfang
    APPLIED INFORMATICS AND COMMUNICATION, PT III, 2011, 226 : 321 - 329