Text categorization based on combination of modified back propagation neural network and latent semantic analysis

被引:27
|
作者
Wang, Wei [2 ]
Yu, Bo [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian 710049, Peoples R China
[2] Sichuan Univ, Inst Image & Informat, Sch Elect & Informat, Chengdu 610065, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2009年 / 18卷 / 08期
关键词
Text categorization; Latent semantic analysis; Singular value decomposition; Back propagation neural network; Modified back propagation neural network;
D O I
10.1007/s00521-008-0193-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposed a new text categorization model based on the combination of modified back propagation neural network (MBPNN) and latent semantic analysis (LSA). The traditional back propagation neural network (BPNN) has slow training speed and is easy to trap into a local minimum, and it will lead to a poor performance and efficiency. In this paper, we propose the MBPNN to accelerate the training speed of BPNN and improve the categorization accuracy. LSA can overcome the problems caused by using statistically derived conceptual indices instead of individual words. It constructs a conceptual vector space in which each term or document is represented as a vector in the space. It not only greatly reduces the dimension but also discovers the important associative relationship between terms. We test our categorization model on 20-newsgroup corpus and reuter-21578 corpus, experimental results show that the MBPNN is much faster than the traditional BPNN. It also enhances the performance of the traditional BPNN. And the application of LSA for our system can lead to dramatic dimensionality reduction while achieving good classification results.
引用
收藏
页码:875 / 881
页数:7
相关论文
共 50 条
  • [41] Back propagation neural network analysis for the detection of explosives based on tagged neutron
    Ke Gong
    Shu-Jun Xiao
    Shi-Wei Jing
    Yu-Lai Zheng
    Journal of Radioanalytical and Nuclear Chemistry, 2020, 326 : 329 - 336
  • [42] Improved back propagation neural network based on the enrichment for the crack propagation
    Wang, Lihua
    Ye, Wenjing
    Yang, Fan
    Zhou, Yueting
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 2024, 125 (06)
  • [43] Taking advantage of improved resource allocating network and latent semantic feature selection approach for automated text categorization
    Song, Wei
    Liang, Jiu Zhen
    He, Xiao Liang
    Chen, Peng
    APPLIED SOFT COMPUTING, 2014, 21 : 210 - 220
  • [44] Action categorization by structural probabilistic latent semantic analysis
    Zhang, Jianguo
    Gong, Shaogang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2010, 114 (08) : 857 - 864
  • [45] Security analysis of information in campus network based on improved back-propagation neural network
    Zhao X.H.
    Telecommunications and Radio Engineering (English translation of Elektrosvyaz and Radiotekhnika), 2021, 80 (02): : 35 - 46
  • [46] Dimensionality reduction by combining category information and latent semantic index for text categorization
    Zheng, Wenbin
    An, Lixin
    Xu, Zhanyi
    Journal of Information and Computational Science, 2013, 10 (08): : 2463 - 2469
  • [47] Application of Back-propagation Neural Network to Categorization of Physical Fitness Levels of Taiwanese Females
    Chiu, Ching-Hua
    JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2011, 31 (01) : 31 - 35
  • [48] A Comprehensive Analysis of using Semantic Information in Text Categorization
    Celik, Kerem
    Gungor, Tunga
    2013 IEEE INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (IEEE INISTA), 2013,
  • [49] Fast text categorization using concise semantic analysis
    Li Zhixing
    Xiong Zhongyang
    Zhang Yufang
    Liu Chunyong
    Li Kuan
    PATTERN RECOGNITION LETTERS, 2011, 32 (03) : 441 - 448
  • [50] A neural network model for hierarchical multilingual text categorization
    Chau, RN
    Yeh, CS
    Smith, KA
    ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 2, PROCEEDINGS, 2005, 3497 : 238 - 245