Text categorization based on combination of modified back propagation neural network and latent semantic analysis

被引:27
|
作者
Wang, Wei [2 ]
Yu, Bo [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian 710049, Peoples R China
[2] Sichuan Univ, Inst Image & Informat, Sch Elect & Informat, Chengdu 610065, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2009年 / 18卷 / 08期
关键词
Text categorization; Latent semantic analysis; Singular value decomposition; Back propagation neural network; Modified back propagation neural network;
D O I
10.1007/s00521-008-0193-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposed a new text categorization model based on the combination of modified back propagation neural network (MBPNN) and latent semantic analysis (LSA). The traditional back propagation neural network (BPNN) has slow training speed and is easy to trap into a local minimum, and it will lead to a poor performance and efficiency. In this paper, we propose the MBPNN to accelerate the training speed of BPNN and improve the categorization accuracy. LSA can overcome the problems caused by using statistically derived conceptual indices instead of individual words. It constructs a conceptual vector space in which each term or document is represented as a vector in the space. It not only greatly reduces the dimension but also discovers the important associative relationship between terms. We test our categorization model on 20-newsgroup corpus and reuter-21578 corpus, experimental results show that the MBPNN is much faster than the traditional BPNN. It also enhances the performance of the traditional BPNN. And the application of LSA for our system can lead to dramatic dimensionality reduction while achieving good classification results.
引用
收藏
页码:875 / 881
页数:7
相关论文
共 50 条
  • [31] NLP Based Latent Semantic Analysis for Legal Text Summarization
    Merchant, Kaiz
    Pande, Yash
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 1803 - 1807
  • [32] Text Clustering Based on Domain Ontology and Latent Semantic Analysis
    Li Yaxiong
    Pan Deng
    MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 3536 - +
  • [33] A Comprehensive Method for Text Summarization Based on Latent Semantic Analysis
    Wang, Yingjie
    Ma, Jun
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2013, 2013, 400 : 394 - 401
  • [34] A fuzzy neural network based on back-propagation
    Jin, Huang
    Quan, Gan
    Linhui, Cai
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 2, PROCEEDINGS, 2007, 4492 : 151 - +
  • [35] Wavelength Calibration Based on Back Propagation Neural Network
    Zhang, Liang
    Dai, Yinzhen
    Lin, Chun
    Lyu, Ruiqi
    Wang, Lei
    Hu, Tianlin
    PROCEEDINGS OF 2014 IEEE INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY AND IDENTIFICATION (ASID), 2014, : 109 - 112
  • [36] Latent Semantic Analysis: An Approach to Understand Semantic of Text
    Kherwa, Pooja
    Bansal, Poonam
    2017 INTERNATIONAL CONFERENCE ON CURRENT TRENDS IN COMPUTER, ELECTRICAL, ELECTRONICS AND COMMUNICATION (CTCEEC), 2017, : 870 - 874
  • [37] Hailstone Classifier Based on Back Propagation Neural Network
    Liu, Xiangyang
    Wan, Huisong
    Zhang, Yuanyuan
    Jiang, Shuming
    ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING II, PTS 1-3, 2013, 433-435 : 685 - 690
  • [38] Fingerprint Verification Based on Back Propagation Neural Network
    Balti, Ala
    Sayadi, Mounir
    Fnaiech, Farhat
    CONTROL ENGINEERING AND APPLIED INFORMATICS, 2013, 15 (03): : 53 - 60
  • [39] Back propagation neural network analysis for the detection of explosives based on tagged neutron
    Gong, Ke
    Xiao, Shu-Jun
    Jing, Shi-Wei
    Zheng, Yu-Lai
    JOURNAL OF RADIOANALYTICAL AND NUCLEAR CHEMISTRY, 2020, 326 (01) : 329 - 336
  • [40] Indoor Location Algorithm of Back Propagation Neural Network Based on Residual Analysis
    Chen Cheng
    Wang Ping
    Xing Jianchun
    2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 467 - 471