DOCUMENT CLASSIFICATION USING INFORMATION THEORY AND A FAST BACK-PROPAGATION NEURAL NETWORK

被引:1
|
作者
Li, Howard [1 ]
Paull, Liam [1 ]
Biletskiy, Yevgen [1 ]
Yang, Simon X. [2 ]
机构
[1] Univ New Brunswick, Dept Elect & Comp Engn, Fredericton, NB E3B 5A3, Canada
[2] Univ Guelph, Sch Engn, Guelph, ON N1G 2W1, Canada
来源
基金
加拿大自然科学与工程研究理事会;
关键词
document classification; information gain; Shannon entropy; artificial neural networks;
D O I
10.1080/10798587.2010.10643061
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a fast back-propagation neural network is developed to build document classifiers and the information gain method is used for feature selection. According to the rank of the information gain of all the words contained in the documents, those words that contain more information to classify the documents are selected as the input features of the artificial neural network (ANN) classifiers. The neural network developed assumes a three-layer structure with a fast back-propagation learning algorithm. Because of the information contained in the vectors selected, (fie learning efficiency of the developed ANN is very high. For the output of the ANN, Shannon entropy is used to tune the threshold of (lie binary classifiers. The classifiers are tested using the Reuters corpus. Two performance measures are used to evaluate the performance of the classifiers and generally the results of this study are better than those claimed in literature.
引用
收藏
页码:25 / 37
页数:13
相关论文
共 50 条
  • [1] Development of a patent document classification and search platform using a back-propagation network
    Trappey, Amy J. C.
    Hsu, Fu-Chiang
    Trappey, Charles V.
    Lin, Chia-I
    EXPERT SYSTEMS WITH APPLICATIONS, 2006, 31 (04) : 755 - 765
  • [2] Classification of CmⅠ Energy Levels Using the Back-propagation Neural Network
    贺黎明
    董广勇
    夏明思
    金乾元
    华东理工大学学报(自然科学版), 1995, (04) : 523 - 528
  • [3] Classification of the Period Undergraduate Study Using Back-propagation Neural Network
    Prasetyawan, Purwono
    Ahmad, Imam
    Borman, Rohmat Indra
    Ardiansyah
    Pahlevi, Yogi Aziz
    Kurniawan, Dwi Ely
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON APPLIED ENGINEERING (ICAE), 2018,
  • [4] Service classification using adaptive back-propagation neural network and semantic similarity
    Kuang, Li
    Wu, Jian
    Deng, Shuiguang
    Li, Ying
    Wu, Zhaohui
    2006 10TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, PROCEEDINGS, VOLS 1 AND 2, 2006, : 834 - 838
  • [5] Classification of Landslide Features Using a LiDAR DEM and Back-Propagation Neural Network
    Wu, Jee-Cheng
    Chang, Chia-Hao
    ADVANCES IN REMOTE SENSING AND GEO INFORMATICS APPLICATIONS, 2019, : 155 - 158
  • [6] Fast Neural Network Algorithm For Solving Classification Tasks Batch Error Back-Propagation Algorithm
    Albarakati, Noor
    Kecman, Vojislav
    2013 PROCEEDINGS OF IEEE SOUTHEASTCON, 2013,
  • [7] CLASSIFICATION OF MULTISPECTRAL REMOTE-SENSING DATA USING A BACK-PROPAGATION NEURAL NETWORK
    HEERMANN, PD
    KHAZENIE, N
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 1992, 30 (01): : 81 - 88
  • [8] Visualization of buckling deformation using back-propagation neural network
    Yamamoto, K
    Garrett, JH
    PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON COMPUTING IN CIVIL AND BUILDING ENGINEERING, VOLS 1-4, 1997, : 285 - 290
  • [9] Using Back-Propagation Neural Network for Automatic Wheezing Detection
    Lin, Bor-Shing
    Jan, Gene Eu
    Wu, Huey-Dong
    Lin, Bor-Shyh
    Chen, Sao-Jie
    2015 INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP), 2015, : 49 - 52
  • [10] A study of using back-propagation neural network in joint procurement
    Chen, Pin-Chang
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2009, 30 (06): : 1283 - 1297