An approach to text classification using dimensionality reduction and combination of classifiers

被引:10
|
作者
Jain, G [1 ]
Ginwala, A [1 ]
Aslandogan, YA [1 ]
机构
[1] Univ Texas, Dept Comp Sci & Engn, Arlington, TX 76019 USA
关键词
D O I
10.1109/IRI.2004.1431521
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text classification involves assignment of predetermined categories to textual resources. Applications of text classification include recommendation systems, personalization, help desk automation, content filtering and routing, selective alerting, and text mining. This paper describes an experiment for improving the classification accuracy of a large text corpus by the use of dimensionality reduction and multiple-classifier combination techniques. Three different classifiers have been used namely Naive Bayes, J48 Decision Tree and Decision Table. The results of these classifiers are combined using techniques such as Simple Voting, Weighted Voting and Probability-based Voting. The classification accuracy is further improved by the use of a dimensionality reduction method based on concept indexing. Experiments conducted on the Reuters 21578 dataset indicate that the combination approach provides an improved and scalable method for text classification. Also, it is observed that concept indexing helps with classification accuracy in addition to efficiency and scalability.
引用
收藏
页码:564 / 569
页数:6
相关论文
共 50 条
  • [41] A NOVEL DIMENSIONALITY REDUCTION APPROACH TO IMPROVE MICROARRAY DATA CLASSIFICATION
    Hamim, Mohammed
    El Mouden, Ismail
    Ouzir, Mounir
    Moutachaouik, Hicham
    Hain, Mustapha
    [J]. IIUM ENGINEERING JOURNAL, 2021, 22 (01): : 1 - 23
  • [42] Classification of Imbalanced Banking Dataset using Dimensionality Reduction
    Valarmathi, B.
    Chellatamilan, T.
    Mittal, Hritik
    Jagrit
    Shubham
    [J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), 2019, : 1353 - 1357
  • [43] An adaptive semantic dimensionality reduction approach for hyperspectral imagery classification
    Hamdi, Rawaa
    Sellami, Akrem
    Farah, Imed Riadh
    [J]. 2018 4TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2018,
  • [44] Combining Probabilistic Classifiers for Text Classification
    Fragos, Kostas
    Belsis, Petros
    Skourlas, Christos
    [J]. 3RD INTERNATIONAL CONFERENCE ON INTEGRATED INFORMATION (IC-ININFO), 2014, 147 : 307 - 312
  • [45] Exploration of dimensionality reduction for text visualization
    Huang, SP
    Ward, MO
    Rundensteiner, EA
    [J]. THIRD INTERNATIONAL CONFERENCE ON COORDINATED & MULTIPLE VIEWS IN EXPLORATORY VISUALIZATION, PROCEEDINGS, 2005, : 63 - 74
  • [46] Reduction of Training Noises for Text Classifiers
    Liu, Rey-Long
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2013), PT II, 2013, 7803 : 30 - 39
  • [47] Combination and optimization of classifiers in gender classification using genetic programming
    Khan, Asifullah
    Majid, Abdul
    Mirza, Anwar
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2005, 9 (01) : 1 - 11
  • [48] Block classification of a web page by using a combination of multiple classifiers
    Kang, Jinbeom
    Choi, Joongmin
    [J]. NCM 2008: 4TH INTERNATIONAL CONFERENCE ON NETWORKED COMPUTING AND ADVANCED INFORMATION MANAGEMENT, VOL 2, PROCEEDINGS, 2008, : 290 - 295
  • [49] The Role of Dimensionality Reduction in Classification
    Wang, Weiran
    Carreira-Perpinan, Miguel A.
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2128 - 2134
  • [50] Dimensionality Reduction for Ordinal Classification
    Zine-El-Abidine, Mouad
    Dutagaci, Helin
    Rousseau, David
    [J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1531 - 1535