Text Document Classification with PCA and One-Class SVM

被引:5
|
作者
Kumar, B. Shravan [1 ,2 ]
Ravi, Vadlamani [1 ]
机构
[1] Inst Dev & Res Banking Technol, Ctr Excellence Analyt, Castle Hills Rd 1, Hyderabad 500057, Andhra Pradesh, India
[2] Univ Hyderabad, Sch Comp & Informat Sci, Hyderabad 500046, Andhra Pradesh, India
关键词
Text mining; Dimensionality reduction; Document classification; Principal component analysis; One-class support vector machine; PRINCIPAL COMPONENT ANALYSIS; DIMENSION REDUCTION; SELECTION;
D O I
10.1007/978-981-10-3153-3_11
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a document classifier based on principal component analysis (PCA) and one-class support vector machine (OCSVM), where PCA helps achieve dimensionality reduction and OCSVM performs classification. Initially, PCA is invoked on the document-term matrix resulting in choosing the top few principal components. Later, OCSVM is trained on the records of the matrix corresponding to the negative class. Then, we tested the trained OCSVM with the records of the matrix corresponding to the positive class. The effectiveness of the proposed model is demonstrated on the popular datasets, viz., 20NG, malware, Syskill, & Webert, and customer feedbacks of a Bank. We observed that the hybrid yielded very high accuracies in all datasets.
引用
收藏
页码:107 / 115
页数:9
相关论文
共 50 条
  • [41] SHRINKAGE METHODS FOR ONE-CLASS CLASSIFICATION
    Nader, Patric
    Honeine, Paul
    Beauseroy, Pierre
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 135 - 139
  • [42] One-class remote sensing classification: one-class vs. binary classifiers
    Deng, Xueqing
    Li, Wenkai
    Liu, Xiaoping
    Guo, Qinghua
    Newsam, Shawn
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2018, 39 (06) : 1890 - 1910
  • [43] A dynamic one-class classification algorithm
    Xiao, JH
    Progress in Intelligence Computation & Applications, 2005, : 211 - 216
  • [44] Instance reduction for one-class classification
    Bartosz Krawczyk
    Isaac Triguero
    Salvador García
    Michał Woźniak
    Francisco Herrera
    Knowledge and Information Systems, 2019, 59 : 601 - 628
  • [45] Active Learning for One-Class Classification
    Barnabe-Lortie, Vincent
    Bellinger, Colin
    Japkowicz, Nathalie
    2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015, : 390 - 395
  • [46] Feature extraction for one-class classification
    Tax, DMJ
    Müller, KR
    ARTIFICAIL NEURAL NETWORKS AND NEURAL INFORMATION PROCESSING - ICAN/ICONIP 2003, 2003, 2714 : 342 - 349
  • [47] Kernel whitening for one-class classification
    Tax, DMJ
    Juszczak, P
    PATTERN RECOGNITION WITH SUPPORT VECTOR MACHINES, PROCEEDINGS, 2002, 2388 : 40 - +
  • [48] One-class classification with Gaussian processes
    Kemmler, Michael
    Rodner, Erik
    Wacker, Esther-Sabrina
    Denzler, Joachim
    PATTERN RECOGNITION, 2013, 46 (12) : 3507 - 3518
  • [49] One-Class Classification with Gaussian Processes
    Kemmler, Michael
    Rodner, Erik
    Denzler, Joachim
    COMPUTER VISION - ACCV 2010, PT II, 2011, 6493 : 489 - 500
  • [50] Kernel whitening for one-class classification
    Tax, DMJ
    Juszczak, P
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2003, 17 (03) : 333 - 347