Robust discriminant analysis of latent semantic feature for text categorization

被引:0
|
作者
Hu, Jiani [1 ]
Deng, Weihong [1 ]
Guo, Jun [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a Discriminative Semantic Feature (DSF) method for vector space model based text categorization. The DSF method, which involves two stages, first reduces the dimension of the document vector space by Latent Semantic Indexing (LSI), and then applies a Robust linear Discriminant analysis Model (RDM), which improves the classical LDA by a energy-adaptive regularization criteria, to extract the discriminative semantic feature with enhanced discrimination power. As a result, DSF method can not only uncover latent semantic structure but also capture the discriminative feature. Comparative experiments on various state-of-art dimension reduction schemes such as our DSF, LSI, orthogonal centroid, two-stage LSI+LDA, LDA/QR and LDA/GSVD, are also performed. Experiments using the Reuters-21578 text collection show the proposed method performs better than other algorithms.
引用
收藏
页码:400 / 409
页数:10
相关论文
共 50 条
  • [1] An Application of Latent Semantic Analysis for Text Categorization
    Kou, G.
    Peng, Y.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2015, 10 (03) : 357 - 369
  • [2] A Latent Semantic Analysis-based Approach to Geographic Feature Categorization from Text
    Huang, Yuxia
    FIFTH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2011), 2011, : 87 - 94
  • [3] Local and Global Latent Semantic Analysis for Text Categorization
    Ghanem, Khadoudja
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2014, 4 (03) : 1 - 13
  • [4] Web text categorization based on latent semantic analysis
    Wang Jianfeng
    Yuan Jinsha
    ICCSE'2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION: ADVANCED COMPUTER TECHNOLOGY, NEW EDUCATION, 2006, : 826 - 828
  • [5] Latent semantic analysis for text categorization using neural network
    Yu, Bo
    Xu, Zong-ben
    Li, Cheng-hua
    KNOWLEDGE-BASED SYSTEMS, 2008, 21 (08) : 900 - 904
  • [6] Latent semantic analysis approaches to categorization
    Laham, D
    PROCEEDINGS OF THE NINETEENTH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 1997, : 979 - 979
  • [7] A discriminative and semantic feature selection method for text categorization
    Zong, Wei
    Wu, Feng
    Chu, Lap-Keung
    Sculli, Domenic
    INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2015, 165 : 215 - 222
  • [8] Text categorization via generalized discriminant analysis
    Li, Tao
    Zhu, Shenghuo
    Ogihara, Mitsunori
    INFORMATION PROCESSING & MANAGEMENT, 2008, 44 (05) : 1684 - 1697
  • [9] Local Latent Semantic Analysis Based on Support Vector Machine for Imbalanced Text Categorization
    Wan, Yuan
    Tong, Hengqing
    Deng, Yanfang
    2010 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION (PACIIA2010), VOL III, 2010, : 168 - 171
  • [10] Local Latent Semantic Analysis Based on Support Vector Machine for Imbalanced Text Categorization
    Wan, Yuan
    Tong, Hengqing
    Deng, Yanfang
    APPLIED INFORMATICS AND COMMUNICATION, PT III, 2011, 226 : 321 - 329