Abstracting for Dimensionality Reduction in Text Classification

被引:1
|
作者
McAllister, Richard A. [1 ]
Angryk, Rafal A. [1 ]
机构
[1] Montana State Univ, Dept Comp Sci, Bozeman, MT 59717 USA
关键词
D O I
10.1002/int.21543
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There is a growing interest in efficient models of text mining and an emergent need for new data structures that address word relationships. Detailed knowledge about the taxonomic environment of keywords that are used in text documents can provide valuable insight into the nature of the subject matter contained therein. Such insight may be used to enhance the data structures used in the text data mining task as relationships become usefully apparent. A popular scalable technique used to infer these relationships, while reducing dimensionality, has been Latent Semantic Analysis. We present a new approach, which uses an ontology of lexical abstractions to create abstraction profiles of documents and uses these profiles to perform text organization based on a process that we call frequent abstraction analysis. We introduce TATOO, the Text Abstraction TOOlkit, which is a full implementation of this new approach. We present our data model via an example of how taxonomically derived abstractions can be used to supplement semantic data structures for the text classification task. (C) 2012 Wiley Periodicals, Inc.
引用
收藏
页码:115 / 138
页数:24
相关论文
共 50 条
  • [41] Quadratic mutual information for dimensionality reduction and classification
    Gray, David M.
    Principe, Jose C.
    AUTOMATIC TARGET RECOGNITION XX; ACQUISITION, TRACKING, POINTING, AND LASER SYSTEMS TECHNOLOGIES XXIV; AND OPTICAL PATTERN RECOGNITION XXI, 2010, 7696
  • [42] COMPUTATIONAL COMPLEXITY ASPECTS OF DIMENSIONALITY REDUCTION AND CLASSIFICATION
    COVER, TM
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1971, SMC1 (04): : 402 - &
  • [43] Iterative Nearest Neighbors for Classification and Dimensionality Reduction
    Timofte, Radu
    Van Gool, Luc
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2456 - 2463
  • [44] Supervised nonlinear dimensionality reduction for visualization and classification
    Geng, X
    Zhan, DC
    Zhou, ZH
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2005, 35 (06): : 1098 - 1107
  • [45] Dictionary learning based dimensionality reduction for classification
    Schnass, Karin
    Vandergheynst, Pierre
    2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 780 - +
  • [46] Quantum discriminant analysis for dimensionality reduction and classification
    Cong, Iris
    Duan, Luming
    NEW JOURNAL OF PHYSICS, 2016, 18
  • [47] Variational quantum algorithms for dimensionality reduction and classification
    Liang, Jin-Min
    Shen, Shu-Qian
    Li, Ming
    Li, Lei
    PHYSICAL REVIEW A, 2020, 101 (03)
  • [48] Schroedinger Eigenmaps for Dimensionality Reduction and Image Classification
    Chen, Guoming
    2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 158 - 162
  • [49] Dimensionality reduction of face images for gender classification
    Buchala, S
    Davey, N
    Frank, RJ
    Gale, TM
    2004 2ND INTERNATIONAL IEEE CONFERENCE INTELLIGENT SYSTEMS, VOLS 1 AND 2, PROCEEDINGS, 2004, : 88 - 93
  • [50] A Combining Dimensionality Reduction Approach for Cancer Classification
    Han, Lijun
    Zhou, Changjun
    Wang, Bin
    Zhang, Qiang
    MULTI-DISCIPLINARY TRENDS IN ARTIFICIAL INTELLIGENCE, MIWAI 2015, 2015, 9426 : 340 - 347