Fast Extraction of Semantic Features from a Latent Semantic Indexed Text Corpus

Authors
A. Kabán
M. A. Girolami
Affiliation
[1] Helsinki University of Technology, Laboratory of Computer and Information Science
Source
Neural Processing Letters | 2002 / Vol. 15
Keywords
latent semantic indexing; probabilistic latent semantic analysis; projection pursuit; semantic feature extraction; text analysis
DOI
Not available
Abstract
This paper proposes a projection-based symmetrical factorisation method for extracting semantic features from collections of text documents stored in a Latent Semantic space. Preliminary experimental results demonstrate that this yields a representation comparable to that provided by a novel probabilistic approach, which reconsiders the entire indexing problem of text documents and works directly in the original high-dimensional vector-space representation of text. The projection index employed is derived here from the a priori constraints on the problem. The principal advantage of this approach is computational efficiency, which is obtained by exploiting Latent Semantic Indexing as a preprocessing stage. Simulation results on subsets of the 20-Newsgroups text corpus in various settings are provided.
Pages: 31-43
Number of pages: 12