Fast extraction of semantic features from a latent semantic indexed text corpus

被引:2
|
作者
Kabán, A [1 ]
Girolami, MA [1 ]
机构
[1] Aalto Univ, Lab Comp & Informat Sci, FIN-02015 Espoo, Finland
关键词
latent semantic indexing; probabilistic latent semantic analysis; projection pursuit; semantic feature extraction; text analysis;
D O I
10.1023/A:1013801028884
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a projection-based symmetrical factorisation method for extracting semantic features from collections of text documents stored in a Latent Semantic space. Preliminary experimental results demonstrate this yields a comparable representation to that provided by a novel probabilistic approach which reconsiders the entire indexing problem of text documents and works directly in the original high dimensional vector-space representation of text. The employed projection index is derived here from the a priori constraints on the problem. The principal advantage of this approach is computational efficiency and is obtained by the exploitation of the Latent Semantic Indexing as a preprocessing stage. Simulation results on subsets of the 20-Newsgroups text corpus in various settings are provided.
引用
收藏
页码:31 / 34
页数:4
相关论文
共 50 条
  • [21] Semantic Representation Extraction from Unstructured Arabic Text
    Zakria, Gehad
    Farouk, Mamdouh
    Fathy, Khaled
    Makar, Malak N.
    PROCEEDINGS OF 2019 8TH INTERNATIONAL CONFERENCE ON SOFTWARE AND INFORMATION ENGINEERING (ICSIE 2019), 2019, : 222 - 226
  • [22] Automatic extraction of corollaries from semantic structure of text
    Nurtazin, Abyz T.
    Khisamiev, Zarif G.
    OPEN ENGINEERING, 2016, 6 (01): : 353 - 358
  • [23] Semantic Search by Latent Ontological Features
    Tru H. Cao
    Vuong M. Ngo
    New Generation Computing, 2012, 30 : 53 - 71
  • [24] Semantic Search by Latent Ontological Features
    Cao, Tru H.
    Ngo, Vuong M.
    NEW GENERATION COMPUTING, 2012, 30 (01) : 53 - 71
  • [25] An Adaptive Latent Semantic Analysis for Text mining
    Hong T. Tu
    Tuoi T. Phan
    Khu P. Nguyen
    2017 INTERNATIONAL CONFERENCE ON SYSTEM SCIENCE AND ENGINEERING (ICSSE), 2017, : 588 - 593
  • [26] Text summarization using Latent Semantic Analysis
    Ozsoy, Makbule Gulcin
    Alpaslan, Ferda Nur
    Cicekli, Ilyas
    JOURNAL OF INFORMATION SCIENCE, 2011, 37 (04) : 405 - 417
  • [27] An Application of Latent Semantic Analysis for Text Categorization
    Kou, G.
    Peng, Y.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2015, 10 (03) : 357 - 369
  • [28] The interpretation of dream meaning: Resolving ambiguity using Latent Semantic Analysis in a small corpus of text
    Altszyler, Edgar
    Ribeiro, Sidarta
    Sigman, Mariano
    Fernandez Slezak, Diego
    CONSCIOUSNESS AND COGNITION, 2017, 56 : 178 - 187
  • [29] Boosting for text classification with semantic features
    Bloehdorn, Stephan
    Hotho, Andreas
    ADVANCES IN WEB MINING AND WEB USAGE ANALYSIS, 2006, 3932 : 149 - 166
  • [30] Semantic relation extraction aware of N-gram features from unstructured biomedical text
    Wang, Zheng
    Xu, Shuo
    Zhu, Lijun
    JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 86 : 59 - 70