An Adaptive Latent Semantic Analysis for Text mining

Cited by: 0
Authors
Hong T. Tu [1]
Tuoi T. Phan [1]
Khu P. Nguyen [2]
Affiliations
[1] HCMC Univ Technol & Educ, VNU HCMC, 268 LyThuong Kiet, Hcmc, Vietnam
[2] UIT, VNU HCMC, Ward 6, Thuduc Dist, Hcmc, Vietnam
Keywords
Latent semantic analysis; convex optimization; regularization; coordinate descent; matrix decomposition;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Latent Semantic Analysis (LSA) applies singular value decomposition to a document-term co-occurrence matrix to derive a latent class model. Despite its success, the technique has some shortcomings. Recent works have improved standard LSA using probability distributions, regularization, and sparseness constraints, but some deficiencies remain. To address them, this paper proposes an adapted technique called hk-LSA, based on reducing the dimension of the vector space and on probability-like relationships between documents and the latent-topic space. The adaptive technique overcomes some weak points of LSA, such as the density of the orthogonal matrices to be processed, the complexity of matrix decomposition, and the need to deal with alternating iterative algorithms. Experiments show consistent and substantial improvements of hk-LSA over LSA.
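For orientation, the standard LSA baseline that the abstract describes can be sketched as a truncated singular value decomposition of a document-term matrix. This is only an illustration of plain LSA, not the hk-LSA method, whose details are not given in this record. The function name `lsa`, the helper `cosine`, and the toy matrix are assumptions made for this example.

```python
import numpy as np

def lsa(doc_term, k):
    """Rank-k LSA: truncated SVD of a (documents x terms) count matrix.

    Returns document coordinates and term directions in the k-dimensional
    latent space. This is textbook LSA, not the hk-LSA variant.
    """
    # Thin SVD: doc_term = U @ diag(s) @ Vt, singular values in descending order.
    U, s, Vt = np.linalg.svd(doc_term, full_matrices=False)
    doc_topics = U[:, :k] * s[:k]   # documents projected onto the top-k factors
    term_topics = Vt[:k, :].T       # one k-dimensional vector per term
    return doc_topics, term_topics

def cosine(a, b):
    """Cosine similarity between two latent vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Hypothetical toy corpus: rows are documents, columns are term counts.
# Documents 0-1 use only terms 0-1; documents 2-3 use only terms 2-3.
X = np.array([
    [2, 1, 0, 0],
    [1, 2, 0, 0],
    [0, 0, 1, 3],
    [0, 0, 3, 1],
], dtype=float)

doc_topics, term_topics = lsa(X, k=2)
same_topic = cosine(doc_topics[0], doc_topics[1])   # documents sharing vocabulary
cross_topic = cosine(doc_topics[0], doc_topics[2])  # documents with disjoint vocabulary
```

In this toy case, documents that share vocabulary end up aligned in the latent space while documents with disjoint vocabulary stay orthogonal. Note that the factors `U` and `Vt` are dense even when `X` is sparse, which is the kind of cost the abstract refers to as the density of the orthogonal matrices.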
Pages: 588-593
Number of pages: 6