An Adaptive Latent Semantic Analysis for Text mining

被引:0
|
作者
Hong T. Tu [1 ]
Tuoi T. Phan [1 ]
Khu P. Nguyen [2 ]
机构
[1] HCMC Univ Technol & Educ, VNU HCMC, 268 LyThuong Kiet, Hcmc, Vietnam
[2] UIT, VNU HCMC, Ward 6, Thuduc Dist, Hcmc, Vietnam
关键词
Latent semantic analysis; convex optimization; regularization; coordinate descent; matrix decomposition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Latent Semantic Analysis or LSA uses a method of singular value decomposition of co-occurrence document-term matrix to derive a latent class model. Despite its success, there are some shortcomings in this technique. Recent works have improved the standard LSA using method of probability distribution, regularization, sparseness constraint. But there are still some other deficiencies. It is dealt with this paper, an adapted technique called hk-LSA based on reducing dimension of vector space and like-probabilistic relationships between document and latent-topic space is proposed. The adaptive technique overcomes some weak points of LSA such as processing density of orthogonal matrices, complexity in matrix decomposition, facing with alternative iteration algorithms, etc. The experiments show consistent and substantial improvements of the hk-LSA over LSA.
引用
收藏
页码:588 / 593
页数:6
相关论文
共 50 条
  • [21] Semantic Pattern Mining for Text Mining
    Song, Xiaoli
    Wang, XiaoTong
    Hu, Xiaohua
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 150 - 155
  • [22] Web information mining and semantic analysis in heterogeneous unstructured text data using enhanced latent Dirichlet allocation
    Venugopal, Madamanchi
    Sharma, Virendra K.
    Sharma, Kalpana
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (01):
  • [23] lsemantica: A command for text similarity based on latent semantic analysis
    Schwarz, Carlo
    STATA JOURNAL, 2019, 19 (01): : 129 - 142
  • [24] Web Text Classification Based on Improved Latent Semantic Analysis
    Wang, Lan
    Wan, Yuan
    2011 SECOND ETP/IITA CONFERENCE ON TELECOMMUNICATION AND INFORMATION (TEIN 2011), VOL 1, 2011, : 176 - 179
  • [25] An efficient framework of utilizing the latent semantic analysis in text extraction
    Ababneh, Ahmad Hussein
    Lu, Joan
    Xu, Qiang
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (03) : 785 - 815
  • [26] Latent semantic analysis for text categorization using neural network
    Yu, Bo
    Xu, Zong-ben
    Li, Cheng-hua
    KNOWLEDGE-BASED SYSTEMS, 2008, 21 (08) : 900 - 904
  • [27] NLP Based Latent Semantic Analysis for Legal Text Summarization
    Merchant, Kaiz
    Pande, Yash
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 1803 - 1807
  • [28] Text summarization using a trainable summarizer and latent semantic analysis
    Yeh, JY
    Ke, HR
    Yang, WP
    Meng, IH
    INFORMATION PROCESSING & MANAGEMENT, 2005, 41 (01) : 75 - 95
  • [29] A Comprehensive Method for Text Summarization Based on Latent Semantic Analysis
    Wang, Yingjie
    Ma, Jun
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2013, 2013, 400 : 394 - 401
  • [30] Text Clustering Based on Domain Ontology and Latent Semantic Analysis
    Li Yaxiong
    Pan Deng
    MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 3536 - +