A Local Latent Semantic Analysis-based Kernel for Document Similarities

被引:0
|
作者
Aseervatham, Sujeevan [1 ]
机构
[1] Univ Paris 13, CNRS, LIPN, UMR 7030, F-93430 Villetaneuse, France
关键词
D O I
10.1109/IJCNN.2008.4633792
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The document similarity measure is a key point in textual data processing. It is the main responsible of the performance of a processing system. Since a decade, kernels are used as similarity functions within inner-product based algorithms such as the SVM for NLP problems and especially for text categorization. In this paper, we present a semantic space constructed from latent concepts. The concepts are extracted using the Latent Semantic Analysis (LSA). To take into account of the specificity of each document category, we use the local LSA to define the global semantic space. Furthermore, we propose a weighted semantic kernel for the global space. The experimental results of the kernel, on text categorization tasks, show that this kernel performs better than global LSA kernels and especially for small LSA dimensions.
引用
收藏
页码:214 / 219
页数:6
相关论文
共 50 条
  • [31] Kernel Local Fisher Discriminant Analysis-Based Prediction on Protein O-Glycosylation Sites Using SVM
    Yang, Xuemei
    Sun, Shiliang
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, ICIC 2015, PT III, 2015, 9227 : 700 - 705
  • [32] Using latent semantic analysis to identify similarities in source code to support program understanding
    Maletic, JI
    Marcus, A
    12TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2000, : 46 - 53
  • [33] Metaphor Analysis Method Based on Latent Semantic Analysis
    陶然
    卫亚萍
    杨唐峰
    JournalofDonghuaUniversity(EnglishEdition), 2021, 38 (01) : 83 - 90
  • [34] Visualizing Document Similarity Using N-Grams and Latent Semantic Analysis
    Hussein, Ashraf S.
    PROCEEDINGS OF THE 2016 SAI COMPUTING CONFERENCE (SAI), 2016, : 269 - 279
  • [35] Document Space Dimension Reduction by Latent Semantic Analysis and Hebbian Neural Network
    Mokris, I.
    Skovajsova, L.
    2008 6TH INTERNATIONAL SYMPOSIUM ON INTELLIGENT SYSTEMS AND INFORMATICS, 2008, : 60 - 63
  • [36] Self-Organising Map for Document Categorization Using Latent Semantic Analysis
    Mahalakshmi, B.
    Duraiswamy, K.
    2010 INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGIES (ICICT), 2010,
  • [37] Tutoring systems based on latent semantic analysis
    Lemaire, B
    ARTIFICIAL INTELLIGENCE IN EDUCATION: OPEN LEARNING ENVIRONMENTS: NEW COMPUTATIONAL TECHNOLOGIES TO SUPPORT LEARNING, EXPLORATION AND COLLABORATION, 1999, 50 : 527 - 534
  • [38] Improved spoken document summarization using Probabilistic Latent Semantic Analysis (PLSA)
    Kong, Sheng-Yi
    Lee, Lin-shan
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 941 - 944
  • [39] Update Summarization Based on Latent Semantic Analysis
    Steinberger, Josef
    Jezek, Karel
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2009, 5729 : 77 - 84
  • [40] LATENT SEMANTIC ANALYSIS BASED ON SPACE INTEGRATION
    Cai, Dongfeng
    Chang, Liwei
    Ji, Duo
    2012 IEEE 2ND INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENT SYSTEMS (CCIS) VOLS 1-3, 2012, : 1430 - 1434