Linguistic feature extraction using independent component analysis

被引:8
|
作者
Honkela, T [1 ]
Hyvärinen, A [1 ]
机构
[1] Aalto Univ, Neural Networks Res Ctr, Lab Comp & Informat Sci, Helsinki, Finland
关键词
D O I
10.1109/IJCNN.2004.1379914
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Our aim is to find syntactic and semantic relationships of words based on the analysis of corpora. We propose the application of independent component analysis, which seems to have clear advantages over two classic methods: latent semantic analysis and self-organizing maps. Latent semantic analysis is a simple method for automatic generation of concepts that are useful, e.g., in encoding documents for information retrieval purposes. However, these concepts cannot easily be interpreted by humans. Self-organizing maps can be used to generate an explicit diagram which characterizes the relationships between words. The resulting map reflects syntactic categories in the overall organization and semantic categories in the local level. The self-organizing map does not, however, provide any explicit distinct categories for the words. Independent component analysis applied on word context data gives distinct features which reflect syntactic and semantic categories. Thus, independent component analysis gives features or categories that are both explicit and can easily be interpreted by humans. This result can be obtained without any human supervision or tagged corpora that would have some predetermined morphological, syntactic or semantic information.
引用
收藏
页码:279 / 284
页数:6
相关论文
共 50 条
  • [41] SIGNAL FEATURE EXTRACTION BASED UPON INDEPENDENT COMPONENT ANALYSIS AND WAVELET TRANSFORM
    Ji Zhong Jin Tao Qin Shuren College of Mechanical Engineering
    Chinese Journal of Mechanical Engineering, 2005, (01) : 123 - 126
  • [42] Extracting Epileptic Feature Spikes Using Independent Component Analysis
    颜红梅
    夏阳
    Journal of Electronic Science and Technology, 2005, (04) : 369 - 371
  • [43] Feature extraction using evolutionary weighted principal component analysis
    Liu, N
    Wang, H
    INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS, 2005, : 346 - 350
  • [44] Hyperspectral Feature Extraction Using Total Variation Component Analysis
    Rasti, Behnood
    Ulfarsson, Magnus Orn
    Sveinsson, Johannes R.
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12): : 6976 - 6985
  • [45] Fault feature extraction using independent component analysis with reference and its application on fault diagnosis of rotating machinery
    Gang Yu
    Neural Computing and Applications, 2015, 26 : 187 - 198
  • [46] Fault feature extraction using independent component analysis with reference and its application on fault diagnosis of rotating machinery
    Yu, Gang
    NEURAL COMPUTING & APPLICATIONS, 2015, 26 (01): : 187 - 198
  • [47] Linguistic feature analysis for protein interaction extraction
    Fayruzov, Timur
    De Cock, Martine
    Cornelis, Chris
    Hoste, Veronique
    BMC BIOINFORMATICS, 2009, 10
  • [48] Watermark detection and extraction using independent component analysis method
    Yu, D
    Sattar, F
    Ma, KK
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2002, 2002 (01) : 92 - 104
  • [49] Novel feature extraction method for signal analysis based on independent component analysis and wavelet transform
    Topolski, Mariusz
    Kozal, Jedrzej
    PLOS ONE, 2021, 16 (12):
  • [50] Linguistic feature analysis for protein interaction extraction
    Timur Fayruzov
    Martine De Cock
    Chris Cornelis
    Veronique Hoste
    BMC Bioinformatics, 10