Linguistic feature extraction using independent component analysis

被引:8
|
作者
Honkela, T [1 ]
Hyvärinen, A [1 ]
机构
[1] Aalto Univ, Neural Networks Res Ctr, Lab Comp & Informat Sci, Helsinki, Finland
关键词
D O I
10.1109/IJCNN.2004.1379914
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Our aim is to find syntactic and semantic relationships of words based on the analysis of corpora. We propose the application of independent component analysis, which seems to have clear advantages over two classic methods: latent semantic analysis and self-organizing maps. Latent semantic analysis is a simple method for automatic generation of concepts that are useful, e.g., in encoding documents for information retrieval purposes. However, these concepts cannot easily be interpreted by humans. Self-organizing maps can be used to generate an explicit diagram which characterizes the relationships between words. The resulting map reflects syntactic categories in the overall organization and semantic categories in the local level. The self-organizing map does not, however, provide any explicit distinct categories for the words. Independent component analysis applied on word context data gives distinct features which reflect syntactic and semantic categories. Thus, independent component analysis gives features or categories that are both explicit and can easily be interpreted by humans. This result can be obtained without any human supervision or tagged corpora that would have some predetermined morphological, syntactic or semantic information.
引用
收藏
页码:279 / 284
页数:6
相关论文
共 50 条
  • [31] Electrogastrogram extraction using independent component analysis with references
    Cheng Peng
    Xiang Qian
    Datian Ye
    Neural Computing and Applications, 2007, 16 : 581 - 587
  • [32] Electrogastrogram extraction using independent component analysis with references
    Peng, Cheng
    Qian, Xiang
    Ye, Datian
    NEURAL COMPUTING & APPLICATIONS, 2007, 16 (06): : 581 - 587
  • [33] Unsupervised feature extraction of in vivo magnetic resonance spectra of brain tumours using independent component analysis
    Ladroue, C
    Tate, AR
    Howe, FA
    Griffiths, JR
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2002, 2002, 2412 : 441 - 446
  • [34] Independent component analysis applied to feature extraction for robust automatic speech recognition
    Potamitis, L
    Fakotakis, N
    Kokkinakis, G
    ELECTRONICS LETTERS, 2000, 36 (23) : 1977 - 1978
  • [35] A new method of image feature extraction and denoising based on independent component analysis
    Yu, Ying
    Yang, Jian
    2006 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS, VOLS 1-3, 2006, : 380 - +
  • [36] ECG arrhythmias recognition system based on independent component analysis feature extraction
    Jiang, Xing
    Zhang, Liqing
    Zhao, Qibin
    Albayrak, Sahin
    TENCON 2006 - 2006 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2006, : 1308 - +
  • [37] Machinery fault feature extraction based on independent component analysis and correlation coefficient
    Zhao, Zhi-Hong
    Yang, Shao-Pu
    Shen, Yong-Jun
    Zhendong yu Chongji/Journal of Vibration and Shock, 2013, 32 (06): : 67 - 72
  • [38] Hybrid Independent Component Analysis and Rough Set Approach for Audio Feature Extraction
    Xin He
    Ling Guo
    Jianyu Wang
    Xianzhong Zhou
    PROCEEDINGS OF THE 2008 CHINESE CONFERENCE ON PATTERN RECOGNITION (CCPR 2008), 2008, : 412 - +
  • [39] Independent component analysis applied to feature extraction from colour and stereo images
    Hoyer, PO
    Hyvärinen, A
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2000, 11 (03) : 191 - 210
  • [40] Image feature extraction and Poisson noise removal based on independent component analysis
    Department of Information Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China
    Guangdian Gongcheng, 2006, 11 (128-132):