Mining text for word senses using independent component analysis

被引:0
|
作者
Rapp, R [1 ]
机构
[1] Univ Mainz, D-6500 Mainz, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The assumption that the problem of ambiguity in text analysis can only be solved if statistical dependencies of higher than second order are considered leads us to independent component analysis (ICA), a statistical formalism that takes higher-order dependencies into account. By assuming independence, ICA is capable of detecting a set of hidden vectors if only different linear mixtures of these vectors are observable. As a test case for ICA's applicability to natural language processing we look at the task of word sense induction. Our starting point is that we consider the co-occurrence vector of an ambiguous word as a linear mixture of its unknown sense vectors. If corpora from different domains are available, this should give us the different linear Mixtures that are required for ICA. It turns out that the independent sense vectors derived by ICA from the distributional differences of word usage reflect a word's meanings surprisingly well.
引用
收藏
页码:422 / 426
页数:5
相关论文
共 50 条
  • [1] Using WordNet to disambiguate word senses for text classification
    Liu, Ying
    Scheuermann, Peter
    Li, Xingsen
    Zhu, Xingquan
    [J]. COMPUTATIONAL SCIENCE - ICCS 2007, PT 3, PROCEEDINGS, 2007, 4489 : 781 - +
  • [2] Mining HIV dynamics using independent component analysis
    Draghici, S
    Graziano, F
    Kettoola, S
    Sethi, I
    Towfic, G
    [J]. BIOINFORMATICS, 2003, 19 (08) : 981 - 986
  • [3] Weather data mining using independent component analysis
    Basak, J
    Sudarshan, A
    Trivedi, D
    Santhanam, MS
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 5 : 239 - 253
  • [4] Discovering word senses from text using random indexing
    Chatterjee, Niladri
    Mohan, Shiwali
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2008, 4919 : 299 - +
  • [5] Mining EEG-fMRI using independent component analysis
    Eichele, Tom
    Calhoun, Vince D.
    Debener, Stefan
    [J]. INTERNATIONAL JOURNAL OF PSYCHOPHYSIOLOGY, 2009, 73 (01) : 53 - 61
  • [6] Data mining with independent component analysis
    Wang, Fasong
    Li, Hongwei
    Li, Rui
    [J]. WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 6043 - +
  • [7] An effective text mining framework using adaptive principle component analysis
    K. Kala
    [J]. Multimedia Tools and Applications, 2022, 81 : 44467 - 44485
  • [8] An effective text mining framework using adaptive principle component analysis
    Kala, K.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (30) : 44467 - 44485
  • [9] Justice of the Marquesa: A Twitter Trend Analysis Using Text Mining and Word Clouds
    Valle-Cruz, David
    Vega-Hernandez, Josue E.
    Sandoval-Almazan, Rodrigo
    [J]. DG.O 2017: THE PROCEEDINGS OF THE 18TH ANNUAL INTERNATIONAL CONFERENCE ON DIGITAL GOVERNMENT RESEARCH: INNOVATIONS AND TRANSFORMATIONS IN GOVERNMENT, 2017, : 592 - 593
  • [10] MINING MULTIMODAL DATA WITH INDEPENDENT COMPONENT ANALYSIS
    Eichele, Tom
    [J]. PSYCHOPHYSIOLOGY, 2009, 46 : S5 - S5