Mining text for word senses using independent component analysis

被引:0
|
作者
Rapp, R [1 ]
机构
[1] Univ Mainz, D-6500 Mainz, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The assumption that the problem of ambiguity in text analysis can only be solved if statistical dependencies of higher than second order are considered leads us to independent component analysis (ICA), a statistical formalism that takes higher-order dependencies into account. By assuming independence, ICA is capable of detecting a set of hidden vectors if only different linear mixtures of these vectors are observable. As a test case for ICA's applicability to natural language processing we look at the task of word sense induction. Our starting point is that we consider the co-occurrence vector of an ambiguous word as a linear mixture of its unknown sense vectors. If corpora from different domains are available, this should give us the different linear Mixtures that are required for ICA. It turns out that the independent sense vectors derived by ICA from the distributional differences of word usage reflect a word's meanings surprisingly well.
引用
收藏
页码:422 / 426
页数:5
相关论文
共 50 条
  • [41] ICAR:: Independent component analysis using redundancies
    Albera, L
    Ferréol, A
    Chevalier, P
    Comon, P
    [J]. 2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 5, PROCEEDINGS, 2004, : 672 - 675
  • [42] Independent component analysis using multilayer networks
    Li, Weiqin
    Zhang, Haibo
    Zhao, Feng
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (11) : 856 - 859
  • [43] a Gait recognition using independent component analysis
    Lu, JW
    Zhang, EH
    Zhang, ZG
    Xue, YX
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 2, PROCEEDINGS, 2005, 3497 : 183 - +
  • [44] Independent component analysis using a genetic algorithm
    Hillis, DB
    Sadler, BM
    Swami, A
    [J]. APPLICATIONS AND SCIENCE OF COMPUTATIONAL INTELLIGENCE III, 2000, 4055 : 208 - 218
  • [45] Independent Component Analysis Using Bregman Divergences
    Wang, Xi
    Fyfe, Colin
    [J]. TRENDS IN APPLIED INTELLIGENT SYSTEMS, PT II, PROCEEDINGS, 2010, 6097 : 627 - 636
  • [46] Group independent component analysis of language fMRI from word generation tasks
    Tie, Yanmei
    Whalen, Stephen
    Suarez, Ralph O.
    Golby, Alexandra J.
    [J]. NEUROIMAGE, 2008, 42 (03) : 1214 - 1225
  • [47] Circular economy conceptualization using text mining analysis
    Alizadeh, Morteza
    Kashef, Amirarash
    Wang, Yu
    Wang, Jun
    Kremer, Gill E. Okudan
    Ma, Junfeng
    [J]. SUSTAINABLE PRODUCTION AND CONSUMPTION, 2023, 35 : 643 - 654
  • [48] Business documents analysis using text mining techniques
    Almanaseer, Orabe
    Alkhaleefah, Mohammad
    Elmanaseer, Sakha'a
    [J]. International Review on Computers and Software, 2012, 7 (04) : 1663 - 1677
  • [49] Chat analysis to understand students using text mining
    Yao Leiyue
    Xiong Jianying
    [J]. 2011 INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND NEURAL COMPUTING (FSNC 2011), VOL I, 2011, : 358 - 361
  • [50] Circular economy conceptualization using text mining analysis
    Alizadeh, Morteza
    Kashef, Amirarash
    Wang, Yu
    Wang, Jun
    Kremer, Gul E. Okudan
    Ma, Junfeng
    [J]. SUSTAINABLE PRODUCTION AND CONSUMPTION, 2023, 35