Websom for textual data mining

被引:54
|
作者
Lagus, K [1 ]
Honkela, T [1 ]
Kaski, S [1 ]
Kohonen, T [1 ]
机构
[1] Aalto Univ, Neural Networks Res Ctr, Lab Comp & Informat Sci, FIN-02015 Espoo, Finland
关键词
data mining; document filtering; exploratory data analysis; information retrieval; self-organizing map; SOM; text document collection; WEBSOM;
D O I
10.1023/A:1006586221250
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
New methods that are user-friendly and efficient are needed for guidance among the masses of textual information available in the Internet and the World Wide Web. We have developed a method and a tool called the WEBSOM which utilizes the self-organizing map algorithm (SOM) for organizing large collections of text documents onto visual document maps. The approach to processing text is statistically oriented, computationally feasible, and scalable - over a million text documents have been ordered on a single map. In the article we consider different kinds of information needs and tasks regarding organizing, visualizing, searching, categorizing and filtering textual data. Furthermore, we discuss and illustrate with examples how document maps can aid in these situations. An example is presented where a document map is utilized as a tool for visualizing and filtering a stream of incoming electronic mail messages.
引用
收藏
页码:345 / 364
页数:20
相关论文
共 50 条
  • [1] Websom for Textual Data Mining
    Krista Lagus
    Timo Honkela
    Samuel Kaski
    Teuvo Kohonen
    [J]. Artificial Intelligence Review, 1999, 13 : 345 - 364
  • [2] Textual data mining by parsing
    Bellacicco, A
    [J]. DATA MINING III, 2002, 6 : 311 - 319
  • [3] Reviewing textual data mining in japan
    Ohsumi, N
    Yasuda, A
    [J]. SOCIOLOGICAL THEORY AND METHODS, 2004, 19 (02) : 135 - 159
  • [4] Mining massive document collections by the WEBSOM method
    Lagus, K
    Kaski, S
    Kohonen, T
    [J]. INFORMATION SCIENCES, 2004, 163 (1-3) : 135 - 156
  • [5] Detecting emerging concepts in textual data mining
    Pottenger, WM
    Yang, TH
    [J]. COMPUTATIONAL INFORMATION RETRIEVAL, 2001, : 89 - 105
  • [6] Mining causality knowledge from textual data
    Pechsiri, C
    Kawtrakul, A
    Piriyakul, R
    [J]. PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND APPLICATIONS, 2006, : 85 - +
  • [7] Mining explanation knowledge from textual data
    Pechsiri, Chaveevan
    Kawtrakul, Asance
    Piriyakul, Rapepun
    [J]. PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER SCIENCE AND TECHNOLOGY, 2006, : 322 - +
  • [8] A software infrastructure for research in textual data mining
    Holzman, LE
    Fisher, TA
    Galitsky, LM
    Kontostathis, A
    Pottenger, WM
    [J]. 15TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, : 112 - 121
  • [9] Textual Data Mining to Support Science and Technology Management
    Paul Losiewicz
    Douglas W. Oard
    Ronald N. Kostoff
    [J]. Journal of Intelligent Information Systems, 2000, 15 : 99 - 119
  • [10] Textual data mining to support science and technology management
    Losiewicz, P
    Oard, DW
    Kostoff, RN
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2000, 15 (02) : 99 - 119