An efficient tool for discovering simple combinatorial patterns from large text databases

被引:0
|
作者
Arimura, H
Wataki, A
Fujino, R
Shimozono, S
Arikawa, S
机构
[1] Kyushu Univ, Dept Informat, Fukuoka 8128581, Japan
[2] Kyushu Inst Technol, Dept Artificial Intelligence, Iizuka, Fukuoka 8208502, Japan
来源
DISCOVERY SCIENCE | 1998年 / 1532卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this poster, we present demonstration of a prototype system for efficient discovery of combinatorial patterns, called proximity word-association patterns, from a collection of texts. The algorithm computes the best k-proximity d-word patterns in almost linear expected time in the total input length n, which is drastically faster than a straightforward algorithm of O(n(2d+l)) time complexity.
引用
收藏
页码:393 / 394
页数:2
相关论文
共 50 条
  • [1] An Efficient Approach to Discovering Sequential Patterns in Large Databases
    Yen, Show-Jane
    Cho, Chung-Wen
    [J]. LECTURE NOTES IN COMPUTER SCIENCE <D>, 2000, 1910 : 685 - 690
  • [2] A fast algorithm for discovering optimal string patterns in large text databases
    Arimura, H
    Wataki, A
    Fujino, R
    Araikawa, S
    [J]. ALGORITHMIC LEARNING THEORY, 1998, 1501 : 247 - 261
  • [3] An efficient approach to discovering knowledge from large databases
    Yen, SJ
    Chen, ALP
    [J]. PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED INFORMATION SYSTEMS, 1996, : 8 - 18
  • [4] Fast algorithm to discovering sequential patterns from large databases
    Hu Huirong
    [J]. PROCEEDINGS OF THE 24TH CHINESE CONTROL CONFERENCE, VOLS 1 AND 2, 2005, : 1352 - 1355
  • [5] Efficient discovery of optimal word-association patterns in large text databases
    Shimozono, S
    Arimura, H
    Arikawa, S
    [J]. NEW GENERATION COMPUTING, 2000, 18 (01) : 49 - 60
  • [6] Efficient discovery of optimal word-association patterns in large text databases
    Shinichi Shimozono
    Hiroki Arimura
    Setsuo Arikawa
    [J]. New Generation Computing, 2000, 18 : 49 - 60
  • [7] On discovering "potentially useful" patterns from databases
    Xie, Ying
    Johnsten, Tom
    Raghavan, Vijay V.
    Ramachandran, K.
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, 2006, : 494 - +
  • [8] Discovering patterns of medical practice in large administrative health databases
    Semenova, T
    [J]. DATA & KNOWLEDGE ENGINEERING, 2004, 51 (02) : 149 - 160
  • [9] Discovering association patterns in large spatio-temporal databases
    Lee, Eric M. H.
    Chan, Keith C. C.
    [J]. ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 349 - +
  • [10] Efficient discovery of new information in large text databases
    Bradford, RB
    [J]. INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2005, 3495 : 374 - 380