Efficient discovery of optimal word-association patterns in large text databases

被引:0
|
作者
Shinichi Shimozono
Hiroki Arimura
Setsuo Arikawa
机构
[1] Kyushu Inst of Tech.,Dept. of Artificial Intelligence
[2] Kyushu Univ.,Dept. of Informatics
[3] Precursory Research for Embryonic Science and Technology at Japan Science and Technology Corporation,undefined
来源
New Generation Computing | 2000年 / 18卷
关键词
Text Databases; Data Mining; Optimization; Proximity Word-association Patterns; Discovery Science;
D O I
暂无
中图分类号
学科分类号
摘要
We study efficient discovery of proximity word-association patterns, defined by a sequence of strings and a proximity gap, from a collection of texts with the positive and the negative labels. We present an algorithm that finds alld-stringsk-proximity word-association patterns that maximize the number of texts whose matching agree with their labels. It runs in expected time complexityO(kd−1n logdn) and spaceO(kd−1n) with the total lengthn of texts, if texts are uniformly random strings. We also show that the problem to find one of the best word-association patterns with arbitrarily many strings in MAX SNP-hard.
引用
收藏
页码:49 / 60
页数:11
相关论文
共 50 条
  • [1] Efficient discovery of optimal word-association patterns in large text databases
    Shimozono, S
    Arimura, H
    Arikawa, S
    [J]. NEW GENERATION COMPUTING, 2000, 18 (01) : 49 - 60
  • [2] Efficient discovery of new information in large text databases
    Bradford, RB
    [J]. INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2005, 3495 : 374 - 380
  • [3] An efficient algorithm for pattern discovery in large text databases
    Li, D
    Wang, K
    Deogun, JS
    Donis, RO
    [J]. IKE'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2003, : 96 - 102
  • [4] An Optimal Algorithm for Matching String Patterns in Large Text Databases
    Kumar, K. S. M. V.
    Raju, S. Viswanadha
    Govardha, Ka.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2013, 13 (06): : 31 - 40
  • [5] Word-association patterns of mental hospital patients
    Klionsky, EJ
    Okawa, JB
    Holmstrom, RW
    Silber, DE
    Karp, SA
    [J]. PSYCHOLOGICAL REPORTS, 1998, 83 (03) : 1419 - 1424
  • [6] RESPONSE PATTERNS IN A CONTINUOUS WORD-ASSOCIATION TASK
    YAMA, M
    [J]. JAPANESE JOURNAL OF PSYCHOLOGY, 1986, 57 (05): : 287 - 292
  • [7] Efficient Discovery of Partial Periodic Patterns in Large Temporal Databases
    Kiran, Rage Uday
    Veena, Pamalla
    Ravikumar, Penugonda
    Saideep, Chennupati
    Zettsu, Koji
    Shang, Haichuan
    Toyoda, Masashi
    Kitsuregawa, Masaru
    Reddy, P. Krishna
    [J]. ELECTRONICS, 2022, 11 (10)
  • [8] Discovery of direct and indirect association patterns in large transaction databases
    Ouyang, Weimin
    Luo, Shuanghu
    Huang, Qinhua
    [J]. CIS: 2007 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PROCEEDINGS, 2007, : 167 - +
  • [9] A fast algorithm for discovering optimal string patterns in large text databases
    Arimura, H
    Wataki, A
    Fujino, R
    Araikawa, S
    [J]. ALGORITHMIC LEARNING THEORY, 1998, 1501 : 247 - 261
  • [10] Efficient discovery of periodic-frequent patterns in very large databases
    Kiran, R. Uday
    Kitsuregawa, Masaru
    Reddy, P. Krishna
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2016, 112 : 110 - 121