Mining border descriptions of emerging patterns from dataset pairs

被引:0
|
作者
Guozhu Dong
Jinyan Li
机构
[1] Wright State University,Department of Computer Science and Engineering
[2] Institute for Infocomm Research,undefined
来源
关键词
Border algorithms; Border descriptions; Changes; Classification rules; Contrasts; Differences; Emerging patterns; Minimal/maximal patterns; Trends;
D O I
暂无
中图分类号
学科分类号
摘要
The mining of changes or differences or other comparative patterns from a pair of datasets is an interesting problem. This paper is focused on the mining of one type of comparative pattern called emerging patterns. Emerging patterns are denoted by EPs and are defined as patterns for which support increases from one dataset to the other with a big ratio. The number of EPs is sometimes huge. To provide a good structure for and to reduce the size of mining results, we use borders to concisely describe large collections of EPs in a lossless way. Such a border consists of only the minimal (under set inclusion) and the maximal EPs in the collection. We also present an algorithm for efficiently computing the borders of some desired EPs by manipulating the input borders only. Our experience with many datasets in the UCI Repository and recent cancer diagnosis datasets demonstrated that: Both the EP pattern type and our algorithm are useful for building accurate classifiers and useful for mining multifactor interactions, for example, minimal gene groups potentially responsible for the development of cancer.
引用
下载
收藏
页码:178 / 202
页数:24
相关论文
共 50 条
  • [31] Observation of sales trends by mining emerging patterns in dynamic markets
    Cheng-Hsiung Weng
    Tony, Cheng-Kui Huang
    Applied Intelligence, 2018, 48 : 4515 - 4529
  • [32] Efficient Mining of Jumping Emerging Patterns with Occurrence Counts for Classification
    Kobylinski, Lukasz
    Walczak, Krzysztof
    TRANSACTIONS ON ROUGH SETS XIII, 2011, 6499 : 73 - 88
  • [33] Discovery of emerging design patterns in ontologies using tree mining
    Lawrynowicz, Agnieszka
    Potoniec, Jedrzej
    Robaczyk, Michal
    Tudorache, Tania
    SEMANTIC WEB, 2018, 9 (04) : 517 - 544
  • [34] Observation of sales trends by mining emerging patterns in dynamic markets
    Weng, Cheng-Hsiung
    Tony, Cheng-Kui Huang
    APPLIED INTELLIGENCE, 2018, 48 (11) : 4515 - 4529
  • [35] Mining Closed Colossal Frequent Patterns from High-Dimensional Dataset: Serial Versus Parallel Framework
    Sureshan, Sudeep
    Penumacha, Anusha
    Jain, Siddharth
    Vanahalli, Manjunath
    Patil, Nagamma
    PROGRESS IN INTELLIGENT COMPUTING TECHNIQUES: THEORY, PRACTICE, AND APPLICATIONS, VOL 1, 2018, 518 : 317 - 326
  • [36] Mining from incomplete patterns
    Onet, Adrian
    2009 INTERNATIONAL CONFERENCE ON NEW TRENDS IN INFORMATION AND SERVICE SCIENCE (NISS 2009), VOLS 1 AND 2, 2009, : 394 - 399
  • [37] Mining Frequent Closed Itemsets from Distributed Dataset
    Ju, Chunhua
    Ni, Dongjun
    PROCEEDINGS OF THE 2008 INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN, VOL 1, 2008, : 37 - 41
  • [38] Mining search-phrase definitions from item descriptions
    Nguyen, Hung V.
    Davulcu, Hasan
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1346 - 1348
  • [39] Mining Semantic Descriptions of Bioinformatics Web Resources from the Literature
    Afzal, Hammad
    Stevens, Robert
    Nenadic, Goran
    SEMANTIC WEB: RESEARCH AND APPLICATIONS, 2009, 5554 : 535 - 549
  • [40] Bridging Causal Relevance and Pattern Discriminability: Mining Emerging Patterns from High-Dimensional Data
    Yu, Kui
    Ding, Wei
    Wang, Hao
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (12) : 2721 - 2739