Mining unknown patterns in data when the features are correlated

被引:0
|
作者
Lynch, Robert S., Jr. [1 ]
Willett, Peter K. [2 ]
机构
[1] USN, Undersea Warfare Ctr, Signal Proc Branch, Newport, RI USA
[2] Univ Connecticut, ECE Dept, Storrs, CT USA
关键词
adaptive classification; noninformative prior; discrete data; unknown data distribution;
D O I
10.1117/12.719423
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a previously introduced data mining technique, utilizing the Mean Field Bayesian Data Reduction Algorithm (BDRA), is extended for use in finding unknown data clusters in a fused multidimensional feature space. In extending the BDRA for this application its built-in dimensionality reduction aspects are exploited for isolating and automatically mining all points contained in each unknown cluster. In previous work, this approach was shown to have comparable performance to the classifier that knows all cluster information when mining up to two features containing multiple unknown clusters. However, unlike results shown in previous work based on lower dimensional feature spaces, the results in this paper are based on utilizing up to twenty fused features. This is due to improvements in the training algorithm that now mines for candidate data clusters by processing all points in a quantized cell simultaneously. This is opposed to the previous method that processed all points sequentially. This improvement in processing has resulted in a substantial reduction in the run time of the algorithm. Finally, performance is illustrated and compared with simulated data containing multiple clusters, and where the relevant feature space contains both correlated and uncorrelated classification information.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Mining periodic patterns in sequence data
    Huang, KY
    Chang, CH
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2004, 3181 : 401 - 410
  • [32] Mining Sequential Patterns in Data Stream
    Huang, Qinhua
    Ouyang, Weimin
    ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 2, PROCEEDINGS, 2009, 5552 : 865 - 874
  • [33] Mining Patterns of Sensitive Data Usage
    Avdiienko, Vitalii
    2015 IEEE/ACM 37TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, VOL 2, 2015, : 891 - 894
  • [34] Data Mining for Discovering Patterns in Migration
    Franco-Arcega, Anilu
    Franco-Sanchez, Kristell D.
    Castro-Espinoza, Felix A.
    Garcia-Islas, Luis H.
    NATURE-INSPIRED COMPUTATION AND MACHINE LEARNING, PT II, 2014, 8857 : 285 - 295
  • [35] Mining Infrequent Patterns in Data Stream
    Lakshmi, R.
    Hemalatha, C. Sweetlin
    Vaidehi, V.
    2014 INTERNATIONAL CONFERENCE ON RECENT TRENDS IN INFORMATION TECHNOLOGY (ICRTIT), 2014,
  • [36] Mining for classes and patterns in behavioural data
    Adams, NM
    Hand, DJ
    Till, R
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2001, 52 (09) : 1017 - 1024
  • [37] Mining ordinal patterns for data cleaning
    Liu, YB
    Liu, DY
    PROCEEDINGS OF THE 2004 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI-2004), 2004, : 438 - 443
  • [38] Mining ordinal patterns for data cleaning
    Liu, Y. B.
    Liu, D. Y.
    COMPUTATIONAL METHODS, PTS 1 AND 2, 2006, : 1267 - +
  • [39] Database support for data mining patterns
    Kotsifakos, E
    Ntoutsi, I
    Theodoridis, Y
    ADVANCES IN INFORMATICS, PROCEEDINGS, 2005, 3746 : 14 - 24
  • [40] New data and features for advanced data mining in Manteia
    Tassy, Olivier
    NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) : D707 - D711