Mining unknown patterns in data when the features are correlated

被引:0
|
作者
Lynch, Robert S., Jr. [1 ]
Willett, Peter K. [2 ]
机构
[1] USN, Undersea Warfare Ctr, Signal Proc Branch, Newport, RI USA
[2] Univ Connecticut, ECE Dept, Storrs, CT USA
关键词
adaptive classification; noninformative prior; discrete data; unknown data distribution;
D O I
10.1117/12.719423
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a previously introduced data mining technique, utilizing the Mean Field Bayesian Data Reduction Algorithm (BDRA), is extended for use in finding unknown data clusters in a fused multidimensional feature space. In extending the BDRA for this application its built-in dimensionality reduction aspects are exploited for isolating and automatically mining all points contained in each unknown cluster. In previous work, this approach was shown to have comparable performance to the classifier that knows all cluster information when mining up to two features containing multiple unknown clusters. However, unlike results shown in previous work based on lower dimensional feature spaces, the results in this paper are based on utilizing up to twenty fused features. This is due to improvements in the training algorithm that now mines for candidate data clusters by processing all points in a quantized cell simultaneously. This is opposed to the previous method that processed all points sequentially. This improvement in processing has resulted in a substantial reduction in the run time of the algorithm. Finally, performance is illustrated and compared with simulated data containing multiple clusters, and where the relevant feature space contains both correlated and uncorrelated classification information.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Correlated-PCA: Principal Components' Analysis when Data and Noise are Correlated
    Vaswani, Namrata
    Guo, Han
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [42] Generating synthetic data to match data mining patterns
    Eno, Josh
    Thompson, Craig W.
    IEEE INTERNET COMPUTING, 2008, 12 (03) : 78 - 82
  • [43] Mining Co-location Patterns with Dominant Features
    Fang, Yuan
    Wang, Lizhen
    Wang, Xiaoxuan
    Zhou, Lihua
    WEB INFORMATION SYSTEMS ENGINEERING, WISE 2017, PT I, 2017, 10569 : 183 - 198
  • [44] A Heuristic for Combining Correlated Experts When There Are Few Data
    Soule, David
    Grushka-Cockayne, Yael
    Merrick, Jason
    MANAGEMENT SCIENCE, 2024, 70 (10) : 6637 - 6668
  • [45] Efficient factorial experiments when the data are spatially correlated
    Martin, RJ
    Eccleston, JA
    Chan, BSP
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2004, 126 (01) : 377 - 395
  • [46] An algorithm for the design of factorial experiments when the data are correlated
    Elliott, LJ
    Eccleston, JA
    Martin, RJ
    STATISTICS AND COMPUTING, 1999, 9 (03) : 195 - 201
  • [47] An algorithm for the design of factorial experiments when the data are correlated
    L. J. Elliott
    J. A. Eccleston
    R. J. Martin
    Statistics and Computing, 1999, 9 : 195 - 201
  • [48] Mining Attribute-structure Correlated Patterns in Large Attributed Graphs
    Silva, Arlei
    Meira, Wagner, Jr.
    Zaki, Mohammed J.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (05): : 466 - 477
  • [49] Mining Correlated Bursty Topic Patterns from Coordinated Text Streams
    Wang, Xuanhui
    Zhai, ChengXiang
    Hu, Xiao
    Sproat, Richard
    KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 784 - 793
  • [50] Effective approaches for mining correlated and low-average-cost patterns
    Liu, Xuan
    Chen, Genlang
    Wen, Shiting
    Zuo, Wanli
    KNOWLEDGE-BASED SYSTEMS, 2024, 302