Mining unknown patterns in data when the features are correlated

被引:0
|
作者
Lynch, Robert S., Jr. [1 ]
Willett, Peter K. [2 ]
机构
[1] USN, Undersea Warfare Ctr, Signal Proc Branch, Newport, RI USA
[2] Univ Connecticut, ECE Dept, Storrs, CT USA
关键词
adaptive classification; noninformative prior; discrete data; unknown data distribution;
D O I
10.1117/12.719423
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a previously introduced data mining technique, utilizing the Mean Field Bayesian Data Reduction Algorithm (BDRA), is extended for use in finding unknown data clusters in a fused multidimensional feature space. In extending the BDRA for this application its built-in dimensionality reduction aspects are exploited for isolating and automatically mining all points contained in each unknown cluster. In previous work, this approach was shown to have comparable performance to the classifier that knows all cluster information when mining up to two features containing multiple unknown clusters. However, unlike results shown in previous work based on lower dimensional feature spaces, the results in this paper are based on utilizing up to twenty fused features. This is due to improvements in the training algorithm that now mines for candidate data clusters by processing all points in a quantized cell simultaneously. This is opposed to the previous method that processed all points sequentially. This improvement in processing has resulted in a substantial reduction in the run time of the algorithm. Finally, performance is illustrated and compared with simulated data containing multiple clusters, and where the relevant feature space contains both correlated and uncorrelated classification information.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Are Information or Data Patterns Correlated with Consciousness?
    David Gamez
    Topoi, 2016, 35 : 225 - 239
  • [22] Mining of Classification Patterns in Clinical Data through Data Mining Algorithms
    Jacob, Shomona Gracia
    Ramani, R. Geetha
    PROCEEDINGS OF THE 2012 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI'12), 2012, : 997 - 1003
  • [23] Mobile user data mining: Mining relationship patterns
    Goh, J
    Taniar, D
    EMBEDDED AND UBIQUITOUS COMPUTING - EUC 2005, 2005, 3824 : 735 - 744
  • [24] Efficient mining of new concise representations of rare correlated patterns
    Bouasker, Souad
    Hamrouni, Tarek
    Ben Yahia, Sadok
    INTELLIGENT DATA ANALYSIS, 2015, 19 (02) : 359 - 390
  • [25] CCMine: Efficient mining of confidence-closed correlated patterns
    Kim, WY
    Lee, YK
    Han, JW
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2004, 3056 : 569 - 579
  • [26] Combined analysis of correlated data when data cannot be pooled
    Jones, Elinor M.
    Sheehan, Nuala A.
    Gaye, Amadou
    Laflamme, Philippe
    Burton, Paul
    STAT, 2013, 2 (01): : 72 - 85
  • [27] An algorithmic approach to mining unknown clusters in training data
    Lynch, Robert S., Jr.
    Willett, Peter K.
    DATA MINING, INTRUSION DETECTION, INFORMATION ASSURANCE, AND DATA NETWORKS SECURITY 2006, 2006, 6241
  • [28] System of data mining for bioinformatics patterns
    Altamiranda, J.
    Aguilar, J.
    Hernandez, L.
    IV LATIN AMERICAN CONGRESS ON BIOMEDICAL ENGINEERING 2007, BIOENGINEERING SOLUTIONS FOR LATIN AMERICA HEALTH, VOLS 1 AND 2, 2008, 18 (1,2): : 573 - 577
  • [29] Mining Regular Patterns in Data Streams
    Tanbeer, Syed Khairuzzaman
    Ahmed, Chowdhury Farhan
    Jeong, Byeong-Soo
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT I, PROCEEDINGS, 2010, 5981 : 399 - 413
  • [30] Data mining of user navigation patterns
    Borges, J
    Levene, M
    WEB USAGE ANALYSIS AND USER PROFILING, 2000, 1836 : 92 - 111