LOCAL PECULIARITY ORIENTED DATA MINING AND ITS APPLICATION IN OUTLIER DETECTION

被引:2
|
作者
Yang, Jian [1 ]
Zhong, Ning [1 ,2 ]
Yao, Yiyu [1 ,3 ]
Wang, Jue [4 ]
机构
[1] Beijing Univ Technol, Int WIC Inst, Beijing, Peoples R China
[2] Maebashi Inst Technol, Dept Life Sci & Informat, Maebashi, Gunma, Japan
[3] Univ Regina, Dept Comp Sci, Regina, SK S4S 0A2, Canada
[4] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Data mining; peculiarity factor; local peculiarity factor; local peculiarity oriented mining; outlier detection; RULE;
D O I
10.1142/S0219622012500319
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Peculiarity oriented mining (POM), aimed at discovering peculiarity rules hidden in a dataset, is a data mining method. Peculiarity factor (PF) is one of the most important concepts in POM. In this paper, it is proved that PF can accurately characterize the peculiarity of data sampled from a normal distribution. However, for a general one-dimensional distribution, it does not have the property. A local version of PF, called LPF, is proposed to solve the difficulty. LPF can effectively describe the peculiarity of data sampled from a continuous one-dimensional distribution. Based on LPF, a framework of local peculiarity oriented mining is presented, which consists of two steps, namely, peculiar data identification and peculiar data analysis. Two algorithms for peculiar data identification and a case study of peculiar data analysis are given to make the framework practical. Experiments on several benchmark datasets show their good performance.
引用
收藏
页码:1155 / 1181
页数:27
相关论文
共 50 条
  • [41] Outlier Detection in Spatial Databases Using Clustering Data Mining
    Karmaker, Amitava
    Rahman, Syed M.
    [J]. PROCEEDINGS OF THE 2009 SIXTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS, VOLS 1-3, 2009, : 1657 - +
  • [42] A Business- Data-Oriented Workflow Mining Algorithms and Its Application
    Wang, Yong
    Zhang, Jianchuan
    Cui, Jiahe
    Song, Hongtao
    Li, Zhigang
    [J]. 2015 EIGHTH INTERNATIONAL CONFERENCE ON INTERNET COMPUTING FOR SCIENCE AND ENGINEERING (ICICSE), 2015, : 231 - 236
  • [43] Enhanced approach for mining local outlier
    Jiang, Shengyi
    Li, Qinghua
    Wang, Hui
    Meng, Zhonglou
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2005, 42 (02): : 210 - 216
  • [44] RESEARCH OF OUTLIER MINING FRAMEWORK FOR DATA STREAMS BASED ON MULTI-AGENT SYSTEM AND ITS APPLICATION
    Li Zhongwei
    [J]. 2011 INTERNATIONAL CONFERENCE ON INSTRUMENTATION, MEASUREMENT, CIRCUITS AND SYSTEMS (ICIMCS 2011), VOL 3: COMPUTER-AIDED DESIGN, MANUFACTURING AND MANAGEMENT, 2011, : 271 - 274
  • [45] Functional outlier detection by a local depth with application to NOx levels
    Carlo Sguera
    Pedro Galeano
    Rosa E. Lillo
    [J]. Stochastic Environmental Research and Risk Assessment, 2016, 30 : 1115 - 1130
  • [46] Functional outlier detection by a local depth with application to NO x levels
    Sguera, Carlo
    Galeano, Pedro
    Lillo, Rosa E.
    [J]. STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2016, 30 (04) : 1115 - 1130
  • [47] Multivariate Conditional Outlier Detection and Its Clinical Application
    Hong, Charmgil
    Hauskrecht, Milos
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 4216 - 4217
  • [48] Local Entropies for Kernel Selection and Outlier Detection in Functional Data
    Martos, Gabriel
    Munoz, Alberto
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2015, 2015, 9423 : 611 - 618
  • [49] Fast Memory Efficient Local Outlier Detection in Data Streams
    Salehi, Mahsa
    Leckie, Christopher
    Bezdek, James C.
    Vaithianathan, Tharshan
    Zhang, Xuyun
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (12) : 3246 - 3260
  • [50] Comparison of local outlier detection techniques in spatial multivariate data
    Ernst, Marie
    Haesbroeck, Gentiane
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2017, 31 (02) : 371 - 399