LOCAL PECULIARITY ORIENTED DATA MINING AND ITS APPLICATION IN OUTLIER DETECTION

被引:2
|
作者
Yang, Jian [1 ]
Zhong, Ning [1 ,2 ]
Yao, Yiyu [1 ,3 ]
Wang, Jue [4 ]
机构
[1] Beijing Univ Technol, Int WIC Inst, Beijing, Peoples R China
[2] Maebashi Inst Technol, Dept Life Sci & Informat, Maebashi, Gunma, Japan
[3] Univ Regina, Dept Comp Sci, Regina, SK S4S 0A2, Canada
[4] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Data mining; peculiarity factor; local peculiarity factor; local peculiarity oriented mining; outlier detection; RULE;
D O I
10.1142/S0219622012500319
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Peculiarity oriented mining (POM), aimed at discovering peculiarity rules hidden in a dataset, is a data mining method. Peculiarity factor (PF) is one of the most important concepts in POM. In this paper, it is proved that PF can accurately characterize the peculiarity of data sampled from a normal distribution. However, for a general one-dimensional distribution, it does not have the property. A local version of PF, called LPF, is proposed to solve the difficulty. LPF can effectively describe the peculiarity of data sampled from a continuous one-dimensional distribution. Based on LPF, a framework of local peculiarity oriented mining is presented, which consists of two steps, namely, peculiar data identification and peculiar data analysis. Two algorithms for peculiar data identification and a case study of peculiar data analysis are given to make the framework practical. Experiments on several benchmark datasets show their good performance.
引用
收藏
页码:1155 / 1181
页数:27
相关论文
共 50 条
  • [1] Relational peculiarity oriented data mining
    Zhong, N
    Liu, CN
    Yao, YY
    Ohshima, M
    Huang, MX
    Huang, JJ
    [J]. FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 575 - 578
  • [2] Peculiarity Oriented EEG Data Stream Mining
    Motomura, Shinichi
    Ohshima, Muneaki
    Zhong, Ning
    [J]. BRAIN AND HEALTH INFORMATICS, 2013, 8211 : 147 - 157
  • [3] Peculiarity oriented mining in multiple human brain data
    Zhong, N
    Nakamaru, A
    Ohshima, M
    Wu, JL
    Mizuhara, H
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, 2003, 2690 : 742 - 750
  • [4] Peculiarity oriented multidatabase mining
    Zhong, N
    Yao, YY
    Ohshima, M
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (04) : 952 - 960
  • [5] Outlier Detection Algorithms in Data Mining
    Xi, Jingke
    [J]. 2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL I, PROCEEDINGS, 2008, : 94 - 97
  • [6] Data Streams Oriented Outlier Detection Method: A Fast Minimal Infrequent Pattern Mining
    Zhou, ZhongYu
    Pi, DeChang
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2021, 18 (06) : 864 - 870
  • [7] Relational peculiarity-oriented mining
    Muneaki Ohshima
    Ning Zhong
    YiYu Yao
    Chunnian Liu
    [J]. Data Mining and Knowledge Discovery, 2007, 15 : 249 - 273
  • [8] Relational peculiarity-oriented mining
    Ohshima, Muneaki
    Zhong, Ning
    Yao, Yiyu
    Liu, Chunnian
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2007, 15 (02) : 249 - 273
  • [9] Outlier detection algorithms in data mining systems
    Petrovskiy, MI
    [J]. PROGRAMMING AND COMPUTER SOFTWARE, 2003, 29 (04) : 228 - 237
  • [10] Outlier Detection Algorithms in Data Mining Systems
    M. I. Petrovskiy
    [J]. Programming and Computer Software, 2003, 29 : 228 - 237