A Bayesian Classification Algorithm Based on Selective Patterns

Cited by: 0

Authors:

Ju Z. [1,2]

Wang Z. [1]

Affiliations:

[1] School of Computer and Information Technology, Beijing Jiaotong University, Beijing

[2] Unit 32178, Beijing

Source:

Jisuanji Yanjiu yu Fazhan/Computer Research and Development | 2020 / Vol. 57 / No. 08

Funding:

National Natural Science Foundation of China;

Keywords:

Bayesian classifier; Classification; Dependency; Pattern discovery; Selective patterns;

DOI:

10.7544/issn1000-1239.2020.20200196

Chinese Library Classification (CLC) number:

Subject classification number:

Abstract:

Data mining concerns theories and methods for discovering knowledge from very large databases, and classification is one of its important topics. In classification research, the naïve Bayesian (NB) classifier is a simple but effective learning technique that has been widely used. It assumes that, given the class value, each attribute is independent of all other attributes. In many settings, however, the dependencies between attributes are more complex. Many researchers construct classifiers from specific patterns based on "attribute-value" pairs, and the dependencies among the attributes implied in the patterns and the other attributes have a significant impact on classification results; these dependencies are therefore exploited here. A Bayesian classification algorithm based on selective patterns is proposed. It not only makes use of the strong classification ability of Bayesian network classifiers, but also further weakens the conditional independence assumption by analyzing the dependencies between attributes in the patterns. Classification accuracy benefits from fully considering the characteristics of the datasets, mining and employing highly discriminative patterns, and building the dependency relationships between attributes in a proper way. Empirical results show that the average accuracy of the proposed algorithm on 10 datasets is 1.65% and 4.29% higher than that of the benchmark algorithms NB and AODE, respectively. © 2020, Science Press. All rights reserved.
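
For orientation, the two benchmark classifiers named in the abstract can be written in their standard textbook forms. This is a sketch of the baselines only, not of the proposed selective-pattern algorithm, whose formulation is not given in this record. The symbols are assumptions introduced here: y is a class value, x = (x_1, ..., x_n) is an instance with n attribute values, F(x_i) is the training frequency of value x_i, and m is a minimum-frequency threshold.

```latex
% Naive Bayes (NB): every attribute is conditionally independent
% of all other attributes given the class y.
\hat{P}_{\mathrm{NB}}(y \mid \mathbf{x}) \;\propto\; \hat{P}(y) \prod_{i=1}^{n} \hat{P}(x_i \mid y)

% Averaged one-dependence estimators (AODE): each sufficiently frequent
% attribute value x_i in turn acts as a "super-parent" of all other
% attributes, and the resulting one-dependence estimates are aggregated.
\hat{P}_{\mathrm{AODE}}(y \mid \mathbf{x}) \;\propto\; \sum_{i \,:\, F(x_i) \ge m} \hat{P}(y, x_i) \prod_{j=1}^{n} \hat{P}(x_j \mid y, x_i)
```

NB conditions each attribute on the class alone, while AODE lets each attribute also depend on one other attribute. This is the sense in which the abstract speaks of weakening the conditional independence assumption.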


Pages: 1605-1616

Page count: 11
