An Efficient Method for Mining Rare Association Rules: A Case Study on Air Pollution

被引:7
|
作者
Borah, Anindita [1 ]
Nath, Bhabesh [1 ]
机构
[1] Tezpur Univ, Dept Comp Sci & Engn, Tezpur, Assam, India
关键词
Air pollution; data mining; association rule; rare association rule; rare pattern; PATTERN; FREQUENT;
D O I
10.1142/S0218213021500184
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most pattern mining techniques almost singularly focus on identifying frequent patterns and very less attention has been paid to the generation of rare patterns. However, in several domains, recognizing less frequent but strongly related patterns have greater advantage over the former ones. Identification of compelling and meaningful rare associations among such patterns may proved to be significant for air quality management that has become an indispensable task in today's world. The rare correlations between air pollutants and other parameters may aid in restricting the air pollution to a manageable level. To this end, efficient and competent rare pattern mining techniques are needed that can generate the complete set of rare patterns, further identifying significant rare association rules among them. Moreover, a notable issue with databases is their continuous update over time due to the addition of new records. The users requirement or behavior may change with the incremental update of databases that makes it difficult to determine a suitable support threshold for the extraction of interesting rare association rules. This paper, presents an efficient rare pattern mining technique to capture the complete set of rare patterns from a real environmental dataset. The proposed approach does not restart the entire mining process upon threshold update and generates the complete set of rare association rules in a single database scan. It can effectively perform incremental mining and also provides flexibility to the user to regulate the value of support threshold for generating the rare patterns. Significant rare association rules representing correlations between air pollutants and other environmental parameters are further extracted from the generated rare patterns to identify the substantial causes of air pollution. Performance analysis shows that the proposed method is more efficient than existing rare pattern mining approaches in providing significant directions to the domain experts for air pollution monitoring.
引用
收藏
页数:35
相关论文
共 50 条
  • [1] CBAR: an efficient method for mining association rules
    Tsay, YJ
    Chiang, JY
    KNOWLEDGE-BASED SYSTEMS, 2005, 18 (2-3) : 99 - 105
  • [2] Scalable and Efficient Method for Mining Association Rules
    AlZoubi, Wael A.
    Abu Bakar, Azuraliza
    Omar, Khairuddin
    2009 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS, VOLS 1 AND 2, 2009, : 36 - 41
  • [3] A method for association rules mining
    Ma, J
    Chen, G
    Kerre, EE
    Ruan, D
    APPLIED COMPUTATIONAL INTELLIGENCE, 2004, : 173 - 178
  • [4] Efficient mining of intertransaction association rules
    Tung, AKH
    Lu, HJ
    Han, JW
    Feng, L
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (01) : 43 - 56
  • [5] Expert deduction rules in data mining with association rules: a case study
    Rauch, Jan
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 59 (01) : 167 - 195
  • [6] Expert deduction rules in data mining with association rules: a case study
    Jan Rauch
    Knowledge and Information Systems, 2019, 59 : 167 - 195
  • [7] Association Rules Mining for Hospital Readmission: A Case Study
    Miswan, Nor Hamizah
    Sulaiman, Ismat Mohd
    Chan, Chee Seng
    Ng, Chong Guan
    MATHEMATICS, 2021, 9 (21)
  • [8] CCAR: An efficient method for mining class association rules with itemset constraints
    Nguyen, Dang
    Vo, Bay
    Le, Bac
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 37 : 115 - 124
  • [9] GRG: An efficient method for association rules mining on frequent closed itemsets
    Li, L
    Zhai, DH
    Jin, F
    PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL, 2003, : 854 - 859
  • [10] Efficient method for mining multiple-level and generalized association rules
    Mao Y.-X.
    Chen T.-B.
    Shi B.-L.
    Ruan Jian Xue Bao/Journal of Software, 2011, 22 (12): : 2965 - 2980