Combining normalizing flows with decision trees for interpretable unsupervised outlier detection

被引:0
|
作者
Papastefanopoulos, Vasilis [1 ]
Linardatos, Pantelis [1 ]
Kotsiantis, Sotiris [1 ]
机构
[1] Department of Mathematics, University of Patras, Patras,26504, Greece
关键词
D O I
10.1016/j.engappai.2024.109770
中图分类号
学科分类号
摘要
Outlier detection is critical for ensuring data integrity across various domains, from fraud detection in finance to anomaly identification in healthcare. Despite the importance of anomaly detection, most methods focus on performance, with interpretability remaining underexplored in unsupervised learning. Interpretability is essential in contexts where understanding why certain data points are classified as outliers is as important as the detection itself. This study introduces an interpretable approach to unsupervised outlier detection by combining normalizing flows and decision trees. Normalizing flows transform complex data distributions into simpler, tractable forms, allowing precise density estimation and the generation of pseudo-labels that differentiate inliers from outliers. These pseudo-labels are subsequently used to train a decision tree, offering both a structured decision-making process and interpretability in an unsupervised context, thereby addressing a key gap in the field. Our method was evaluated against 23 established outlier detection algorithms across 17 datasets using Precision, Recall, F1 Score, and Matthews Correlation Coefficient (MCC). The results showed that our approach ranked 4th in F1 Score, 6th in MCC, 3rd in Precision, and 19th in Recall. While it performed strongly on some datasets and less so on others, this variability is likely due to dataset-specific characteristics. Post-hoc statistical significance testing demonstrated that interpretability in unsupervised outlier detection can be achieved without significantly compromising performance, making it a valuable option for applications that require transparent and understandable anomaly detection. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [41] An Unsupervised Boosting Strategy for Outlier Detection Ensembles
    Campos, Guilherme O.
    Zimek, Arthur
    Meira, Wagner, Jr.
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT I, 2018, 10937 : 564 - 576
  • [42] On normalization and algorithm selection for unsupervised outlier detection
    Sevvandi Kandanaarachchi
    Mario A. Muñoz
    Rob J. Hyndman
    Kate Smith-Miles
    Data Mining and Knowledge Discovery, 2020, 34 : 309 - 354
  • [43] Unsupervised outlier detection in quality control: an overview
    Archimbaud, Aurore
    JOURNAL OF THE SFDS, 2018, 159 (03): : 1 - 39
  • [44] Unsupervised Sequential Outlier Detection With Deep Architectures
    Lu, Weining
    Cheng, Yu
    Xiao, Cao
    Chang, Shiyu
    Huang, Shuai
    Liang, Bin
    Huang, Thomas
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (09) : 4321 - 4330
  • [45] Stacking-based ensemble learning of decision trees for interpretable prostate cancer detection
    Wang, Yuyan
    Wang, Dujuan
    Geng, Na
    Wang, Yanzhang
    Yin, Yunqiang
    Jin, Yaochu
    APPLIED SOFT COMPUTING, 2019, 77 : 188 - 204
  • [46] ClimAlign: Unsupervised statistical downscaling of climate variables via normalizing flows
    Groenke, Brian
    Madaus, Luke
    Monteleoni, Claire
    PROCEEDINGS OF 2020 10TH INTERNATIONAL CONFERENCE ON CLIMATE INFORMATICS (CI2020), 2020, : 60 - 66
  • [47] CFLOW-AD: Real-Time Unsupervised Anomaly Detection with Localization via Conditional Normalizing Flows
    Gudovskiy, Denis
    Ishizaka, Shun
    Kozuka, Kazuki
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1819 - 1828
  • [48] Anomaly Detection in Trajectory Data with Normalizing Flows
    Dias, Madson L. D.
    Mattos, Cesar Lincoln C.
    da Silva, Ticiana L. C.
    de Macedo, Jose Antonio F.
    Silva, Wellington C. P.
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [49] Normalizing Flows for Human Pose Anomaly Detection
    Hirschorn, Or
    Avidan, Shai
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 13499 - 13508
  • [50] Unsupervised Outlier Detection Technique for Intrusion Detection in Cloud Computing
    Kumar, Manoj
    Mathur, Robin
    2014 INTERNATIONAL CONFERENCE FOR CONVERGENCE OF TECHNOLOGY (I2CT), 2014,