Leveraging an Isolation Forest to Anomaly Detection and Data Clustering

被引:1
|
作者
Yepmo, Veronne [1 ]
Smits, Gregory [2 ]
Lesot, Marie -Jeanne [3 ]
Pivert, Olivier [1 ]
机构
[1] Univ Rennes, IRISA, Lannion, France
[2] Lab STICC, IMT Atlantique, Brest, France
[3] Sorbonne Univ, LIP6, Paris, France
关键词
Anomaly/outlier detection; Isolation forest; Clustering; FUZZY; ALGORITHM; NOISE;
D O I
10.1016/j.datak.2024.102302
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Understanding why some points in a data set are considered as anomalies cannot be done without taking into account the structure of the regular points. Whereas many machine learning methods are dedicated to the identification of anomalies on one side, or to the identification of the data inner -structure on the other side, a solution is introduced to answers these two tasks using a same data model, a variant of an isolation forest. The initial algorithm to construct an isolation forest is indeed revisited to preserve the data inner structure without affecting the efficiency of the outlier detection. Experiments conducted both on synthetic and real -world data sets show that, in addition to improving the detection of abnormal data points, the proposed variant of isolation forest allows for a reconstruction of the subspaces of high density. Therefore, the former can serve as a basis for a unified approach to detect global and local anomalies, which is a necessary condition to then provide users with informative descriptions of the data.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] HYPERSPECTRAL ANOMALY DETECTION BASED ON ISOLATION FOREST WITH BAND CLUSTERING
    Huang, Yuancheng
    Xue, Yuanyuan
    Su, Yuanchao
    Han, Shanshan
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 2416 - 2419
  • [2] Anomaly Detection in Streaming Data using Isolation Forest
    Kareem, Mohammed Shaker
    Muhammed, Lamia AbedNoor
    PROCEEDINGS 2024 SEVENTH INTERNATIONAL WOMEN IN DATA SCIENCE CONFERENCE AT PRINCE SULTAN UNIVERSITY, WIDS-PSU 2024, 2024, : 223 - 228
  • [3] Anomaly credit data detection based on enhanced Isolation Forest
    Zhang, Xiaodong
    Yao, Yuan
    Lv, Congdong
    Wang, Tao
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2022, 122 (01): : 185 - 192
  • [4] An Improved Data Anomaly Detection Method Based on Isolation Forest
    Xu, Dong
    Wang, Yanjun
    Meng, Yulong
    Zhang, Ziying
    2017 10TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2017, : 287 - 291
  • [5] Anomaly credit data detection based on enhanced Isolation Forest
    Xiaodong Zhang
    Yuan Yao
    Congdong Lv
    Tao Wang
    The International Journal of Advanced Manufacturing Technology, 2022, 122 : 185 - 192
  • [6] Generalized isolation forest for anomaly detection
    Lesouple, Julien
    Baudoin, Cedric
    Spigai, Marc
    Tourneret, Jean-Yves
    PATTERN RECOGNITION LETTERS, 2021, 149 : 109 - 119
  • [7] Anomaly Detection with Generalized Isolation Forest
    Downey, Brett E.
    Leung, Carson K.
    Pazdor, Adam G. M.
    Petrillo, Ryan A. L.
    Popov, Denys
    Schneider, Benjamin R.
    ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 2, AINA 2024, 2024, 200 : 356 - 368
  • [8] Deep Isolation Forest for Anomaly Detection
    Xu, Hongzuo
    Pang, Guansong
    Wang, Yijie
    Wang, Yongjun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (12) : 12591 - 12604
  • [9] OptIForest: Optimal Isolation Forest for Anomaly Detection
    Xiang, Haolong
    Zhang, Xuyun
    Hu, Hongsheng
    Qi, Lianyong
    Dou, Wanchun
    Dras, Mark
    Beheshti, Amin
    Xu, Xiaolong
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 2379 - 2387
  • [10] Hyperspectral Anomaly Detection With Kernel Isolation Forest
    Li, Shutao
    Zhang, Kunzhong
    Duan, Puhong
    Kang, Xudong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (01): : 319 - 329