Anomaly credit data detection based on enhanced Isolation Forest

被引:4
|
作者
Zhang, Xiaodong [1 ]
Yao, Yuan [1 ]
Lv, Congdong [1 ]
Wang, Tao [2 ]
机构
[1] Nanjing Audit Univ, Sch Informat Engn, Nanjing 211815, Peoples R China
[2] JUSFOUN BIG DATA, Beijing 10000, Peoples R China
基金
国家重点研发计划;
关键词
Credit evaluation; Anomaly detection; Class-imbalance; Cost-sensitive; EasyEnsemble; Isolation forest; SVM;
D O I
10.1007/s00170-022-09251-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In view of the real-world problem of falsity and errors credit data, and the performance degradation of the credit evaluation model caused by these problems, we proposed an outlier detection algorithm, which considered two characteristics of class-imbalance and cost-sensitive in credit data. We use an anomaly detection model called EIF to optimize the credit evaluation models. EIF uses the EasyEnsemble algorithm to construct balanced data sets, and train an Isolation Forest model for anomaly detection by the balanced datasets with different disturbances. On the one hand, the balanced dataset ensures that the class-imbalance problem is solved by undersampling, on the other hand, each sub-model learns from the overall minority class samples in order to solve the cost-sensitive problem. Experiments were performed on UCI German dataset, and the test set with fake data was constructed by correlation. Compared with other anomaly detection algorithms in common credit evaluation models, the EIF-optimized model has a higher F1 score and a lower cost-sensitive error rate. In conclusion, the EIF model is effective in enhancing the performance of the credit evaluation model for forged credit datasets.
引用
收藏
页码:185 / 192
页数:8
相关论文
共 50 条
  • [1] Anomaly credit data detection based on enhanced Isolation Forest
    Xiaodong Zhang
    Yuan Yao
    Congdong Lv
    Tao Wang
    The International Journal of Advanced Manufacturing Technology, 2022, 122 : 185 - 192
  • [2] `An enhanced variable selection and Isolation Forest based methodology for anomaly detection with OES data
    Puggini, Luca
    McLoone, Sean
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 67 : 126 - 135
  • [3] An Improved Data Anomaly Detection Method Based on Isolation Forest
    Xu, Dong
    Wang, Yanjun
    Meng, Yulong
    Zhang, Ziying
    2017 10TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2017, : 287 - 291
  • [4] Leveraging an Isolation Forest to Anomaly Detection and Data Clustering
    Yepmo, Veronne
    Smits, Gregory
    Lesot, Marie -Jeanne
    Pivert, Olivier
    DATA & KNOWLEDGE ENGINEERING, 2024, 151
  • [5] Anomaly Detection in Streaming Data using Isolation Forest
    Kareem, Mohammed Shaker
    Muhammed, Lamia AbedNoor
    PROCEEDINGS 2024 SEVENTH INTERNATIONAL WOMEN IN DATA SCIENCE CONFERENCE AT PRINCE SULTAN UNIVERSITY, WIDS-PSU 2024, 2024, : 223 - 228
  • [6] Distribution Forest: An Anomaly Detection Method Based on Isolation Forest
    Yao, Chengfei
    Ma, Xiaoqing
    Chen, Biao
    Zhao, Xiaosong
    Bai, Gang
    ADVANCED PARALLEL PROCESSING TECHNOLOGIES (APPT 2019), 2019, 11719 : 135 - 147
  • [7] Isolation Forest Based Anomaly Detection Framework on Non-IID Data
    Xiang, Haolong
    Wang, Jiayu
    Ramamohanarao, Kotagiri
    Salcic, Zoran
    Dou, Wanchun
    Zhang, Xuyun
    IEEE INTELLIGENT SYSTEMS, 2021, 36 (03) : 31 - 40
  • [8] Spectral-Spatial Anomaly Detection of Hyperspectral Data Based on Improved Isolation Forest
    Song, Xiangyu
    Aryal, Sunil
    Ting, Kai Ming
    Liu, Zhen
    He, Bin
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [9] Anomaly Detection for Data Streams Based on Isolation Forest Using Scikit-Multiflow
    Togbe, Maurras Ulbricht
    Barry, Mariam
    Boly, Aliou
    Chabchoub, Yousra
    Chiky, Raja
    Montiel, Jacob
    Tran, Vinh-Thuy
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2020, PART IV, 2020, 12252 : 15 - 30
  • [10] A probabilistic approach driven credit card anomaly detection with CBLOF and isolation forest models
    Chugh, Bharti
    Malik, Nitin
    Gupta, Deepak
    Alkahtani, Badr S.
    Alexandria Engineering Journal, 2025, 114 : 231 - 242