Research on outlier detection of data based on machine learning

被引:0
|
作者
Wang, Chunyang [1 ]
机构
[1] Chengdu Univ Informat Technol, Chengdu, Sichuan, Peoples R China
关键词
Outliers; Isolated Forests; DBSCAN;
D O I
10.1145/3472634.3474072
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the increasing magnitude of data, the accuracy of single data cannot be guaranteed. In order to improve the accuracy of model prediction data, and further for the subsequent data processing, this paper focuses on the detection of data on outliers. This paper introduces the density clustering and outlier detection methods of Isolation Forest in outlier recognition, and mainly describes the principle and process of outlier recognition using density clustering and Isolation Forest. Based on this, according to the features of data, an improved algorithm combining density clustering and Isolation Forest is proposed. Finally, through the existing common outlier detection data set, the statistical outlier recognition method, the existing machine learning algorithm and the improved algorithm proposed in this paper are compared to identify outliers, which verifies the stability of the proposed method and the effective improvement compared with the existing outlier detection.
引用
收藏
页码:200 / 203
页数:4
相关论文
共 50 条
  • [1] Machine Learning in Therapeutic Research: The Hard Work of Outlier Detection in Large Data
    Cleophas, Ton J.
    [J]. AMERICAN JOURNAL OF THERAPEUTICS, 2016, 23 (03) : e837 - e843
  • [2] Machine Learning based Outlier Detection in IoT Greenhouse
    Abid, Aymen
    Cheikhrouhou, Omar
    Zaibi, Ghada
    Kachouri, Abdennaceur
    [J]. 2024 IEEE 27TH INTERNATIONAL SYMPOSIUM ON REAL-TIME DISTRIBUTED COMPUTING, ISORC 2024, 2024,
  • [3] Outlier data mining model for sports data analysis based on machine learning
    Yin, Zhimeng
    Cui, Wei
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (02) : 2733 - 2742
  • [4] Performing Machine Learning Based Outlier Detection for Automotive Grade Products
    Yang, Y. L.
    Tsao, P. C.
    Lin, C. W.
    Lee, Ross
    Ni, Olivia
    Chen, T. T.
    Ting, Y. J.
    Lai, C. T.
    Yeh, Jason
    Yang, Arnold
    Huang, Wayne
    Chen, Peng
    Tsai, Charly
    Yang, Ryan
    Huang, Y. S.
    Hsu, B. C.
    Lee, M. Z.
    Lee, T. H.
    Huang, Michael
    Chen, Coming
    Chu, Liham
    Kao, H. W.
    Tsai, N. S.
    [J]. 2023 IEEE INTERNATIONAL RELIABILITY PHYSICS SYMPOSIUM, IRPS, 2023,
  • [5] Outlier Detection Approaches Based on Machine Learning in the Internet-of-Things
    Jiang, Jinfang
    Han, Guangjie
    Liu, Li
    Shu, Lei
    Guizani, Mohsen
    [J]. IEEE WIRELESS COMMUNICATIONS, 2020, 27 (03) : 53 - 59
  • [6] A Machine Learning Methods: Outlier detection in WSN
    Ayadi, Hayfa
    Zouinkhi, Ahmed
    Boussaid, Boumedyen
    Abdelkrim, M. Naceur
    [J]. 2015 16TH INTERNATIONAL CONFERENCE ON SCIENCES AND TECHNIQUES OF AUTOMATIC CONTROL AND COMPUTER ENGINEERING (STA), 2015, : 722 - 727
  • [7] Kernel Extreme Learning Machine-Based Dynamic Interval Construction for Outlier Detection of Telemetry Data
    Xie, Haoran
    Zhan, Yafeng
    Ren, Shuqian
    Lu, Jianhua
    [J]. 2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,
  • [8] Research on Outlier Detection Algorithm of Medical Database Based on Deep Learning
    Li, Jie
    Zeng, Han
    Dai, Hengzhang
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 127 : 11 - 12
  • [9] Model optimization of English intelligent translation based on outlier detection and machine learning
    Bian, Yuzhu
    Li, Jiaxin
    Zhao, Yuge
    [J]. SOFT COMPUTING, 2023, 27 (14) : 10297 - 10303
  • [10] Machine learning based technique for outlier detection and result prediction in combustion diagnostics
    Chen, Mingfei
    Zhou, Kaile
    Liu, Dong
    [J]. ENERGY, 2024, 290