Machine learning approaches for anomaly detection of water quality on a real-world data set

Cited by: 80
Authors
Muharemi, Fitore [1]
Logofatu, Doina [1]
Leon, Florin [2]
Affiliations
[1] Frankfurt Univ Appl Sci, Fac Comp Sci & Engn, Frankfurt, Germany
[2] Tech Univ Iasi, Comp Sci & Engn, Iasi, Romania
Keywords
Classification; water quality; F1 score; imputation; event
DOI
10.1080/24751839.2019.1565653
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Accurate detection of water quality changes is a crucial task for water companies. Water supply companies must provide safe drinking water. Nowadays, sensitive sensors that monitor data over time are deployed in many areas. Normally, the data recorded by the sensors carry meaning, for example by indicating that an event has occurred. Sometimes the data are poorly understood, and it is difficult to state whether an event has occurred. This work describes several approaches to identifying changes or anomalies in water quality time series data. It also discusses some challenges of working with time series data and proposes a solution to them. The following models are applied to water quality data: logistic regression, linear discriminant analysis, support vector machines (SVM), artificial neural network (ANN), deep neural network (DNN), recurrent neural network (RNN) and long short-term memory (LSTM). Performance is evaluated using the F-score metric, and a simulation study is conducted to check the performance of each algorithm. Addressing imbalanced data essentially means intentionally biasing the data to obtain interesting results rather than accurate ones. The results show that all algorithms are vulnerable, although SVM, ANN and logistic regression tend to be a little less vulnerable, while DNN, RNN and LSTM are very vulnerable.
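The abstract names the F-score as the evaluation metric and mentions imbalanced data. As a minimal sketch of why that pairing matters, the following computes the standard F1 score (the harmonic mean of precision and recall) next to plain accuracy. The function names and the toy labels are hypothetical illustrations, not data or code from the paper, and the paper may use a different F-score variant.

```python
def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def f1_score(y_true, y_pred, positive=1):
    """Standard F1: harmonic mean of precision and recall for one class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are 0 or undefined,
        # so F1 is reported as 0 by convention.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy labels (assumption): 1 = event, 0 = normal.
# Only 2 of 20 samples are events, so the classes are imbalanced.
y_true = [0] * 18 + [1] * 2

# A trivial model that never reports an event.
y_all_normal = [0] * 20

# A model with one false alarm, one detected event and one missed event.
y_detector = [0] * 17 + [1] + [1, 0]

for name, y_pred in [("all-normal", y_all_normal), ("detector", y_detector)]:
    print(name, accuracy(y_true, y_pred), f1_score(y_true, y_pred))
```

On these toy labels both predictors reach the same accuracy, but only the one that actually finds an event gets a non-zero F1, which is one common reason to prefer the F-score for rare-event detection.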
Pages: 294-307
Page count: 14