Analysing chromatographic data using data mining to monitor petroleum content in water

被引:0
|
作者
Holmes, Geoffrey [1 ]
Fletcher, Dale [1 ]
Reutemann, Peter [1 ]
Frank, Eibe [1 ]
机构
[1] Univ Waikato, Dept Comp Sci, Hamilton, New Zealand
关键词
Gas Chromatography Mass Spectrometry; GC-MS; BTEX; Data Mining; Model Trees; Regression; Data Preprocessing; Correlation Optimized Warping; Petroleum Monitoring;
D O I
10.1007/978-3-540-88351-7_21
中图分类号
F [经济];
学科分类号
02 ;
摘要
Chromatography is an important analytical technique that has widespread use in environmental applications. A typical application is the monitoring of water samples to determine if they contain petroleum. These tests are mandated in many countries to enable environmental agencies to determine if tanks used to store petrol are leaking into local water systems. Chromatographic techniques, typically using gas or liquid chromatography coupled with mass spectrometry, allow an analyst to detect a vast array of compounds-potentially in the order of thousands. Accurate analysis relies heavily on the skills of a limited pool of experienced analysts utilising semi-automatic techniques to analyse these datasets-making the outcomes subjective. The focus of current laboratory data analysis systems has been on refinements of existing approaches. The work described here represents a paradigm shift achieved through applying data mining techniques to tackle the problem. These techniques are compelling because the efficacy of preprocessing methods, which are essential in this application area, can be objectively evaluated. This paper presents preliminary results using a data mining framework to predict the concentrations of petroleum compounds in water samples. Experiments demonstrate that the framework can be used to produce models of sufficient accuracy-measured in terms of root mean squared error and correlation coefficients-to offer the potential for significantly reducing the time spent by analysts on this task.
引用
收藏
页码:278 / 290
页数:13
相关论文
共 50 条
  • [21] Analysing decision variables that influence preliminary feasibility studies using data mining techniques
    Yun, Sungmin
    Caldas, Carlos H.
    CONSTRUCTION MANAGEMENT AND ECONOMICS, 2009, 27 (01) : 73 - 87
  • [22] ANALYSING MULTIDIMENSIONAL DATABASES USING DATA MINING AND BUSINESS INTELLIGENCE TO PROVIDE DECISION SUPPORT
    Basra, Rajveer Singh
    Lu, Kevin J.
    ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL AIDSS: ARTIFICIAL INTELLIGENCE AND DECISION SUPPORT SYSTEMS, 2008, : 472 - 479
  • [23] Design and Implementation of Petroleum Geology Data Mining System
    Xu, Xiaohong
    Tian, Hu
    Fu, Jilin
    Sun, Zhihua
    Xu, Xin
    2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2013, : 456 - 459
  • [24] Geophysical and hydrological data assimilation to monitor water content dynamics in the rocky unsaturated zone
    De Carlo, Lorenzo
    Berardi, Marco
    Vurro, Michele
    Caputo, Maria Clementina
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2018, 190 (05)
  • [25] Geophysical and hydrological data assimilation to monitor water content dynamics in the rocky unsaturated zone
    Lorenzo De Carlo
    Marco Berardi
    Michele Vurro
    Maria Clementina Caputo
    Environmental Monitoring and Assessment, 2018, 190
  • [26] Data mining of water quality data by chemometrical methods
    Vandeginste, BGM
    MONITORING OF WATER QUALITY: THE CONTRIBUTION OF ADVANCED TECHNOLOGIES, 1998, : 49 - 53
  • [27] Content-free collaborative learning modeling using data mining
    Anaya, Antonio R.
    Boticario, Jesus G.
    USER MODELING AND USER-ADAPTED INTERACTION, 2011, 21 (1-2) : 181 - 216
  • [28] Modelling content lifespan in online social networks using data mining
    Gibbons, John W.
    Agah, Arvin
    International Journal of Web Based Communities, 2015, 11 (3-4) : 234 - 263
  • [29] Using OLAP and data mining for content planning in natural language generation
    Favero, EL
    Robin, J
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2001, 1959 : 164 - 175
  • [30] Content-free collaborative learning modeling using data mining
    Antonio R. Anaya
    Jesús G. Boticario
    User Modeling and User-Adapted Interaction, 2011, 21 : 181 - 216