OPLS methodology for analysis of pre-processing effects on spectroscopic data

被引:45
|
作者
Gabrielsson, Jon [1 ]
Jonsson, Hans
Airiau, Christian
Schmidt, Bernd
Escott, Richard
Trygg, Johan
机构
[1] Umea Univ, Res Grp Chemometr, SE-90187 Umea, Sweden
[2] GlaxoSmithKline Inc, Tonbridge, Kent, England
关键词
multi-block strategies; pre-processing; UV-data; OPLS; O2PLS; batch process;
D O I
10.1016/j.chemolab.2006.03.013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Pre-processing of spectroscopic data is commonly applied to remove unwanted systematic variation. Possible loss of information and ambiguity regarding discarded variation are issues that complicate pre-treatment of data. In this paper, OPLS methodology is applied to evaluate different techniques for pre-processing of spectroscopic data gathered from a batch process. The objective is to present a rational scheme for analysis of preprocessing in order to understand the influence and effect of pre-treatment. O2PLS uses linear regression to divide the systematic variation in X and Y into three parts; one part with joint X-Y covariation, i.e. related to both X and Y, one part of X with Y-orthogonal variation and one part of Y with X-orthogonal variation. All of the investigated pre-treatment methods removed an additive baseline as expected. In the analysis of raw and differentiated data variation associated with the baseline was found in the Y-orthogonal part of X. Orthogonal information was also found in Y, which suggests that this preprocessing procedure not only removed variation. This would have been more difficult to detect without the O2PLS model since both raw and differentiated data must be analysed simultaneously. Development of a knowledge based strategy with OPLS methodology is an important step towards eliminating trial and error approaches to pre-processing. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:153 / 158
页数:6
相关论文
共 50 条
  • [41] Convergence analysis of stereophonic echo canceller with pre-processing - Relation between pre-processing and convergence
    Hirano, A
    Nakayama, K
    Watanabe, K
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 861 - 864
  • [42] Pre-processing of RDF data for METIS partitioning
    Benhamed S.
    Nait-Bahloul S.
    International Journal of Metadata, Semantics and Ontologies, 2023, 16 (02) : 152 - 171
  • [43] Ground data pre-processing for airborne scanner
    Zhu, Fuqing
    Hongwai Yu Haomibo Xuebao/Journal of Infrared and Millimeter Waves, 1992, 11 (03): : 227 - 234
  • [44] Big Data Pre-Processing: A Quality Framework
    Taleb, Ikbal
    Dssouli, Rachida
    Serhani, Mohamed Adel
    2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 191 - 198
  • [45] Pre-processing of meteorological data: Vertical profiles
    Erbrink, JJ
    Cenedese, A
    Cosemans, G
    Lasserre-Bigorry, A
    Weber, H
    Stubi, R
    INTERNATIONAL JOURNAL OF ENVIRONMENT AND POLLUTION, 1997, 8 (3-6) : 465 - 477
  • [46] PreP:: gene expression data pre-processing
    de la Nava, JG
    van Hijum, S
    Trelles, O
    BIOINFORMATICS, 2003, 19 (17) : 2328 - 2329
  • [47] Pre-processing of Partition Data for Enhancement of LOLIMOT
    Killian, Michaela
    Grosswindhager, Stefan
    Kozek, Martin
    Mayer, Barbara
    2013 8TH EUROSIM CONGRESS ON MODELLING AND SIMULATION (EUROSIM), 2013, : 271 - 275
  • [48] Improving Pipelining Tools for Pre-processing Data
    Novo-Loures, Maria
    Lage, Yeray
    Pavon, Reyes
    Laza, Rosalia
    Ruano-Ordas, David
    Ramon Mendez, Jose
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2022, 7 (04): : 214 - 224
  • [49] The Appliance of Data Pre-processing in Geological Modeling
    Zhang, Wei
    Li, Z. -P.
    Rong, Wang
    Wang, W. -X.
    2011 INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND NEURAL COMPUTING (FSNC 2011), VOL V, 2011, : 606 - 610
  • [50] Data pre-processing pipeline generation for AutoETL
    Giovanelli, Joseph
    Bilalli, Besim
    Abelló, Alberto
    Information Systems, 2022, 108