共 50 条
Finding Correspondence between Metabolomic Features in Untargeted Liquid Chromatography-Mass Spectrometry Metabolomics Datasets
被引:12
|作者:
Pinto, Rui Climaco
[1
,2
,7
]
Karaman, Ibrahim
[1
,2
]
Lewis, Matthew R.
[3
,4
]
Hallqvist, Jenny
[5
,6
]
Kaluarachchi, Manuja
[2
,7
]
Graca, Goncalo
[7
]
Chekmeneva, Elena
[3
,4
]
Durainayagam, Brenan
[1
,2
]
Ghanbari, Mohsen
[8
]
Ikram, M. Arfan
[8
]
Zetterberg, Henrik
[9
,10
,11
,12
]
Griffin, Julian
[2
,7
]
Elliott, Paul
[1
,2
]
Tzoulaki, Ioanna
[1
,13
]
Dehghan, Abbas
[1
,2
,8
]
Herrington, David
[14
]
Ebbels, Timothy
[7
]
机构:
[1] Imperial Coll London, Sch Publ Hlth, MRC PHE Ctr Environm & Hlth, Dept Epidemiol & Biostat, London W12 0BZ, England
[2] Imperial Coll London, UK Dementia Res Inst, London W12 0BZ, England
[3] Imperial Coll London, Dept Metab Digest & Reprod, MRC NIHR Natl Phenome Ctr, London SW7 2AZ, England
[4] Imperial Coll London, Sect Bioanalyt Chem, Dept Metab Digest & Reprod, London SW7 2AZ, England
[5] UCL, Great Ormond St Hosp, Ctr Translat Omics, London WC1N 1EH, England
[6] UCL, Queen Sq Inst Neurol, Dept Clin & Movement Neurosci, London WC1N 3BG, England
[7] Imperial Coll London, Dept Metab Digest & Reprod, Div Syst Med, Sect Bioinformat, London SW7 2AZ, England
[8] Erasmus MC, Dept Epidemiol, NL-3015 GD Rotterdam, Netherlands
[9] Univ Gothenburg, Sahlgrenska Acad, Inst Neurosci & Physiol, Dept Psychiat & Neurochem, S-43141 Molndal, Sweden
[10] Sahlgrens Univ Hosp, Clin Neurochem Lab, S-41345 Molndal, Sweden
[11] UCL, Dept Neurodegenerat Dis, London WC1N 3BG, England
[12] UCL, UK Dementia Res Inst, London WC1N 3BG, England
[13] Univ Ioannina, Sch Med, Dept Hyg & Epidemiol, Ioannina 45110, Greece
[14] Wake Forest Sch Med, Dept Internal Med, Winston Salem, NC 27101 USA
基金:
英国医学研究理事会;
英国工程与自然科学研究理事会;
英国经济与社会研究理事会;
英国惠康基金;
英国生物技术与生命科学研究理事会;
欧盟地平线“2020”;
美国国家卫生研究院;
关键词:
MS;
ALIGNMENT;
PLATFORM;
D O I:
10.1021/acs.analchem.1c03592
中图分类号:
O65 [分析化学];
学科分类号:
070302 ;
081704 ;
摘要:
Integration of multiple datasets can greatly enhance bioanalytical studies, for example, by increasing power to discover and validate biomarkers. In liquid chromatography-mass spectrometry (LC-MS) metabolomics, it is especially hard to combine untargeted datasets since the majority of metabolomic features are not annotated and thus cannot be matched by chemical identity. Typically, the information available for each feature is retention time (RT), mass-to-charge ratio (m/z), and feature intensity (FI). Pairs of features from the same metabolite in separate datasets can exhibit small but significant differences, making matching very challenging. Current methods to address this issue are too simple or rely on assumptions that cannot be met in all cases. We present a method to find feature correspondence between two similar LC-MS metabolomics experiments or batches using only the features' RT, m/z, and FI. We demonstrate the method on both real and synthetic datasets, using six orthogonal validation strategies to gauge the matching quality. In our main example, 4953 features were uniquely matched, of which 585 (96.8%) of 604 manually annotated features were correct. In a second example, 2324 features could be uniquely matched, with 79 (90.8%) out of 87 annotated features correctly matched. Most of the missed annotated matches are between features that behave very differently from modeled inter-dataset shifts of RT, MZ, and FI. In a third example with simulated data with 4755 features per dataset, 99.6% of the matches were correct. Finally, the results of matching three other dataset pairs using our method are compared with a published alternative method, metabCombiner, showing the advantages of our approach. The method can be applied using M2S (Match 2 Sets), a free, open-source MATLAB toolbox, available at https://github.com/rjdossan/M2S.
引用
收藏
页码:5493 / 5503
页数:11
相关论文