Automated Workflows for Accurate Mass-Based Putative Metabolite Identification in LC-MS-Derived Metabolomic Datasets

Publication
Bioinformatics, 27(8), 1108–1112. Oxford University Press. https://doi.org/10.1093/bioinformatics/btr079

Abstract: Motivation: The study of metabolites (metabolomics) is increasingly being applied to investigate microbial, plant, environmental and mammalian systems. One of the limiting factors is the chemical identification of metabolites from mass spectrometric signals present in complex datasets. Results: Three workflows have been developed to allow rapid, automated and high-throughput annotation and putative metabolite identification of electrospray LC-MS-derived metabolomic datasets. The collection of workflows is named PUTMEDID_LCMS and performs feature annotation, matching of accurate m/z to the accurate mass of neutral molecules and their associated molecular formulae, and matching of the molecular formulae to a reference file of metabolites. The software is independent of the instrument and of the data pre-processing applied. The number of false positives is reduced by eliminating the inaccurate matching of many artifact, isotope, multiply charged and complex adduct peaks through complex interrogation of experimental data. Availability: The workflows, standard operating procedure and further information are publicly available at http://www.mcisb.org/resources/putmedid.html.
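
Illustrative sketch (not the PUTMEDID_LCMS code): the abstract describes matching an accurate m/z to the accurate mass of a neutral molecule. The Python below shows one simple way such a match could be done for singly charged positive ions, using a parts-per-million tolerance. The adduct table, the 5 ppm default, the two-entry reference dictionary and the function names `ppm_error` and `match_mz` are all assumptions made for illustration; the published workflows use their own rules and reference files.

```python
# Minimal sketch of accurate-mass matching; all names here are hypothetical.

PROTON = 1.007276  # mass of a proton in Da

# Assumed adduct table for singly charged positive ions:
# adduct label -> mass added to the neutral molecule.
ADDUCTS = {
    "[M+H]+": PROTON,
    "[M+Na]+": 22.989218,
    "[M+K]+": 38.963158,
}

# Tiny illustrative reference list: name -> monoisotopic neutral mass (Da).
REFERENCE = {
    "glucose (C6H12O6)": 180.06339,
    "citric acid (C6H8O7)": 192.02700,
}


def ppm_error(observed, theoretical):
    """Signed mass error of observed relative to theoretical, in ppm."""
    return (observed - theoretical) / theoretical * 1e6


def match_mz(mz, reference=REFERENCE, tol_ppm=5.0):
    """Return (name, adduct, ppm error) for every reference mass that is
    within tol_ppm of the neutral mass implied by each assumed adduct."""
    hits = []
    for adduct, shift in ADDUCTS.items():
        neutral = mz - shift  # candidate neutral mass under this adduct
        for name, mass in reference.items():
            err = ppm_error(neutral, mass)
            if abs(err) <= tol_ppm:
                hits.append((name, adduct, round(err, 2)))
    return hits


if __name__ == "__main__":
    # An m/z near the sodium adduct of glucose.
    for hit in match_mz(203.05261):
        print(hit)
```

A match of this kind yields only a putative identification: several molecular formulae, and many isomers per formula, can fall within the same tolerance window. The sketch also ignores isotope, multiply charged and complex adduct peaks, which the abstract names as sources of false positives.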