Interactive Design and Application of MassQL Queries after Preprocessing for the Annotation of PFAS in LC-TIMS-PASEF data
Posters | 2023 | Bruker | ASMSInstrumentation
Per- and polyfluoroalkyl substances (PFAS) represent a class of persistent environmental contaminants with growing regulatory and public health concerns. Rapid, reliable annotation of PFAS in complex samples such as drinking water is critical for monitoring exposure and guiding remediation efforts. Integrating a flexible query language with advanced data processing greatly enhances the capacity to discover both known and novel PFAS candidates in high-dimensional mass spectrometry datasets.
This work describes an interactive workflow to design, validate and apply Mass Spectrometry Query Language (MassQL) queries on preprocessed LC-TIMS-PASEF data within the MetaboScape 2023b environment. The main goals are:
Samples were analyzed on a Bruker timsTOF Pro equipped with a VIP-HESI source and coupled to an Elute UHPLC system using an Intensity Solo C18 column with a water/methanol gradient containing 5 mM ammonium acetate. Data were acquired in negative ion mode using PASEF, covering m/z 30–1000 and ion mobility 0.5–1.3 V/cm².
Raw data processing employed the T-ReX 4D algorithm in MetaboScape 2023b, generating a feature table. Known PFAS were initially dereplicated via internal target lists and spectral libraries, yielding 975 features. Interactive filtering and manual inspection guided the derivation of fragment and neutral loss rules, which were then expressed as a MassQL query.
Key software tools:
Applying the custom MassQL query to negative-mode features filtered the table down to seven candidates, two of which were previously unreported PFAS analogs. In silico fragmentation of a target compound (1-hydro-pentadecafluoroheptane) showed fragment ions corresponding to C2F5, C3F7 and C4F9 losses within 5 mDa tolerance. Measured CCS for this feature (155.7 Ų) agreed within 1.7% of the predicted value (153.1 Ų), supporting its structural assignment.
This interactive workflow demonstrates a closed-loop process: (1) explore features, (2) derive rules, (3) formalize MassQL query, (4) filter features, (5) validate candidate structures. It accelerates the discovery of both known and novel PFAS in complex matrices.
Emerging directions include integration of AI-driven spectral interpretation, automated query optimization, expansion to multi-omic data types, and development of public MassQL repositories. Enhanced collaboration among software platforms and community-curated query libraries will further streamline non-targeted contaminant screening workflows.
The presented workflow leverages MetaboScape’s interactive environment and MassQL to enable dynamic design and validation of queries on processed LC-TIMS-PASEF data. Combining feature exploration, in silico tools and predictive CCS accelerates PFAS annotation and the discovery of novel analogs, offering a powerful strategy for environmental monitoring.
Ion Mobility, LC/HRMS, LC/MS, LC/MS/MS, LC/TOF
IndustriesEnvironmental
ManufacturerBruker
Summary
Importance of the Topic
Per- and polyfluoroalkyl substances (PFAS) represent a class of persistent environmental contaminants with growing regulatory and public health concerns. Rapid, reliable annotation of PFAS in complex samples such as drinking water is critical for monitoring exposure and guiding remediation efforts. Integrating a flexible query language with advanced data processing greatly enhances the capacity to discover both known and novel PFAS candidates in high-dimensional mass spectrometry datasets.
Study Objectives and Overview
This work describes an interactive workflow to design, validate and apply Mass Spectrometry Query Language (MassQL) queries on preprocessed LC-TIMS-PASEF data within the MetaboScape 2023b environment. The main goals are:
- To extract characteristic fragmentation and mobility patterns for PFAS compounds.
- To formalize these patterns as MassQL queries.
- To screen both known and potential new PFAS in drinking water samples.
Methodology and Instrumentation
Samples were analyzed on a Bruker timsTOF Pro equipped with a VIP-HESI source and coupled to an Elute UHPLC system using an Intensity Solo C18 column with a water/methanol gradient containing 5 mM ammonium acetate. Data were acquired in negative ion mode using PASEF, covering m/z 30–1000 and ion mobility 0.5–1.3 V/cm².
Raw data processing employed the T-ReX 4D algorithm in MetaboScape 2023b, generating a feature table. Known PFAS were initially dereplicated via internal target lists and spectral libraries, yielding 975 features. Interactive filtering and manual inspection guided the derivation of fragment and neutral loss rules, which were then expressed as a MassQL query.
Key software tools:
- MassQL: domain-specific language for MS pattern queries.
- MetFrag In Silico Fragmentation: for matching experimental MS/MS spectra.
- CCS-Predict Pro: for collision cross-section prediction.
- SmartFormula with adapted Seven Golden Rules: for molecular formula generation.
- Compound Crawler accessing ChemSpider, PubChem, ChEBI.
Main Results and Discussion
Applying the custom MassQL query to negative-mode features filtered the table down to seven candidates, two of which were previously unreported PFAS analogs. In silico fragmentation of a target compound (1-hydro-pentadecafluoroheptane) showed fragment ions corresponding to C2F5, C3F7 and C4F9 losses within 5 mDa tolerance. Measured CCS for this feature (155.7 Ų) agreed within 1.7% of the predicted value (153.1 Ų), supporting its structural assignment.
This interactive workflow demonstrates a closed-loop process: (1) explore features, (2) derive rules, (3) formalize MassQL query, (4) filter features, (5) validate candidate structures. It accelerates the discovery of both known and novel PFAS in complex matrices.
Benefits and Practical Applications
- Interactive rule derivation reduces trial-and-error in query design.
- Integration of MS/MS fragmentation and CCS prediction enhances annotation confidence.
- Shareable MassQL queries promote reproducibility and community adoption.
- Applicable to routine PFAS screening in environmental and regulatory labs.
Future Trends and Perspectives
Emerging directions include integration of AI-driven spectral interpretation, automated query optimization, expansion to multi-omic data types, and development of public MassQL repositories. Enhanced collaboration among software platforms and community-curated query libraries will further streamline non-targeted contaminant screening workflows.
Conclusion
The presented workflow leverages MetaboScape’s interactive environment and MassQL to enable dynamic design and validation of queries on processed LC-TIMS-PASEF data. Combining feature exploration, in silico tools and predictive CCS accelerates PFAS annotation and the discovery of novel analogs, offering a powerful strategy for environmental monitoring.
Instrumentation
- Bruker timsTOF Pro with VIP-HESI
- Elute UHPLC system
- Intensity Solo C18 column
- MetaboScape 2023b with T-ReX 4D preprocessing
- MassQL query engine
- MetFrag In Silico Fragmentation
- CCS-Predict Pro
- SmartFormula with Seven Golden Rules
- Compound Crawler accessing ChemSpider, PubChem, ChEBI
References
- Jarmusch AK, et al. A Universal Language for Finding Mass Spectrometry Data Patterns. 2022.
- Ruttkies C, et al. MetFrag relaunch: incorporating strategies beyond in silico fragmentation. Journal of Cheminformatics. 2016;8:1–16.
- Kind T, Fiehn O. Seven Golden Rules for heuristic filtering of molecular formulas obtained by accurate mass spectrometry. BMC Bioinformatics. 2007;8:1–20.
- ChemSpider. www.chemspider.com.
- PubChem. pubchem.ncbi.nlm.nih.gov.
- ChEBI. www.ebi.ac.uk/chebi/.
Content was automatically generated from an orignal PDF document using AI and may contain inaccuracies.
Similar PDF
Confident annotation of GPNAE and HexGPNAE in Caenorhabditis elegans aided by homologous series extension in LC-MS/MS data
2025|Bruker|Posters
ASMS 2025 TP 474 Confident annotation of GPNAE and HexGPNAE in Caenorhabditis elegans aided by homologous series extension in LC-MS/MS data Nikolas Kessler1; Aiko Barsch1; Erica M. Forsberg1; Matthew R. Lewis1; Michael Witting2 Methods Mixed stage C. elegans reference samples…
Key words
gpnae, gpnaehomologous, homologoushexgpnae, hexgpnaeacylethanolamides, acylethanolamidesmassql, massqlseries, serieslipid, lipidglycerophospho, glycerophosphohexosylglycerophospho, hexosylglycerophosphocaenorhabditis, caenorhabditiselegans, elegansqueries, queriesannotated, annotatedannotation, annotationnacylethanolamides
Investigating the increased lifespan in C. elegans daf-2 mutants by 4D-Lipidomics
2019|Bruker|Applications
Investigating the increased lifespan in C. elegans daf-2 mutants by 4D-Lipidomics The small nematode Caenorhabditis elegans is one of the premier biomedical model organisms and employed in many aspects of basic and applied science Introduction Typical application areas for C.…
Key words
ccs, ccslipid, lipidcharacteristic, characteristicnegative, negativevalues, valueslipids, lipidspasef, pasefmetaboscape, metaboscapelipidblast, lipidblasttimstof, timstofccspredict, ccspredictpositive, positivespectra, spectrawild, wildassignment
4D-Lipidomics™ workflow for increased throughput
2021|Bruker|Applications
4D-Lipidomics™ workflow for increased throughput Lipid profiling from complex lipid extracts can be a challenging and time consuming task. The high complexity of samples and co-elution of isobaric or isomeric compounds complicate the confident annotation of lipids. The presented 4D-Lipidomics…
Key words
ccs, ccsannotation, annotationlipid, lipidannotations, annotationslipids, lipidstimstof, timstofaware, awarebased, basedvalues, valuescoverage, coveragedeep, deepcan, canmobility, mobilityrule, ruleannotated
CCS-aware SpatialOMx® enables highly confident and automatic lipid annotations with regiospecific context
2021|Bruker|Applications
Intensity (arb. unit) 132% 0% 100% PS 38:1 m/z 816.5747 ± 0.0122 1/K0 1.433 ± 0.01 Intensity (arb. unit) Mobility [1/K0] 136% 0% 100% PE 20:1_22:6 m/z 816.5517 ± 0.0122 1/K0 1.404 ± 0.01 m/z 1 mm CCS-aware SpatialOMx® enables…
Key words
ccs, ccsmaldi, maldispatialomx, spatialomximaging, imagingaware, awareannotation, annotationmobility, mobilitytims, timsannotated, annotatedannotations, annotationspasef, pasefbucket, bucketlipid, lipidsegmentation, segmentationlipids