Deep learning methods applied tothe analysis of metabolomics data
Posters | 2019 | ShimadzuInstrumentation
Chromatographic peak integration is a critical step in metabolomic data analysis, impacting quantitative accuracy and downstream biological interpretation. Manual integration is time-consuming and subject to operator variability, creating a bottleneck in high-throughput studies.
This study introduces a parameter-free deep learning approach based on the Single Shot MultiBox Detector architecture to automate peak integration in LC-MS/MS metabolomics. The primary goals were to establish a unified integration standard and exceed 90% accuracy in peak detection and area calculation.
The integration problem was reformulated as an object detection task, transforming chromatograms into multi-channel inputs including intensity and its derivatives. The SSD model outputs confidence scores and boundary offsets for peak start and end points. A data augmentation pipeline generated synthetic chromatograms by merging peaks into blank runs, expanding the training set from 11,011 to over 73,000 labeled examples. Evaluation metrics included precision, recall, F-measure, and a novel correction time metric that quantifies manual adjustment effort based on false positives, false negatives, and baseline corrections.
The deep learning method achieved a precision of 0.883, recall of 0.938, and F-measure of 0.910, outperforming optimized i-PeakFinder and Chromatopac software. The estimated correction time was reduced by 43% compared to i-PeakFinder and by 65% compared to Chromatopac. Qualitative comparisons showed high concordance with expert operator integrations across diverse metabolite panels.
The automated, parameter-free workflow enhances throughput, minimizes operator bias, and standardizes peak integration across varied sample matrices. It supports high-volume metabolomics studies in food analysis, clinical research, and industrial quality control.
Future developments may include adaptation to other chromatographic platforms, integration with automated metabolite annotation pipelines, augmentation with larger and more diverse datasets, and deployment in real-time data processing environments.
Parameter-free deep learning delivers reliable peak integration with over 90% accuracy and substantially reduces manual intervention, offering a scalable solution for high-throughput metabolomics workflows.
Software, LC/MS, LC/MS/MS
IndustriesMetabolomics
ManufacturerShimadzu
Summary
Importance of the Topic
Chromatographic peak integration is a critical step in metabolomic data analysis, impacting quantitative accuracy and downstream biological interpretation. Manual integration is time-consuming and subject to operator variability, creating a bottleneck in high-throughput studies.
Objectives and Overview of the Study
This study introduces a parameter-free deep learning approach based on the Single Shot MultiBox Detector architecture to automate peak integration in LC-MS/MS metabolomics. The primary goals were to establish a unified integration standard and exceed 90% accuracy in peak detection and area calculation.
Methodology and Instrumentation
The integration problem was reformulated as an object detection task, transforming chromatograms into multi-channel inputs including intensity and its derivatives. The SSD model outputs confidence scores and boundary offsets for peak start and end points. A data augmentation pipeline generated synthetic chromatograms by merging peaks into blank runs, expanding the training set from 11,011 to over 73,000 labeled examples. Evaluation metrics included precision, recall, F-measure, and a novel correction time metric that quantifies manual adjustment effort based on false positives, false negatives, and baseline corrections.
Instrumentation
- High-performance liquid chromatography–tandem mass spectrometry system (Shimadzu Corporation)
- Computational framework for deep learning implementation
Main Results and Discussion
The deep learning method achieved a precision of 0.883, recall of 0.938, and F-measure of 0.910, outperforming optimized i-PeakFinder and Chromatopac software. The estimated correction time was reduced by 43% compared to i-PeakFinder and by 65% compared to Chromatopac. Qualitative comparisons showed high concordance with expert operator integrations across diverse metabolite panels.
Benefits and Practical Applications of the Method
The automated, parameter-free workflow enhances throughput, minimizes operator bias, and standardizes peak integration across varied sample matrices. It supports high-volume metabolomics studies in food analysis, clinical research, and industrial quality control.
Future Trends and Opportunities
Future developments may include adaptation to other chromatographic platforms, integration with automated metabolite annotation pipelines, augmentation with larger and more diverse datasets, and deployment in real-time data processing environments.
Conclusion
Parameter-free deep learning delivers reliable peak integration with over 90% accuracy and substantially reduces manual intervention, offering a scalable solution for high-throughput metabolomics workflows.
References
- Shimadzu Corporation. LabSolutions Data Integrity. Available from Shimadzu website.
- Shimadzu Corporation. Chromatopac User Manual.
- Liu W, Anguelov D, Erhan D, et al. SSD: Single Shot MultiBox Detector. arXiv preprint arXiv:1512.02325; 2015.
Content was automatically generated from an orignal PDF document using AI and may contain inaccuracies.
Similar PDF
Validating and Comparing Component Detection Algorithms for LC-MS Data Assignment
2015|Thermo Fisher Scientific|Posters
who provided the sample and performed the mass spectrometry experiment. wher for nu of Tru There are three types of samples that we can consider for annotation: a) a simple mixture of one/several pure analytes b) a complex mixture of…
Key words
annotations, annotationsexhaustive, exhaustivetrue, truetotalrecall, totalrecallalgorithms, algorithmsrecall, recallbuspirone, buspironedatasets, datasetsannotation, annotationfeature, featuretargeted, targetedalgorithma, algorithmaalgorithmb, algorithmbannotated, annotatedmanual
Efficient Peak Integration for Metabolomics data Using Peakintelligence
2022|Shimadzu|Technical notes
High Performance Liquid Chromatograph Mass Spectrometer Efficient Peak Integration for Metabolomics data Using PeakintelligenceTM Application News Takanari Hattori, Miho Kawashima, Hiroyuki Yasuda, Junko Iida User Benefits The peak processing software Peakintelligence reduces the time and effort required for peak…
Key words
peakintelligence, peakintelligenceintegration, integrationtaught, taughtprevious, previouspeak, peakincorrectly, incorrectlyalgorithms, algorithmslearned, learnedmetabolomics, metabolomicscomputer, computeralgorithm, algorithmparameter, parameterdata, datamachine, machineshorten
Efficient Peak Integration for Metabolomics Data Using Peakintelligence -Application to Single Quadrupole LC-MS data -
2022|Shimadzu|Applications
High Performance Liquid Chromatograph Mass Spectrometer Efficient Peak Integration for Metabolomics Data Using PeakintelligenceTM - Application to Single Quadrupole LC-MS data - Application News Takanari Hattori, Miho Kawashima, Hiroyuki Yasuda, Junko Iida User Benefits The peak processing software Peakintelligence…
Key words
peakintelligence, peakintelligenceintegration, integrationpeak, peaktaught, taughtprevious, previousalgorithms, algorithmslearned, learnedmetabolomics, metabolomicscomputer, computerdata, dataalgorithm, algorithmincorrectly, incorrectlymachine, machinesetting, settingmanual
Efficient Peak Integration for Analyzing Residual Pesticides in Foods Using Peakintelligence™ LC/MS Peak Processing Software
2022|Shimadzu|Applications
LC/MS Application News Efficient Peak Integration for Analyzing Residual Pesticides in Foods Using Peakintelligence™ LC/MS Peak Processing Software Miho Kawashima User Benefits Reduces the time and trouble required for peak integration. With no need to specify parameter settings,…
Key words
peakintelligence, peakintelligenceintegration, integrationpeak, peaktaught, taughtprevious, previouspeaks, peaksincorrectly, incorrectlymisdetected, misdetectedpesticides, pesticidesmachine, machinelearning, learningresidual, residualalgorithms, algorithmslearned, learnedcomputer