Novor.ai – Increased precision and accuracy utilizing an AI model for de novo sequencing
Posters | 2025 | Bruker | ASMSInstrumentation
Peptide de novo sequencing enables the determination of peptide sequences without relying on existing protein databases, which is essential for immunopeptidomics, metaproteomics and other applications lacking enzyme specificity. High-resolution instruments like timsTOF generate complex spectra that demand accurate and fast de novo algorithms to uncover novel epitopes and biomarkers.
This work introduces Novor.ai, an AI-driven enhancement of the established Novor de novo sequencing algorithm. The primary objective was to assess its precision and recall on previously unseen MHC class I and II datasets, comparing performance against the existing BPS Novor algorithm optimized for timsTOF data.
Novor.ai achieved an average 19% improvement in correct amino acid identifications and a 16% peptide‐level gain on the MHC-I dataset compared to BPS Novor. For MHC-II data, peptide assignment accuracy increased by 26%. Precision–recall curves and percentage‐correct metrics highlight these performance gains. Although Novor.ai operates more slowly than BPS Novor, the marked accuracy improvements justify further optimization. Direct comparisons with other AI de novo tools remain to be conducted.
Further refinement of model parameters and architecture is expected to boost processing speed and accuracy. Expanding benchmarks to include other AI de novo sequencing tools and diverse instrument platforms will validate broader applicability. Implementation of transfer learning for non‐timsTOF datasets and integration into clinical proteomics pipelines represent promising future directions.
Novor.ai significantly outperforms the existing BPS Novor algorithm in both precision and accuracy for timsTOF immunopeptidomic data. Continued optimization and comprehensive benchmarking against alternative AI models will establish its role in next‐generation de novo sequencing.
Software
IndustriesProteomics
ManufacturerBruker
Summary
Significance of the Topic
Peptide de novo sequencing enables the determination of peptide sequences without relying on existing protein databases, which is essential for immunopeptidomics, metaproteomics and other applications lacking enzyme specificity. High-resolution instruments like timsTOF generate complex spectra that demand accurate and fast de novo algorithms to uncover novel epitopes and biomarkers.
Goals and Study Overview
This work introduces Novor.ai, an AI-driven enhancement of the established Novor de novo sequencing algorithm. The primary objective was to assess its precision and recall on previously unseen MHC class I and II datasets, comparing performance against the existing BPS Novor algorithm optimized for timsTOF data.
Methodology and Used Instrumentation
- AI Model Design: Decision‐tree scoring functions were replaced by AI‐derived scoring functions, while final sequence assembly was performed by a dynamic programming algorithm that enforces precursor mass error tolerances.
- Training Data: More than 7.3 million MS2 spectra were used, sourced from NIST and MassIVE‐KB spectral libraries, the SysteMHC Atlas and a custom timsTOF MHC peptide library.
- Validation Data: MHC class I and II samples from Hoenisch‐Gravel et al. were run in triplicate; ground truth assignments were obtained via the TIMS2Rescore workflow in Bruker ProteoScape coupled with ProLuCID at 1% FDR.
- Instrumentation: Bruker timsTOF mass spectrometer with trapped ion mobility spectrometry, data processed through Bruker ProteoScape and TIMS2Rescore.
Main Results and Discussion
Novor.ai achieved an average 19% improvement in correct amino acid identifications and a 16% peptide‐level gain on the MHC-I dataset compared to BPS Novor. For MHC-II data, peptide assignment accuracy increased by 26%. Precision–recall curves and percentage‐correct metrics highlight these performance gains. Although Novor.ai operates more slowly than BPS Novor, the marked accuracy improvements justify further optimization. Direct comparisons with other AI de novo tools remain to be conducted.
Benefits and Practical Applications
- Improved confidence in immunopeptidomic analyses, enhancing tumor antigen discovery and epitope mapping workflows.
- Database‐independent sequencing suitable for metaproteomics and non‐specific enzymatic digestions.
- Valuable for laboratories using high‐resolution timsTOF platforms seeking robust de novo sequencing solutions.
Future Trends and Potential Applications
Further refinement of model parameters and architecture is expected to boost processing speed and accuracy. Expanding benchmarks to include other AI de novo sequencing tools and diverse instrument platforms will validate broader applicability. Implementation of transfer learning for non‐timsTOF datasets and integration into clinical proteomics pipelines represent promising future directions.
Conclusion
Novor.ai significantly outperforms the existing BPS Novor algorithm in both precision and accuracy for timsTOF immunopeptidomic data. Continued optimization and comprehensive benchmarking against alternative AI models will establish its role in next‐generation de novo sequencing.
Reference
- Hoenisch‐Gravel N. et al. TOFIMS mass spectrometry‐based immunopeptidomics refines tumor antigen identification. Nat Commun 14, 7472 (2023).
Content was automatically generated from an orignal PDF document using AI and may contain inaccuracies.
Similar PDF
A CCS-centric HLA-specific trained de novo module for precise and accurate real-time immunopeptide identification on the Bruker ProteoScape platform
2024|Bruker|Posters
US HUPO 2024, ABSTRACT 651 A CCS-centric HLA-specific trained de novo module for precise and accurate real-time immunopeptide identification on the Bruker ProteoScape platform Rui Zhang1; Qixin Liu1; Mingjie Xie1; Dennis Trede2; Tharan Srikumar3; Jonathan Krieger3; Bin Ma1; George Rosenberger4…
Key words
novor, novormhc, mhcbps, bpsrecall, recallcorrect, correctproteoscape, proteoscapenovo, novotimstof, timstofmodel, modelimmunopeptidomics, immunopeptidomicsscoring, scoringpeptide, peptidesequencing, sequencingprecision, precisionbruker
Precise and accurate real-time de novo sequencing of timsTOF data with the Novor algorithm on the Bruker ProteoScape™ platform
2023|Bruker|Posters
ASMS 2023 Precise and accurate real-time de novo sequencing of timsTOF data with the Novor algorithm on the Bruker ProteoScape™ platform The results suggest even greater performance improvements of BPS Novor for challenging immunopeptidomic applications, where typically a lower fraction…
Key words
novor, novorbps, bpsnovo, novosequencing, sequencingprolucid, prolucidreal, realrecall, recalltraining, trainingtimstof, timstofdataset, datasetpeptide, peptidecorrect, correctpre, preproteoscape, proteoscapegpu
Real-time de novo sequencing of peptide antigens using Bruker ProteoScape™ for 'Run & Done' 4D-immunopeptidomics
2023|Bruker|Posters
ASMS 2023 - THP 378 Real-time de novo sequencing of peptide antigens using Bruker ProteoScape™ for 'Run & Done' 4D-immunopeptidomics A 5000 4500 4000 3500 3000 2500 2000 1500 1000 500 0 1Monash University, Clayton, Australia, 2Rapid Novor Inc., Kitchener,…
Key words
novor, novorbps, bpsproteoscape, proteoscapenovo, novochymo, chymoelastase, elastasehye, hyealgorithm, algorithmimmunopeptidomics, immunopeptidomicsgluc, glucbruker, brukerreal, realcorrect, correctpepsin, pepsinrecall
TIMSrescore: timsTOF-optimized PSM rescoring boosts identification rates for immunopeptidomics
2024|Bruker|Posters
US HUPO 2024, ABSTRACT 641 TIMSrescore: timsTOF-optimized PSM rescoring boosts identification rates for immunopeptidomics Dennis Trede1, Arthur Declercqi2, Ralf Gabriels2, Robbin Bouwmeester2, Jonathan Krieger3, Tharan Srikumar3, Sven Degroeve2 and Lennart Martens2 1Bruker Daltonik GmbH, Fahrenheitstraße 4, 28359 Bremen, Germany 2VIB-UGent…
Key words
timsrescore, timsrescoremhc, mhcengine, enginetimstof, timstofelastase, elastaserescoring, rescoringsearch, searchpsms, psmsrcc, rccramp, rampimmunopeptidomics, immunopeptidomicspeptides, peptidespeptide, peptidereprocessing, reprocessingramps