
Open access
Date
2021-07-07Type
- Working Paper
ETH Bibliography
yes
Altmetrics
Abstract
Structural elucidation of small molecules de novo from mass spectra is a longstanding, yet unsolved problem. Current methods rely on finding some similarity with spectra of known compounds deposited in spectral libraries, but do not solve the problem of predicting structures for novel or poorly represented compound classes. We present MSNovelist that combines fingerprint prediction with an encoder-decoder neural network to generate structures de novo from fragment spectra. In evaluation, MSNovelist correctly reproduced 61% of database annotations for a GNPS reference dataset. In a bryophyte MS2 dataset, our de novo structure prediction substantially outscored the best database candidate for seven features, and a potential novel natural product with a flavonoid core was identified. MSNovelist allows predicting structures solely from MS2 data, and is therefore ideally suited to complement library-based annotation in the case of poorly represented analyte classes and novel compounds. Show more
Permanent link
https://doi.org/10.3929/ethz-b-000525489Publication status
publishedExternal links
Journal / series
bioRxivPublisher
Cold Spring Harbor LaboratoryOrganisational unit
08839 - Zamboni, Nicola (Tit.-Prof.)
More
Show all metadata
ETH Bibliography
yes
Altmetrics