Species-level microbial sequence classification is improved by source-environment information
Abstract
Popular naive Bayes taxonomic classifiers for amplicon sequences assume that all species in the reference database are equally likely to be observed. We demonstrate that classification accuracy degrades linearly with the degree to which that assumption is violated, and in practice it is always violated. By incorporating environment-specific taxonomic abundance information, we demonstrate that species-level resolution is attainable.
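The effect of the equal-likelihood assumption can be illustrated with a toy example. The sketch below is not the authors' implementation; it is a minimal k-mer naive Bayes classifier written for illustration, and the species names, reference sequences, and abundance weights are all hypothetical. It shows that when two reference species share the region covered by a read, a uniform prior cannot separate them, while environment-specific abundance weights can.

```python
import math
from collections import Counter


def kmers(seq, k=3):
    """Return all overlapping k-mers of a sequence."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def train(references, k=3, alpha=1.0):
    """Build per-taxon log-likelihood tables with additive smoothing."""
    vocab = sorted({km for seq in references.values() for km in kmers(seq, k)})
    model = {}
    for taxon, seq in references.items():
        counts = Counter(kmers(seq, k))
        # One extra pseudo-count slot is reserved for k-mers outside the vocabulary.
        total = sum(counts.values()) + alpha * (len(vocab) + 1)
        table = {km: math.log((counts[km] + alpha) / total) for km in vocab}
        unseen = math.log(alpha / total)
        model[taxon] = (table, unseen)
    return model


def classify(model, query, priors=None, k=3):
    """Score each taxon as log prior + summed k-mer log likelihoods."""
    taxa = list(model)
    if priors is None:
        # The assumption under discussion: every taxon is equally likely.
        priors = {t: 1.0 / len(taxa) for t in taxa}
    scores = {}
    for t in taxa:
        table, unseen = model[t]
        score = math.log(priors[t])
        for km in kmers(query, k):
            score += table.get(km, unseen)
        scores[t] = score
    return max(scores, key=scores.get), scores


# Hypothetical references that differ only in their final base.
references = {
    "species_A": "ACGTACGTGGA",
    "species_B": "ACGTACGTGGC",
}
model = train(references)

# A read drawn from the region the two references share.
query = "ACGTACGT"

# Uniform prior: both taxa receive the same score, so the call is arbitrary.
uniform_best, uniform_scores = classify(model, query)

# Hypothetical environment-specific weights: species_B dominates this habitat.
weights = {"species_A": 0.05, "species_B": 0.95}
weighted_best, weighted_scores = classify(model, query, priors=weights)

print(uniform_scores)
print(weighted_best, weighted_scores)
```

With the uniform prior the two scores are identical, so the reported taxon depends only on tie-breaking order. With the abundance weights, the score gap equals the log ratio of the two weights, and the more abundant species is chosen.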
Permanent link
https://doi.org/10.3929/ethz-b-000431207
Publication status
published
Journal / series
bioRxiv
Publisher
Cold Spring Harbor Laboratory
Organisational unit
09714 - Bokulich, Nicholas / Bokulich, Nicholas
Related publications and datasets
Is previous version of: https://doi.org/10.3929/ethz-b-000431166
ETH Bibliography
no