proGenomes3: approaching one million accurately and consistently annotated high-quality prokaryotic genomes
Abstract
The interpretation of genomic, transcriptomic and other microbial 'omics data is highly dependent on the availability of well-annotated genomes. As the number of publicly available microbial genomes continues to increase exponentially, the need for quality control and consistent annotation is becoming critical. We present proGenomes3, a database of 907 388 high-quality genomes containing 4 billion genes that passed stringent criteria and have been consistently annotated using multiple functional and taxonomic databases including mobile genetic elements and biosynthetic gene clusters. proGenomes3 encompasses 41 171 species-level clusters, defined based on universal single copy marker genes, for which pan-genomes and contextual habitat annotations are provided.
Permanent link
https://doi.org/10.3929/ethz-b-000590583
Publication status
published
Journal / series
Nucleic Acids Research
Publisher
Oxford University Press
Organisational unit
09583 - Sunagawa, Shinichi / Sunagawa, Shinichi
Funding
184955 - Resolving ocean microbial micro-diversity and its environmental drivers at global scale (SNF)