Journal: Genome Biology and Evolution
Loading...
Abbreviation
Genome Biol Evol
Publisher
Oxford University Press
27 results
Search Results
Publications 1 - 10 of 27
- The genome of the rayed Mediterranean limpet Patella caerulea (Linnaeus, 1758)Item type: Journal Article
Genome Biology and EvolutionHalstead-Nussloch, Gwyneth; Signorini, Silvia Giorgia; Giulio, Marco; et al. (2024)Patella caerulea (Linnaeus, 1758) is a mollusc limpet species of the class Gastropoda. Endemic to the Mediterranean Sea, it is considered a keystone species due to its primary role in structuring and regulating the ecological balance of tidal and subtidal habitats. It is currently being used as a bioindicator to assess the environmental quality of coastal marine waters and as a model species to understand adaptation to ocean acidification. Here, we provide a high-quality reference genome assembly and annotation for P. caerulea. We generated similar to 30 Gb of Pacific Biosciences high-fidelity data from a single individual and provide a final 749.8 Mb assembly containing 62 contigs, including the mitochondrial genome (14,938 bp). With an N50 of 48.8 Mb and 98% of the assembly contained in the 18 largest contigs, this assembly is near chromosome-scale. Benchmarking Universal Single-Copy Orthologs scores were high (Mollusca, 87.8% complete; Metazoa, 97.2% complete) and similar to metrics observed for other chromosome-level Patella genomes, highlighting a possible bias in the Mollusca database for Patellids. We generated transcriptomic Illumina data from a second individual collected at the same locality and used it together with protein evidence to annotate the genome. A total of 23,938 protein-coding gene models were found. By comparing this annotation with other published Patella annotations, we found that the distribution and median values of exon and gene lengths was comparable with other Patella species despite different annotation approaches. The present high-quality P. caerulea reference genome, available on GenBank (BioProject: PRJNA1045377; assembly: GCA_036850965.1), is an important resource for future ecological and evolutionary studies. - Impact of Transposable Elements on Methylation and Gene Expression across Natural Accessions of Brachypodium distachyonItem type: Journal Article
Genome Biology and EvolutionWyler, Michele; Stritt, Christoph; Walser, Jean-Claude; et al. (2020)Transposable elements (TEs) constitute a large fraction of plant genomes and are mostly present in a transcriptionally silent state through repressive epigenetic modifications, such as DNA methylation. TE silencing is believed to influence the regulation of adjacent genes, possibly as DNA methylation spreads away from the TE. Whether this is a general principle or a context-dependent phenomenon is still under debate, pressing for studying the relationship between TEs, DNA methylation, and nearby gene expression in additional plant species. Here, we used the grass Brachypodium distachyon as a model and produced DNA methylation and transcriptome profiles for 11 natural accessions. In contrast to what is observed in Arabidopsis thaliana, we found that TEs have a limited impact on methylation spreading and that only few TE families are associated with a low expression of their adjacent genes. Interestingly, we found that a subset of TE insertion polymorphisms is associated with differential gene expression across accessions. Thus, although not having a global impact on gene expression, distinct TE insertions may contribute to specific gene expression patterns in B. distachyon. - Genome-wide detection of genes under positive selection in worldwide populations of the barley scald pathogenItem type: Journal Article
Genome Biology and EvolutionMohd-Assaad, Norfarhan; McDonald, Bruce A.; Croll, Daniel (2018)Coevolution between hosts and pathogens generates strong selection pressures to maintain resistance and infectivity, respectively. Genomes of plant pathogens often encode major effect loci for the ability to successfully infect specific host genotypes. Hence, spatial heterogeneity in host genotypes coupled with abiotic factors could lead to locally adapted pathogen populations. However, the genetic basis of local adaptation is poorly understood. Rhynchosporium commune, the pathogen causing barley scald disease, interacts at least partially in a gene-for-gene manner with its host. We analyzed global field populations of 125 R. commune isolates to identify candidate genes for local adaptation. Whole genome sequencing data showed that the pathogen is subdivided into three genetic clusters associated with distinct geographic and climatic regions. Using haplotype-based selection scans applied independently to each genetic cluster, we found strong evidence for selective sweeps throughout the genome. Comparisons of loci under selection among clusters revealed little overlap, suggesting that ecological differences associated with each cluster led to variable selection regimes. The strongest signals of selection were found predominantly in the two clusters composed of isolates from Central Europe and Ethiopia. The strongest selective sweep regions encoded protein functions related to biotic and abiotic stress responses. Selective sweep regions were enriched in genes encoding functions in cellular localization, protein transport activity, and DNA damage responses. In contrast to the prevailing view that a small number of gene-for-gene interactions govern plant pathogen evolution, our analyses suggest that the evolutionary trajectory is largely determined by spatially heterogeneous biotic and abiotic selection pressures. - Endosperm and Seed Transcriptomes Reveal Possible Roles for Small RNA Pathways in Wild Tomato Hybrid Seed FailureItem type: Journal Article
Genome Biology and EvolutionFlorez-Rueda, Ana Marcela; Fiscalini, Flurin; Roth, Morgane; et al. (2021)Crosses between the wild tomato species Solanum peruvianum and Solanum chilense result in hybrid seed failure (HSF), characterized by endosperm misdevelopment and embryo arrest. We previously showed that genomic imprinting, the parent-of-origin dependent expression of alleles, is perturbed in the hybrid endosperm, with many of the normally paternally expressed genes losing their imprinted status. Here, we report transcriptome-based analyses of gene and small RNA (sRNA) expression levels. We identified 2,295 genes and 387 sRNA clusters as differentially expressed when comparing reciprocal hybrid seed to seeds and endosperms from the two within-species crosses. Our analyses uncovered a pattern of overdominance in endosperm gene expression in both hybrid cross directions, in marked contrast to the patterns of sRNA expression in whole seeds. Intriguingly, patterns of increased gene expression resemble the previously reported increased maternal expression proportions in hybrid endosperms. We identified physical clusters of sRNAs; differentially expressed sRNAs exhibit reduced transcript abundance in hybrid seeds of both cross directions. Moreover, sRNAs map to genes coding for key proteins involved in epigenetic regulation of gene expression, suggesting a regulatory feedback mechanism. We describe examples of genes that appear to be targets of sRNA-mediated gene silencing; in these cases, reduced sRNA abundance is concomitant with increased gene expression in hybrid seeds. Our analyses also show that S. peruvianum dominance impacts gene and sRNA expression in hybrid seeds. Overall, our study indicates roles for sRNA-mediated epigenetic regulation in HSF between closely related wild tomato species. - Mutation Rate and Effective Population Size of the Model Cooperative Bacterium Myxococcus xanthusItem type: Journal Article
Genome Biology and EvolutionWielgoss, Sebastien; Van Dyken, James David; Velicer, Gregory J. (2024)Intrinsic rates of genetic mutation have diverged greatly across taxa and exhibit statistical associations with several other parameters and features. These include effective population size (Nₑ), genome size, and gametic multicellularity, with the latter being associated with both increased mutation rates and decreased effective population sizes. However, data sufficient to test for possible relationships between microbial multicellularity and mutation rate (µ) are lacking. Here, we report estimates of two key population-genetic parameters, Nₑ and µ, for Myxococcus xanthus, a bacterial model organism for the study of aggregative multicellular development, predation, and social swarming. To estimate µ, we conducted an ∼400-day mutation accumulation experiment with 46 lineages subjected to regular single colony bottlenecks prior to clonal regrowth. Upon conclusion, we sequenced one clonal-isolate genome per lineage. Given collective evolution for 85,323 generations across all lines, we calculate a per base-pair mutation rate of ∼5.5 × 10⁻¹⁰ per site per generation, one of the highest mutation rates among free-living eubacteria. Given our estimate of µ, we derived Nₑ at ∼10⁷ from neutral diversity at four-fold degenerate sites across two dozen M. xanthus natural isolates. This estimate is below average for eubacteria and strengthens an already clear negative correlation between µ and Nₑ in prokaryotes. The higher and lower than average mutation rate and Nₑ for M. xanthus, respectively, amplify the question of whether any features of its multicellular life cycle—such as group-size reduction during fruiting-body development—or its highly structured spatial distribution have significantly influenced how these parameters have evolved. - Host-Associated Genomic Features of the Novel Uncultured Intracellular Pathogen Ca. Ichthyocystis Revealed by Direct Sequencing of EpitheliocystsItem type: Journal Article
Genome Biology and EvolutionQi, Weihong; Vaughan, Lloyd; Katharios, Pantelis; et al. (2016)Advances in single-cell and mini-metagenome sequencing have enabled important investigations into uncultured bacteria. In this study, we applied the mini-metagenome sequencing method to assemble genome drafts of the uncultured causative agents of epitheliocystis, an emerging infectious disease in the Mediterranean aquaculture species gilthead seabream. We sequenced multiple cyst samples and constructed 11 genome drafts from a novel beta-proteobacterial lineage, Candidatus Ichthyocystis. The draft genomes demonstrate features typical of pathogenic bacteria with an obligate intracellular lifestyle: a reduced genome of up to 2.6 Mb, reduced G + C content, and reduced metabolic capacity. Reconstruction of metabolic pathways reveals that Ca . Ichthyocystis genomes lack all amino acid synthesis pathways, compelling them to scavenge from the fish host. All genomes encode type II, III, and IV secretion systems, a large repertoire of predicted effectors, and a type IV pilus. These are all considered to be virulence factors, required for adherence, invasion, and host manipulation. However, no evidence of lipopolysaccharide synthesis could be found. Beyond the core functions shared within the genus, alignments showed distinction into different species, characterized by alternative large gene families. These comprise up to a third of each genome, appear to have arisen through duplication and diversification, encode many effector proteins, and are seemingly critical for virulence. Thus, Ca . Ichthyocystis represents a novel obligatory intracellular pathogenic beta-proteobacterial lineage. The methods used: mini-metagenome analysis and manual annotation, have generated important insights into the lifestyle and evolution of the novel, uncultured pathogens, elucidating many putative virulence factors including an unprecedented array of novel gene families. - Ca. Endozoicomonas cretensis: A Novel Fish Pathogen Characterized by Genome PlasticityItem type: Journal Article
Genome Biology and EvolutionQi, Weihong; Cascarano, Maria C.; Schlapbach, Ralph; et al. (2018)Endozoicomonas bacteria are generally beneficial symbionts of diverse marine invertebrates including reef-building corals, sponges, sea squirts, sea slugs, molluscs, and Bryozoans. In contrast, the recently reported Ca. Endozoicomonas cretensis was identified as a vertebrate pathogen, causing epitheliocystis in fish larvae resulting in massive mortality. Here, we described the Ca. E. cretensis draft genome, currently undergoing genome decay as evidenced by massive insertion sequence (IS element) expansion and pseudogene formation. Many of the insertion sequences are also predicted to carry outward-directed promoters, implying that they may be able to modulate the expression of neighbouring coding sequences (CDSs). Comparative genomic analysis has revealed many Ca. E. cretensis-specific CDSs, phage integration and novel gene families. Potential virulence related CDSs and machineries were identified in the genome, including secretion systems and related effector proteins, and systems related to biofilm formation and directed cell movement. Mucin degradation would be of importance to a fish pathogen, and many candidate CDSs associated with this pathway have been identified. The genome may reflect a bacterium in the process of changing niche from symbiont to pathogen, through expansion of virulence genes and some loss of metabolic capacity. - Evolution and Diversity of the Wild Rice Oryza officinalis Complex, across Continents, Genome Types, and Ploidy LevelsItem type: Journal Article
Genome Biology and EvolutionShenton, Matt; Kobayashi, Masaaki; Terashima, Shin; et al. (2020)The Oryza officinalis complex is the largest species group in Oryza, with more than nine species from four continents, and is a tertiary gene pool that can be exploited in breeding programs for the improvement of cultivated rice. Most diploid and tetraploid members of this group have a C genome. Using a new reference C genome for the diploid species O. officinalis, and draft genomes for two other C genome diploid species Oryza eichingeri and Oryza rhizomatis, we examine the influence of transposable elements on genome structure and provide a detailed phylogeny and evolutionary history of the Oryza C genomes. The O. officinalis genome is 1.6 times larger than the A genome of cultivated Oryza sativa, mostly due to proliferation of Gypsy type long-terminal repeat transposable elements, but overall syntenic relationships are maintained with other Oryza genomes (A, B, and F). Draft genome assemblies of the two other C genome diploid species, Oryza eichingeri and Oryza rhizomatis, and short-read resequencing of a series of other C genome species and accessions reveal that after the divergence of the C genome progenitor, there was still a substantial degree of variation within the C genome species through proliferation and loss of both DNA and long-terminal repeat transposable elements. We provide a detailed phylogeny and evolutionary history of the Oryza C genomes and a genomic resource for the exploitation of the Oryza tertiary gene pool. - Ultralong Oxford Nanopore reads enable the development of a reference-grade perennial ryegrass genome assemblyItem type: Journal Article
Genome Biology and EvolutionFrei, Daniel; Veekman, Elisabeth; Grogg, Daniel; et al. (2021)Despite the progress made in DNA sequencing over the last decade, reconstructing telomere-to-telomere genome assemblies of large and repeat-rich eukaryotic genomes is still difficult. More accurate basecalls or longer reads could address this issue, but no current sequencing platform can provide both simultaneously. Perennial ryegrass (Lolium perenne L.) is an example of an important species for which the lack of a reference genome assembly hindered a swift adoption of genomics-based methods into breeding programs. To fill this gap, we optimized the Oxford Nanopore Technologies sequencing protocol, obtaining sequencing reads with a N50 of 62 kb—an unprecedented value for a plant sample. The assembly of such reads produced a highly complete (2.3 of 2.7 Gb), correct (QV 45), and contiguous (contig N50 and N90 11.74 and 3.34 Mb, respectively) genome assembly. We show how read length was key in determining the assembly contiguity. Sequence annotation revealed the dominance of transposable elements and repeated sequences (81.6% of the assembly) and identified 38,868 protein coding genes. Almost 90% of the bases could be anchored to seven pseudomolecules, providing the first high quality haploid reference assembly for perennial ryegrass. This protocol will enable producing longer Oxford Nanopore Technology reads for more plant samples and ushering forage grasses into modern genomics-assisted breeding programs. - Estimates of Positive Darwinian Selection Are Inflated by Errors in Sequencing, Annotation, and AlignmentItem type: Journal Article
Genome Biology and EvolutionSchneider, Adrian; Souvorov, Alexander; Sabath, Niv; et al. (2009)Published estimates of the proportion of positively selected genes (PSGs) in human vary over three orders of magnitude. In mammals, estimates of the proportion of PSGs cover an even wider range of values. We used 2,980 orthologous protein-coding genes from human, chimpanzee, macaque, dog, cow, rat, and mouse as well as an established phylogenetic topology to infer the fraction of PSGs in all seven terminal branches. The inferred fraction of PSGs ranged from 0.9% in human through 17.5% in macaque to 23.3% in dog. We found three factors that influence the fraction of genes that exhibit telltale signs of positive selection: the quality of the sequence, the degree of misannotation, and ambiguities in the multiple sequence alignment. The inferred fraction of PSGs in sequences that are deficient in all three criteria of coverage, annotation, and alignment is 7.2 times higher than that in genes with high trace sequencing coverage, “known” annotation status, and perfect alignment scores. We conclude that some estimates on the prevalence of positive Darwinian selection in the literature may be inflated and should be treated with caution.
Publications 1 - 10 of 27