Journal: PLoS Computational Biology
Loading...
Abbreviation
PLOS comput. biol.
Publisher
PLOS
240 results
Search Results
Publications 1 - 10 of 240
- TreeKnit: Inferring ancestral reassortment graphs of influenza virusesItem type: Journal Article
PLoS Computational BiologyBarrat-Charlaix, Pierre; Vaughan, Timothy G.; Neher, Richard A. (2022)When two influenza viruses co-infect the same cell, they can exchange genome segments in a process known as reassortment. Reassortment is an important source of genetic diversity and is known to have been involved in the emergence of most pandemic influenza strains. However, because of the difficulty in identifying reassortment events from viral sequence data, little is known about their role in the evolution of the seasonal influenza viruses. Here we introduce TreeKnit, a method that infers ancestral reassortment graphs (ARG) from two segment trees. It is based on topological differences between trees, and proceeds in a greedy fashion by finding regions that are compatible in the two trees. Using simulated genealogies with reassortments, we show that TreeKnit performs well in a wide range of settings and that it is as accurate as a more principled bayesian method, while being orders of magnitude faster. Finally, we show that it is possible to use the inferred ARG to better resolve segment trees and to construct more informative visualizations of reassortments. - Nutritional Systems Biology Modeling: From Molecular Mechanisms to PhysiologyItem type: Journal Article
PLoS Computational BiologyGraaf, Albert A. de; Freidig, Andreas P.; De Roos, Baukje; et al. (2009)The use of computational modeling and simulation has increased in many biological fields, but despite their potential these techniques are only marginally applied in nutritional sciences. Nevertheless, recent applications of modeling have been instrumental in answering important nutritional questions from the cellular up to the physiological levels. Capturing the complexity of today's important nutritional research questions poses a challenge for modeling to become truly integrative in the consideration and interpretation of experimental data at widely differing scales of space and time. In this review, we discuss a selection of available modeling approaches and applications relevant for nutrition. We then put these models into perspective by categorizing them according to their space and time domain. Through this categorization process, we identified a dearth of models that consider processes occurring between the microscopic and macroscopic scale. We propose a “middle-out” strategy to develop the required full-scale, multilevel computational models. Exhaustive and accurate phenotyping, the use of the virtual patient concept, and the development of biomarkers from “-omics” signatures are identified as key elements of a successful systems biology modeling approach in nutrition research—one that integrates physiological mechanisms and data at multiple space and time scales. - Precise determination of input-output mapping for multimodal gene circuits using data from transient transfectionItem type: Journal Article
PLoS Computational BiologyStelzer, Christoph; Benenson, Yaakov (2020)The mapping of molecular inputs to their molecular outputs (input/output, I/O mapping) is an important characteristic of gene circuits, both natural and synthetic. Experimental determination of such mappings for synthetic circuits is best performed using stably integrated genetic constructs. In mammalian cells, stable integration of complex circuits is a time-consuming process that hampers rapid characterization of multiple circuit variants. On the other hand, transient transfection is quick. However, it is an extremely noisy process and it is unclear whether the obtained data have any relevance to the input/output mapping of a circuit obtained in the case of a stable integration. Here we describe a data processing workflow, Peakfinder algorithm for flow cytometry data (PFAFF), that allows extracting precise input/output mapping from single-cell protein expression data gathered by flow cytometry after a transient transfection. The workflow builds on the numerically-proven observation that the multivariate modes of input and output expression of multi-channel flow cytometry datasets, pre-binned by the expression level of an independent transfection reporter gene, harbor cells with circuit gene copy numbers distributions that depend deterministically on the properties of a bin. We validate our method by simulating flow cytometry data for seven multi-node circuit architectures, including a complex bi-modal circuit, under stable integration and transient transfection scenarios. The workflow applied to the simulated transient transfection data results in similar conclusions to those reached with simulated stable integration data. This indicates that the input/output mapping derived from transient transfection data using our method is an excellent approximation of the ground truth. Thus, the method allows to determine input/output mapping of complex gene network using noisy transient transfection data. - Mixture-of-Experts Variational Autoencoder for clustering and generating from similarity-based representations on single cell dataItem type: Journal Article
PLoS Computational BiologyKopf, Andreas; Fortuin, Vincent; Somnath, Vignesh Ram; et al. (2021)Clustering high-dimensional data, such as images or biological measurements, is a long-standing problem and has been studied extensively. Recently, Deep Clustering has gained popularity due to its flexibility in fitting the specific peculiarities of complex data. Here we introduce the Mixture-of-Experts Similarity Variational Autoencoder (MoE-Sim-VAE), a novel generative clustering model. The model can learn multi-modal distributions of high-dimensional data and use these to generate realistic data with high efficacy and efficiency. MoE-Sim-VAE is based on a Variational Autoencoder (VAE), where the decoder consists of a Mixture-of-Experts (MoE) architecture. This specific architecture allows for various modes of the data to be automatically learned by means of the experts. Additionally, we encourage the lower dimensional latent representation of our model to follow a Gaussian mixture distribution and to accurately represent the similarities between the data points. We assess the performance of our model on the MNIST benchmark data set and challenging real-world tasks of clustering mouse organs from single-cell RNA-sequencing measurements and defining cell subpopulations from mass cytometry (CyTOF) measurements on hundreds of different datasets. MoE-Sim-VAE exhibits superior clustering performance on all these tasks in comparison to the baselines as well as competitor methods. - Scrutinizing MHC-I Binding Peptides and Their Limits of VariationItem type: Journal Article
PLoS Computational BiologyKoch, Christian P.; Perna, Anna M.; Pillong, Max; et al. (2013)Designed peptides that bind to major histocompatibility protein I (MHC-I) allomorphs bear the promise of representing epitopes that stimulate a desired immune response. A rigorous bioinformatical exploration of sequence patterns hidden in peptides that bind to the mouse MHC-I allomorph H-2Kb is presented. We exemplify and validate these motif findings by systematically dissecting the epitope SIINFEKL and analyzing the resulting fragments for their binding potential to H-2Kb in a thermal denaturation assay. The results demonstrate that only fragments exclusively retaining the carboxy- or amino-terminus of the reference peptide exhibit significant binding potential, with the N-terminal pentapeptide SIINF as shortest ligand. This study demonstrates that sophisticated machine-learning algorithms excel at extracting fine-grained patterns from peptide sequence data and predicting MHC-I binding peptides, thereby considerably extending existing linear prediction models and providing a fresh view on the computer-based molecular design of future synthetic vaccines. The server for prediction is available at http://modlab-cadd.ethz.ch (SLiDER tool, MHC-I version 2012). - Systematic Analysis of Transcriptional and Post-transcriptional Regulation of Metabolism in YeastItem type: Journal Article
PLoS Computational BiologyGonçalves, Emanuel; Raguz Nakic, Zrinka; Zampieri, Mattia; et al. (2017)Cells react to extracellular perturbations with complex and intertwined responses. Systematic identification of the regulatory mechanisms that control these responses is still a challenge and requires tailored analyses integrating different types of molecular data. Here we acquired time-resolved metabolomics measurements in yeast under salt and pheromone stimulation and developed a machine learning approach to explore regulatory associations between metabolism and signal transduction. Existing phosphoproteomics measurements under the same conditions and kinase-substrate regulatory interactions were used to in silico estimate the enzymatic activity of signalling kinases. Our approach identified informative associations between kinases and metabolic enzymes capable of predicting metabolic changes. We extended our analysis to two studies containing transcriptomics, phosphoproteomics and metabolomics measurements across a comprehensive panel of kinases/phosphatases knockouts and time-resolved perturbations to the nitrogen metabolism. Changes in activity of transcription factors, kinases and phosphatases were estimated in silico and these were capable of building predictive models to infer the metabolic adaptations of previously unseen conditions across different dynamic experiments. Time-resolved experiments were significantly more informative than genetic perturbations to infer metabolic adaptation. This difference may be due to the indirect nature of the associations and of general cellular states that can hinder the identification of causal relationships. This work provides a novel genome-scale integrative analysis to propose putative transcriptional and post-translational regulatory mechanisms of metabolic processes. - From "What Is?" to "What Isn't?" Computational BiologyItem type: Other Journal Item
PLoS Computational BiologyNussinov, Ruth; Bonhoeffer, Sebastian; Papin, Jason A.; et al. (2015) - ResMiCo: Increasing the quality of metagenome-assembled genomes with deep learningItem type: Journal Article
PLoS Computational BiologyMineeva, Olga; Danciu, Daniel; Schölkopf, Bernhard; et al. (2023)The number of published metagenome assemblies is rapidly growing due to advances in sequencing technologies. However, sequencing errors, variable coverage, repetitive genomic regions, and other factors can produce misassemblies, which are challenging to detect for taxonomically novel genomic data. Assembly errors can affect all downstream analyses of the assemblies. Accuracy for the state of the art in reference-free misassembly prediction does not exceed an AUPRC of 0.57, and it is not clear how well these models generalize to real-world data. Here, we present the Residual neural network for Misassembled Contig identification (ResMiCo), a deep learning approach for reference-free identification of misassembled contigs. To develop ResMiCo, we first generated a training dataset of unprecedented size and complexity that can be used for further benchmarking and developments in the field. Through rigorous validation, we show that ResMiCo is substantially more accurate than the state of the art, and the model is robust to novel taxonomic diversity and varying assembly methods. ResMiCo estimated 7% misassembled contigs per metagenome across multiple real-world datasets. We demonstrate how ResMiCo can be used to optimize metagenome assembly hyperparameters to improve accuracy, instead of optimizing solely for contiguity. The accuracy, robustness, and ease-of-use of ResMiCo make the tool suitable for general quality control of metagenome assemblies and assembly methodology optimization. - Ten simple rules to host an inclusive conferenceItem type: Other Journal Item
PLoS Computational BiologyJoo, Rocío; Sánchez-Tapia, Andrea; Mortara, Sara; et al. (2022)Conferences are spaces to meet and network within and across academic and technical fields, learn about new advances, and share our work. They can help define career paths and create long-lasting collaborations and opportunities. However, these opportunities are not equal for all. This article introduces 10 simple rules to host an inclusive conference based on the authors’ recent experience organizing the 2021 edition of the useR! statistical computing conference, which attracted a broad range of participants from academia, industry, government, and the nonprofit sector. Coming from different backgrounds, career stages, and even continents, we embraced the challenge of organizing a high-quality virtual conference in the context of the Coronavirus Disease 2019 (COVID-19) pandemic and making it a kind, inclusive, and accessible experience for as many people as possible. The rules result from our lessons learned before, during, and after the organization of the conference. They have been written mainly for potential organizers and selection committees of conferences and contain multiple practical tips to help a variety of events become more accessible and inclusive. We see this as a starting point for conversations and efforts towards building more inclusive conferences across the world. - Inferring Epidemic Contact Structure from Phylogenetic TreesItem type: Journal Article
PLoS Computational BiologyLeventhal, Gabriel E.; Kouyos, Roger; Stadler, Tanja; et al. (2012)Contact structure is believed to have a large impact on epidemic spreading and consequently using networks to model such contact structure continues to gain interest in epidemiology. However, detailed knowledge of the exact contact structure underlying real epidemics is limited. Here we address the question whether the structure of the contact network leaves a detectable genetic fingerprint in the pathogen population. To this end we compare phylogenies generated by disease outbreaks in simulated populations with different types of contact networks. We find that the shape of these phylogenies strongly depends on contact structure. In particular, measures of tree imbalance allow us to quantify to what extent the contact structure underlying an epidemic deviates from a null model contact network and illustrate this in the case of random mixing. Using a phylogeny from the Swiss HIV epidemic, we show that this epidemic has a significantly more unbalanced tree than would be expected from random mixing.
Publications 1 - 10 of 240