Bridging data management platforms and visualization tools to enable ad-hoc and smart analytics in life sciences
Open access
Date
2022-12Type
- Conference Paper
Abstract
Core facilities have to offer technologies that best serve the needs of their users and provide them a competitive advantage in research. They have to set up and maintain instruments in the range of ten to a hundred, which produce large amounts of data and serve thousands of active projects and customers. Particular emphasis has to be given to the reproducibility of the results. More and more, the entire process from building the research hypothesis, conducting the experiments, doing the measurements, through the data explorations and analysis is solely driven by very few experts in various scientific fields. Still, the ability to perform the entire data exploration in real-time on a personal computer is often hampered by the heterogeneity of software, the data structure formats of the output, and the enormous data sizes. These impact the design and architecture of the implemented software stack. At the Functional Genomics Center Zurich (FGCZ), a joint state-of-the-art research and training facility of ETH Zurich and the University of Zurich, we have developed the B-Fabric system, which has served for more than a decade, an entire life sciences community with fundamental data science support. In this paper, we sketch how such a system can be used to glue together data (including metadata), computing infrastructures (clusters and clouds), and visualization software to support instant data exploration and visual analysis. We illustrate our in-daily life implemented approach using visualization applications of mass spectrometry data. Show more
Permanent link
https://doi.org/10.3929/ethz-b-000571188Publication status
publishedExternal links
Journal / series
Journal of Integrative BioinformaticsVolume
Pages / Article No.
Publisher
De GruyterEvent
Subject
accessible; findable; interoperable and reusable (FAIR); integrations for data analysis; open research data (ORD); workflowOrganisational unit
02207 - Functional Genomics Center Zurich / Functional Genomics Center Zurich
More
Show all metadata