Quantifying Markov Chain Monte Carlo Exploration of Tumour Progression Tree Spaces

Initialisation Strategies, Convergence Diagnostics & Multi-modalities


OPEN ACCESS

Author / Producer

Date

2023-10-23

Publication Type

Master Thesis

ETH Bibliography

yes

Citations

Altmetric
OPEN ACCESS

Data

Abstract

Understanding the mutational intra-tumour heterogeneity within tumours is crucial to developing effective personalised cancer therapies. Bayesian Markov chain Monte Carlo (MCMC) sampling schemes have proven successful and trusted in reconstructing tumour progression histories, particularly mutation trees. To understand the effectiveness of mutation tree MCMC methods and their required runtimes, it is crucial to understand how quickly the empirical distribution of the MCMC converges to the posterior distribution. We quantify the MCMC exploration of the mutation tree space for the landmark inference scheme SCITE using tree similarity measures. In this simulation study, the tree similarities map features informative of a tumour’s clonal expansion from the mutation tree space to a scalar space, allowing the study of the MCMC exploration. Quantification of the exploration is provided by the novel application of convergence diagnostics established in continuous space to the discrete space of mutation trees via tree similarities. Consequently, we estimate the required runtime of SCITE for simulated data, which may imply significantly reduced runtimes for real-world datasets. Further, we find the dependence of the initial state of the MCMC to vanish quickly. We recommend trialling the significant reduction of the warm-up period for real-world datasets, implying another reduction in required runtime. In the process of exploring initialisation strategies, we validated the performance of the fast heuristic inference method HUNTRESS. Lastly, we investigate the topology of the Bayesian tree posterior, which is thought to contain multi-modalities potentially. For simulated data, we did not find evidence for any multi-modalities justifying the design of SCITE as a single-chain MCMC scheme.

Publication status

published

External links

Editor

Contributors

Book title

Journal / series

Volume

Pages / Article No.

Publisher

ETH Zurich

Event

Edition / version

Methods

Software

Geographic location

Date collected

Date created

Subject

Cancer genomics; Tumor progression; Markov chain Monte Carlo (MCMC); Convergence diagnostics; Bayesian Inference; trees (mathematics)

Organisational unit

03790 - Beerenwinkel, Niko / Beerenwinkel, Niko check_circle
02219 - ETH AI Center / ETH AI Center

Notes

Funding

Related publications and datasets