Overcoming Observation Bias for Cancer Progression Modeling


Date

2023-12-05

Publication Type

Working Paper

ETH Bibliography

yes

Abstract

Cancers evolve by accumulating genetic alterations, such as mutations and copy number changes. The chronological order of these events is important for understanding the disease, but it is not directly observable from cross-sectional genomic data. Cancer progression models (CPMs), such as Mutual Hazard Networks (MHNs), reconstruct the progression dynamics of tumors by learning a network of causal interactions between genetic events from their co-occurrence patterns. However, current CPMs do not model the effects of genetic events on the observation of the tumor itself; they assume that observation occurs independently of all genetic events. Since a dataset by definition contains only tumors at their moment of observation, neglecting any causal effects on this event leads to the “conditioning on a collider” bias: events that make the tumor more likely to be observed appear anti-correlated, which results in spurious suppressive effects or masks promoting effects among genetic events. Here, we extend MHNs by modeling effects from genetic progression events on the observation event, thereby correcting for the collider bias. We derive an efficient tensor formula for the likelihood function and learn two models on somatic mutation datasets from the MSK-IMPACT study. In colon adenocarcinoma, we find a strong effect on observation by mutations in TP53, and in lung adenocarcinoma by mutations in EGFR. Compared to classical MHNs, this explains away many spurious suppressive interactions and uncovers several promoting effects.
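
The collider bias described in the abstract can be illustrated with a toy simulation. The sketch below is not the MHN model and not the authors' method: it assumes two hypothetical, independent binary events A and B, each of which raises the probability that a tumor is observed. All names and parameter values (`p_a`, `p_b`, `base_obs`, `boost`) are made up for illustration. Keeping only the observed tumors should make A and B appear anti-correlated, even though they are independent in the full population.

```python
import math
import random


def phi(pairs):
    """Phi coefficient (Pearson correlation) of two binary variables."""
    n = len(pairs)
    pa = sum(a for a, _ in pairs) / n
    pb = sum(b for _, b in pairs) / n
    pab = sum(a and b for a, b in pairs) / n
    return (pab - pa * pb) / math.sqrt(pa * (1 - pa) * pb * (1 - pb))


def simulate(n=100_000, p_a=0.3, p_b=0.3, base_obs=0.05, boost=0.4, seed=0):
    """Return (correlation over all tumors, correlation over observed tumors).

    Assumption for this sketch: A and B occur independently, and each one
    adds `boost` to the probability that the tumor is observed.
    """
    rng = random.Random(seed)
    all_tumors, observed = [], []
    for _ in range(n):
        a = rng.random() < p_a
        b = rng.random() < p_b
        p_obs = base_obs + boost * a + boost * b  # observation is the collider
        all_tumors.append((a, b))
        if rng.random() < p_obs:
            observed.append((a, b))  # a dataset contains only observed tumors
    return phi(all_tumors), phi(observed)


if __name__ == "__main__":
    corr_all, corr_obs = simulate()
    print(f"correlation, all tumors:      {corr_all:+.3f}")
    print(f"correlation, observed tumors: {corr_obs:+.3f}")
```

In this toy setting the correlation over all tumors stays close to zero, while the correlation over observed tumors is clearly negative. A model that ignores the observation mechanism would read that negative association as a suppressive interaction between A and B.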

Publication status

published

Publisher

Cold Spring Harbor Laboratory

Subject

Cancer progression model; Selection bias; Collider bias

Organisational unit

03790 - Beerenwinkel, Niko / Beerenwinkel, Niko

Funding

179518 - Using single-cell sequencing data to analyse tumour evolution (SNF)

Related publications and datasets

Is supplemented by: https://github.com/cbg-ethz/ObservationMHN