Self-supervised Disentanglement of Modality-specific and Shared Factors Improves Multimodal Generative Models
Metadata only
Date
2020Type
- Conference Paper
ETH Bibliography
yes
Altmetrics
Abstract
Multimodal generative models learn a joint distribution over multiple modalities and thus have the potential to learn richer representations than unimodal models. However, current approaches are either inefficient in dealing with more than two modalities or fail to capture both modality-specific and shared variations. We introduce a new multimodal generative model that integrates both modality-specific and shared factors and aggregates shared information across any subset of modalities efficiently. Our method partitions the latent space into disjoint subspaces for modality-specific and shared factors and learns to disentangle these in a purely self-supervised manner. Empirically, we show improvements in representation learning and generative performance compared to previous methods and showcase the disentanglement capabilities. Show more
Publication status
publishedExternal links
Book title
Pattern Recognition. 42nd DAGM German Conference, DAGM GCPR 2020, Tübingen, Germany, September 28 – October 1, 2020, ProceedingsVolume
Pages / Article No.
Publisher
SpringerEvent
Organisational unit
09670 - Vogt, Julia / Vogt, Julia
Funding
188466 - Machine Learning Methods for Clinical Data Analysis and Precision Medicine (SNF)
Related publications and datasets
Is cited by: https://doi.org/10.3929/ethz-b-000634822
Notes
Conference lecture held on October 1, 2020. Due to the Coronavirus (COVID-19) the conference was conducted virtually.More
Show all metadata
ETH Bibliography
yes
Altmetrics