Machine Learning meets Exoplanet Science: Methodical Contributions to Direct Imaging and Atmospheric Retrieval


Loading...

Author / Producer

Date

2025

Publication Type

Doctoral Thesis

ETH Bibliography

yes

Citations

Altmetric

Data

Abstract

Over the past thirty years, exoplanet science—that is, the study of planets beyond our Solar System—has become one of the most thriving and dynamic subfields of astronomy. At the time of this writing, close to 6000 extrasolar planets have been discovered through various methods, and measurements from groundbreaking instruments such as the James Webb Space Telescope (JWST) allow us to study their properties in unprecedented detail. Complementing these hardware advances, there has recently been an increased interest in methods for processing observational data, especially through the use of machine learning (ML). This should not come as a surprise, considering the success that ML has had in other domains, and given that both the detection and characterization of exoplanets are fundamentally challenging inference problems which require the extraction of information from complex, noisy data that push traditional analysis techniques to their limits. In this thesis, we present three contributions to this young research field at the intersection of exoplanet science and ML, which trace an arc from the detection of extrasolar planets with the help of ML to the characterization of their atmospheres. The first study addresses the problem of post-processing data from high-contrast imaging. We show how we can combine physical domain knowledge about the data with techniques from the field of causal inference to learn pixel-wise models for the systematic noise that allow us to denoise the data and thus reveal previously unseen companions. We demonstrate the applicability of our approach on four publicly available datasets from the VLT/NACO instrument. A particular innovation of our approach is the explicit incorporation of the external observing conditions, which in our experiments improves the denoising performance. In the second study, we turn to the problem of atmospheric retrieval; that is, the inference of parameters such as the chemical composition from an observed exoplanet spectrum. We show that we can use neural networks to replace a key component in the standard Bayesian inference pipeline; namely the parameterization of the thermal structure. This reduces the number of parameters needed to describe an atmosphere, thus speeding up retrievals or freeing up computational resources for other parameters of interest. In addition, it effectively allows performing atmospheric retrieval with pressure–temperature profiles from self-consistent atmospheric models, which are usually too computationally expensive for Bayesian parameter inference. Finally, in the third contribution, we completely replace the traditional atmospheric characterization workflow using stochastic samplers with a simulation-based inference approach based on continuous normalizing flows. We combine this approach with importance sampling to ensure the reliability of our results and show that we can learn models that amortize over different assumptions for the noise in the data, thus boosting the practical applicability of our method. We demonstrate this practical applicability and validate it against traditional alternatives through extensive experiments on simulated emission spectra of a gas giant-type exoplanet.

Publication status

published

Editor

Contributors

Examiner : Schölkopf, Bernhard
Examiner : Waldmann, Ingo
Examiner : Quanz, Sascha Patrick

Book title

Journal / series

Volume

Pages / Article No.

Publisher

ETH Zurich

Event

Edition / version

Methods

Software

Geographic location

Date collected

Date created

Subject

Machine Learning; Exoplanets; High-Contrast Imaging; Half-Sibling Regression; Atmospheric Retrieval; Flow Matching

Organisational unit

09664 - Schölkopf, Bernhard / Schölkopf, Bernhard

Notes

Funding

Related publications and datasets