Show simple item record

dc.contributor.author
Bock, Christian
dc.contributor.supervisor
Borgwardt, Karsten M.
dc.contributor.supervisor
Doshi-Velez, Finale
dc.contributor.supervisor
Krishnaswamy, Smita
dc.date.accessioned
2022-01-10T08:18:42Z
dc.date.available
2022-01-08T11:00:53Z
dc.date.available
2022-01-10T08:18:42Z
dc.date.issued
2021
dc.identifier.uri
http://hdl.handle.net/20.500.11850/524042
dc.identifier.doi
10.3929/ethz-b-000524042
dc.description.abstract
The increased focus on evidence-based practice in the health sciences led to a plethora of (un)organised and digitised data. In conjunction with the availability of technological advances in the life sciences, this resulted in extraordinary access to biomedical data. Due to efficient measurement devices, the frequency at which data can be obtained is at an unprecedented high, leading to the adage that data, indeed, could be the new gold. Examples of such high-resolution time series data are the continuous monitoring of patient vital parameters or a single electrocardiogram (ECG) itself. The temporal component introduced by time series data is both a chance and a challenge, necessitating the development of appropriate data analysis techniques. A chance, as it allows us to utilise a measurement’s temporal evolution to characterise or classify the object of interest (e.g. patients, cells, or other organisms). A challenge because local and global correlation structures exacerbate obtaining a complete picture of a time-evolving phenomenon. Moreover, many dynamically changing systems exhibit alterations that occur at multiple scales and in multiple channels. This thesis presents a set of novel methods to help characterise and classify time-varying data with the express purpose of answering questions at the intersection of machine learning and healthcare. Recognising that time series arise from different categories, we first separate them into real-valued and object-valued time series and investigate both types separately. For the analysis of the first type, we propose a novel method to mine time series patterns efficiently. Driven by a statistical approach, we will introduce a way to identify temporal biomarkers and illustrate their utility in a data set of intensive care unit patients. For this, we leverage the expressive power of subsequences to obtain a high-dimensional time series representation. This feature representation is subsequently used to develop a kernel method based on optimal transport theory. The developed algorithm is of general applicability for medium-sized data sets and has proven particularly effective in the classification setting. The first part of this thesis ends with the presentation of a collaborative machine learning system to predict myocardial ischaemia from stress test ECGs. We develop a deep learning-based approach to significantly reduce the number of patients that unnecessarily undergo myocardial perfusion imaging. A subsequent interpretability analysis presents a potential path towards explainable and trustworthy artificial intelligence in cardiology. The second part of this thesis describes an effort to improve our understanding of artificial neural networks. By treating the network as a composition of time-varying graphs, we develop a method that characterises the change of its structural complexity over time. Our method captures the benefit of deep-learning best practices and can be used as an earlystopping criterion without the need for a validation data set. We thus manage to improve our understanding of artificial neural networks and shed light on the properties linked to their generalisation capabilities. Throughout this thesis, we demonstrate and highlight that in the analysis of (biomedical) time series, it is crucial to take the end-user into account. Interpretability and statistical analyses are of utter importance to make the otherwise opaque field of machine learning transparent to clinicians, physicians, and biologists. Moreover, we also hold up the mirror to ourselves as machine learning researchers: Comprehending the underlying mechanisms of our algorithms is at least as important as their empirical successes. The present thesis paves the path towards a better understanding of artificial neural networks and sheds light on complex phenotypes such as sepsis and myocardial ischaemia in clinically relevant ways.
en_US
dc.format
application/pdf
en_US
dc.language.iso
en
en_US
dc.publisher
ETH Zurich
en_US
dc.rights.uri
http://rightsstatements.org/page/InC-NC/1.0/
dc.subject
Machine learning (artificial intelligence)
en_US
dc.subject
time series analysis
en_US
dc.subject
Life sciences
en_US
dc.subject
Healthcare analytics
en_US
dc.subject
Deep Learning
en_US
dc.subject
Kernel methods
en_US
dc.subject
Topological data analysis
en_US
dc.subject
Classification
en_US
dc.subject
Hypothesis testing
en_US
dc.title
Motifs and Manifolds Statistical and Topological Machine Learning for Characterising and Classifying Biomedical Time Series
en_US
dc.type
Doctoral Thesis
dc.rights.license
In Copyright - Non-Commercial Use Permitted
dc.date.published
2022-01-10
ethz.size
187 p.
en_US
ethz.code.ddc
DDC - DDC::0 - Computer science, information & general works::004 - Data processing, computer science
en_US
ethz.code.ddc
DDC - DDC::5 - Science::570 - Life sciences
en_US
ethz.identifier.diss
27969
en_US
ethz.publication.place
Zurich
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02060 - Dep. Biosysteme / Dep. of Biosystems Science and Eng.::09486 - Borgwardt, Karsten M. (ehemalig) / Borgwardt, Karsten M. (former)
en_US
ethz.leitzahl.certified
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02060 - Dep. Biosysteme / Dep. of Biosystems Science and Eng.::09486 - Borgwardt, Karsten M. (ehemalig) / Borgwardt, Karsten M. (former)
en_US
ethz.date.deposited
2022-01-08T11:00:58Z
ethz.source
FORM
ethz.eth
yes
en_US
ethz.availability
Open access
en_US
ethz.rosetta.installDate
2022-01-10T08:18:52Z
ethz.rosetta.lastUpdated
2023-02-06T23:48:04Z
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=Motifs%20and%20Manifolds%20Statistical%20and%20Topological%20Machine%20Learning%20for%20Characterising%20and%20Classifying%20Biomedical%20Time%20Series&rft.date=2021&rft.au=Bock,%20Christian&rft.genre=unknown&rft.btitle=Motifs%20and%20Manifolds%20Statistical%20and%20Topological%20Machine%20Learning%20for%20Characterising%20and%20Classifying%20Biomedical%20Time%20Series
 Search print copy at ETH Library

Files in this item

Thumbnail

Publication type

Show simple item record