Suche
Ergebnisse
-
Telling BERT's Full Story: from Local Attention to Global Aggregation
(2021)Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main VolumeWe take a deep look into the behaviour of self-attention heads in the transformer architecture. In light of recent work discouraging the use of attention distributions for explaining a model’s behaviour, we show that attention distributions can nevertheless provide insights into the local behaviour of attention heads. This way, we propose a distinction between local patterns revealed by attention and global patterns that refer back to the ...Conference Paper -
MIDI-VAE: Modeling Dynamics and Instrumentation of Music with Applications to Style Transfer
(2018)Proceedings of the 19th International Society for Music Information Retrieval Conference (ISMIR 2018)We introduce MIDI-VAE, a neural network model basedon Variational Autoencoders that is capable of handlingpolyphonic music with multiple instrument tracks, as wellas modeling the dynamics of music by incorporating notedurations and velocities. We show that MIDI-VAE can per-form style transfer on symbolic music by automaticallychanging pitches, dynamics and instruments of a musicpiece from, e.g., a Classical to a ...Conference Paper