Online learning of long-range dependencies


METADATA ONLY
Loading...

Date

2024-07

Publication Type

Conference Paper

ETH Bibliography

yes

Citations

Altmetric
METADATA ONLY

Data

Rights / License

Abstract

Online learning holds the promise of enabling efficient long-term credit assignment in recurrent neural networks. However, current algorithms fall short of offline backpropagation by either not being scalable or failing to learn long-range dependencies. Here we present a high-performance online learning algorithm that merely doubles the memory and computational requirements of a single inference pass. We achieve this by leveraging independent recurrent modules in multi-layer networks, an architectural motif that has recently been shown to be particularly powerful. Experiments on synthetic memory problems and on the challenging long-range arena benchmark suite reveal that our algorithm performs competitively, establishing a new standard for what can be achieved through online learning. This ability to learn long-range dependencies offers a new perspective on learning in the brain and opens a promising avenue in neuromorphic computing.

Permanent link

Publication status

published

Book title

Advances in Neural Information Processing Systems 36

Journal / series

Volume

Pages / Article No.

10477 - 10493

Publisher

Curran

Event

37th Annual Conference on Neural Information Processing Systems (NeurIPS 2023)

Edition / version

Methods

Software

Geographic location

Date collected

Date created

Subject

Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); FOS: Computer and information sciences

Organisational unit

03672 - Steger, Angelika (emeritus) / Steger, Angelika (emeritus) check_circle

Notes

Poster presented on December 12, 2023.

Funding

Related publications and datasets