Open access
Date
2024-05-06
Type
Journal Article
ETH Bibliography
yes
Abstract
This paper studies the infinite-width limit of deep linear neural networks (NNs) initialized with random parameters. We obtain that, when the number of parameters diverges, the training dynamics converge (in a precise sense) to the dynamics obtained from a gradient descent on an infinitely wide deterministic linear NN. Moreover, even if the weights remain random, we get their precise law along the training dynamics, and prove a quantitative convergence result of the linear predictor in terms of the number of parameters. We finally study the continuous-time limit obtained for infinitely wide linear NNs and show that the linear predictors of the NN converge at an exponential rate to the minimal $\ell_2$-norm minimizer of the risk.
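
As a rough numerical illustration of the implicit bias described in the abstract (and not the paper's infinite-width construction), the sketch below runs gradient descent on a finite-width two-layer linear network for an underdetermined least-squares problem, starting from a small random initialization. The problem sizes, initialization scale, learning rate, and step count are arbitrary choices made for this toy example; with a small initialization the learned end-to-end predictor typically lands close to the minimal $\ell_2$-norm interpolator.

```python
# Toy sketch (assumed setup, not the paper's construction): gradient descent
# on a two-layer linear network f(x) = x @ (W1 @ w2) for an underdetermined
# least-squares problem, compared against the minimal l2-norm interpolator.
import numpy as np

rng = np.random.default_rng(0)

n, d = 10, 30                        # fewer samples than features
X = rng.standard_normal((n, d))
y = rng.standard_normal(n)

w_star = np.linalg.pinv(X) @ y       # minimal l2-norm interpolator (reference)

scale = 1e-2                         # small random initialization
W1 = scale * rng.standard_normal((d, d))
w2 = scale * rng.standard_normal(d)

lr = 0.01
for _ in range(10_000):
    w_eff = W1 @ w2                  # end-to-end linear predictor
    g = X.T @ (X @ w_eff - y) / n    # gradient of the risk w.r.t. w_eff
    # Chain rule through the product W1 @ w2.
    grad_W1 = np.outer(g, w2)
    grad_w2 = W1.T @ g
    W1 -= lr * grad_W1
    w2 -= lr * grad_w2

w_eff = W1 @ w2
print("training risk:", np.mean((X @ w_eff - y) ** 2))
print("relative distance to min-norm solution:",
      np.linalg.norm(w_eff - w_star) / np.linalg.norm(w_star))
```

This toy uses finite width and discrete-time gradient descent, so it only suggests the behavior of the continuous-time, infinite-width limit analyzed in the paper; the exponential convergence rate stated in the abstract is proved in that limiting regime.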
Permanent link
https://doi.org/10.3929/ethz-b-000673033
Publication status
published
Journal / series
Communications on Pure and Applied Mathematics
Publisher
Wiley