A Globally Convergent Algorithm for Neural Network Parameter Optimization Based on Difference-of-Convex Functions

Tschernutter, Daniel; Kraus, Mathias; Feuerriegel, Stefan

doi:10.3929/ethz-b-000652380

Download

Full text (published version) (PDF, 7.353Mb)

Open access

Author

Tschernutter, Daniel

Kraus, Mathias

Feuerriegel, Stefan

Date

2024-01

Type

Journal Article

ETH Bibliography

yes

Altmetrics

Download

Full text (published version) (PDF, 7.353Mb)

Rights / license

Creative Commons Attribution 4.0 International

Abstract

We propose an algorithm for optimizing the parameters of single hidden layer neural networks. Specifically, we derive a blockwise difference-of-convex (DC) functions representation of the objective function. Based on the latter, we propose a block coordinate descent (BCD) approach that we combine w Show more

Permanent link

https://doi.org/10.3929/ethz-b-000652380

Publication status

published

External links

https://openreview.net/forum?id=EDqCY6ihbr

Journal / series

Transactions on Machine Learning Research

Volume

2024(1)

Publisher

OpenReview

Organisational unit

09623 - Feuerriegel, Stefan (ehemalig) / Feuerriegel, Stefan (former)

More

Show all metadata

ETH Bibliography

yes

Altmetrics

Research Collection

Search

A Globally Convergent Algorithm for Neural Network Parameter Optimization Based on Difference-of-Convex Functions Mendeley CSV RIS BibTeX

A Globally Convergent Algorithm for Neural Network Parameter Optimization Based on Difference-of-Convex Functions

Mendeley

CSV

RIS

BibTeX