Show simple item record

dc.contributor.author
Hoffer, Elad
dc.contributor.author
Ben-Nun, Tal
dc.contributor.author
Hubara, Itay
dc.contributor.author
Giladi, Niv
dc.contributor.author
Hoefler, Torsten
dc.contributor.author
Soudry, Daniel
dc.date.accessioned
2021-07-21T11:33:46Z
dc.date.available
2021-01-14T12:18:41Z
dc.date.available
2021-01-19T09:04:09Z
dc.date.available
2021-07-21T11:33:46Z
dc.date.issued
2020
dc.identifier.isbn
978-1-7281-7168-5
en_US
dc.identifier.isbn
978-1-7281-7169-2
en_US
dc.identifier.other
10.1109/CVPR42600.2020.00815
en_US
dc.identifier.uri
http://hdl.handle.net/20.500.11850/462535
dc.identifier.doi
10.3929/ethz-b-000462535
dc.description.abstract
Large-batch SGD is important for scaling training of deep neural networks. However, without fine-tuning hyperparameter schedules, the generalization of the model may be hampered. We propose to use batch augmentation: replicating instances of samples within the same batch with different data augmentations. Batch augmentation acts as a regularizer and an accelerator, increasing both generalization and performance scaling for a fixed budget of optimization steps. We analyze the effect of batch augmentation on gradient variance and show that it empirically improves convergence for a wide variety of networks and datasets. Our results show that batch augmentation reduces the number of necessary SGD updates to achieve the same accuracy as the state-of-the-art. Overall, this simple yet effective method enables faster training and better generalization by allowing more computational resources to be used concurrently.
en_US
dc.format
application/pdf
en_US
dc.language.iso
en
en_US
dc.publisher
IEEE
en_US
dc.rights.uri
http://rightsstatements.org/page/InC-NC/1.0/
dc.title
Augment Your Batch: Improving Generalization Through Instance Repetition
en_US
dc.type
Conference Paper
dc.rights.license
In Copyright - Non-Commercial Use Permitted
dc.date.published
2020-08-05
ethz.book.title
2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
en_US
ethz.pages.start
8129
en_US
ethz.pages.end
8138
en_US
ethz.size
10 p.
en_US
ethz.version.deposit
publishedVersion
en_US
ethz.event
2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020) (virtual)
en_US
ethz.event.location
Seattle, WA, USA
en_US
ethz.event.date
June 13-19, 2020
en_US
ethz.notes
Due to the Coronavirus (COVID-19) the conference was conducted virtually.
en_US
ethz.publication.place
Piscataway, NJ
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02666 - Institut für Hochleistungsrechnersysteme / Inst. f. High Performance Computing Syst::03950 - Hoefler, Torsten / Hoefler, Torsten
en_US
ethz.leitzahl.certified
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02666 - Institut für Hochleistungsrechnersysteme / Inst. f. High Performance Computing Syst::03950 - Hoefler, Torsten / Hoefler, Torsten
en_US
ethz.date.deposited
2021-01-14T12:18:48Z
ethz.source
FORM
ethz.eth
yes
en_US
ethz.availability
Open access
en_US
ethz.rosetta.installDate
2021-01-19T09:04:17Z
ethz.rosetta.lastUpdated
2022-03-29T10:34:52Z
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=Augment%20Your%20Batch:%20Improving%20Generalization%20Through%20Instance%20Repetition&rft.date=2020&rft.spage=8129&rft.epage=8138&rft.au=Hoffer,%20Elad&Ben-Nun,%20Tal&Hubara,%20Itay&Giladi,%20Niv&Hoefler,%20Torsten&rft.isbn=978-1-7281-7168-5&978-1-7281-7169-2&rft.genre=proceeding&rft_id=info:doi/10.1109/CVPR42600.2020.00815&rft.btitle=2020%20IEEE/CVF%20Conference%20on%20Computer%20Vision%20and%20Pattern%20Recognition%20(CVPR)
 Search print copy at ETH Library

Files in this item

Thumbnail

Publication type

Show simple item record