Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora - Research Collection

Download

Full text (published version) (PDF, 487.5Kb)

Open access

Autor(in)

Choshen, Leshem

Zhuang, Chengxu

Mosquera, Rafael

Paranjabe, Bhargavi

Williams, Adina

Cotterell, Ryan

Datum

2023-12

Typ

Conference Paper

ETH Bibliographie

yes

Altmetrics

Download

Full text (published version) (PDF, 487.5Kb)

Rechte / Lizenz

Creative Commons Attribution 4.0 International

Abstract

Children can acquire language from less than 100 million words of input. Large language models are far less data-efficient: they typically require 3 or 4 orders of magnitude more data and still do not perform as well as humans on many evaluations. These intensive resource demands limit the ability Mehr anzeigen

Persistenter Link

https://doi.org/10.3929/ethz-b-000650680

Publikationsstatus

published

Externe Links

https://doi.org/10.18653/v1/2023.conll-babylm.1

Herausgeber(in)

Choshen, Leshem

Buchtitel

Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning

Seiten / Artikelnummer

1 - 34

Verlag

Association for Computational Linguistics

Konferenz

BabyLM Challenge at the 27th Conference on Computational Natural Language Learning (CoNLL 2023), Singapore, December 6, 2023

Organisationseinheit

09682 - Cotterell, Ryan / Cotterell, Ryan

Zugehörige Publikationen und Daten

Is part of: http://hdl.handle.net/20.500.11850/651188

Mehr

Alle Metadaten anzeigen

ETH Bibliographie

yes

Altmetrics