Open access
Date
2023-12
Type
Conference Paper
ETH Bibliography
yes
Abstract
This work investigates the computational expressivity of language models (LMs) based on recurrent neural networks (RNNs). Siegelmann and Sontag (1992) famously showed that RNNs with rational weights and hidden states and unbounded computation time are Turing complete. However, LMs define weightings over strings in addition to just (unweighted) language membership, and the analysis of the computational power of RNN LMs (RLMs) should reflect this. We extend the Turing completeness result to the probabilistic case, showing how a rationally weighted RLM with unbounded computation time can simulate any deterministic probabilistic Turing machine (PTM) with rationally weighted transitions. Since, in practice, RLMs work in real time, processing a symbol at every time step, we treat the above result as an upper bound on the expressivity of RLMs. We also provide a lower bound by showing that under the restriction to real-time computation, such models can simulate deterministic real-time rational PTMs.
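The unbounded-memory trick behind Siegelmann and Sontag-style constructions is that a single rational-valued hidden unit can encode an entire binary stack, so a fixed-size network with rational weights gains Turing-machine-like storage. The Python sketch below illustrates that encoding with exact rational arithmetic; it is an illustrative toy under our own naming, not the paper's PTM simulation (see the supplementary repository linked below for the actual construction).

    from fractions import Fraction

    # Encode a binary stack in one rational number in [0, 1):
    # each push maps the stack s to (s + 2b + 1) / 4, i.e. base-4
    # digits 1 (for bit 0) and 3 (for bit 1). The offset keeps
    # encodings of distinct stacks separated, so the top bit is
    # recoverable with a single threshold at 1/2.

    EMPTY = Fraction(0)

    def push(s: Fraction, bit: int) -> Fraction:
        # s' = (s + 2*bit + 1) / 4
        return (s + 2 * bit + 1) / 4

    def top(s: Fraction) -> int:
        # Top bit 1 puts s in [3/4, 1); top bit 0 puts s in [1/4, 1/2).
        return 1 if s >= Fraction(1, 2) else 0

    def pop(s: Fraction) -> Fraction:
        # Invert the push: s = 4*s' - 2*bit - 1.
        return 4 * s - 2 * top(s) - 1

    s = EMPTY
    for b in [1, 0, 1]:
        s = push(s, b)
    assert top(s) == 1      # last pushed bit
    s = pop(s)
    assert top(s) == 0      # bit beneath it

Because push, top, and pop are affine maps and threshold comparisons over rationals, they can be realized by rationally weighted RNN updates, which is the sense in which rational weights and unbounded computation time yield the expressivity results discussed in the abstract.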
Permanent link
https://doi.org/10.3929/ethz-b-000650677
Publication status
published
Book title
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Publisher
Association for Computational Linguistics
Organisational unit
09682 - Cotterell, Ryan
02219 - ETH AI Center
Related publications and datasets
Is supplemented by: https://github.com/rycolab/rnn-turing-completeness