
Open access
Date
2024-08Type
- Conference Paper
ETH Bibliography
yes
Altmetrics
Abstract
A language model may be viewed as a Svalued stochastic process for some alphabet S. However, in some pathological situations, such a stochastic process may "leak" probability mass onto the set of infinite strings and hence is not equivalent to the conventional view of a language model as a distribution over ordinary (finite) strings. Such ill-behaved language processes are referred to as non-tight in the literature. In this work, we study conditions of tightness through the lens of stochastic processes. In particular, by regarding the EOS symbol as marking a stopping time and using results from martingale theory, we give characterizations of tightness that generalize our previous work (Du et al., 2023). Show more
Permanent link
https://doi.org/10.3929/ethz-b-000709073Publication status
publishedExternal links
Book title
Findings of the Association for Computational Linguistics: ACL 2024Pages / Article No.
Publisher
Association for Computational LinguisticsEvent
Organisational unit
09682 - Cotterell, Ryan / Cotterell, Ryan
More
Show all metadata
ETH Bibliography
yes
Altmetrics