Preventing Generation of Verbatim Memorization in Language Models Gives a False Sense of Privacy

Ippolito, Daphne; Tramèr, Florian; Nasr, Milad; Zhang, Chiyuan; Jagielski, Matthew; Lee, Katherine; Choquette-Choo, Christopher A; Carlini, Nicholas

doi:10.3929/ethz-b-000642948

Download

Full text (published version) (PDF, 977.9Kb)

Open access

Author

Choquette-Choo, Christopher A

Carlini, Nicholas

Show all

Date

2023

Type

Conference Paper

ETH Bibliography

yes

Altmetrics

Download

Full text (published version) (PDF, 977.9Kb)

Rights / license

Creative Commons Attribution 4.0 International

Abstract

Studying data memorization in neural language models helps us understand the risks (e.g., to privacy or copyright) associated with models regurgitating training data, and aids in the evaluation of potential countermeasures. Many prior works -- and some recently deployed defenses -- focus on "verbat Show more

Permanent link

https://doi.org/10.3929/ethz-b-000642948

Publication status

published

External links

https://aclanthology.org/2023.inlg-main.3

Editor

Keet, C. Maria

Lee, Hung-Yi

Zarrieß, Sina

Book title

Proceedings of the 16th International Natural Language Generation Conference