Control, Generate, Augment: A Scalable Framework for Multi-Attribute Text Generation

Open access
Date
2020-11Type
- Conference Paper
ETH Bibliography
yes
Altmetrics
Abstract
We introduce CGA, a conditional VAE architecture, to control, generate, and augment text. CGA is able to generate natural English sentences controlling multiple semantic and syntactic attributes by combining adversarial learning with a context-aware loss and a cyclical word dropout routine. We demonstrate the value of the individual model components in an ablation study. The scalability of our approach is ensured through a single discriminator, independently of the number of attributes. We show high quality, diversity and attribute control in the generated sentences through a series of automatic and human assessments. As the main application of our work, we test the potential of this new NLG model in a data augmentation scenario. In a downstream NLP task, the sentences generated by our CGA model show significant improvements over a strong baseline, and a classification performance often comparable to adding same amount of additional real data. Show more
Permanent link
https://doi.org/10.3929/ethz-b-000464810Publication status
publishedExternal links
Book title
Findings of the Association for Computational Linguistics: EMNLP 2020Pages / Article No.
Publisher
Association for Computational LinguisticsEvent
Organisational unit
09588 - Zhang, Ce / Zhang, Ce
More
Show all metadata
ETH Bibliography
yes
Altmetrics