Towards understanding policy design through text-as-data approaches: The policy design annotations (POLIANNA) dataset
OPEN ACCESS
Author / Producer
Date
2023-12-13
Publication Type
Journal Article
ETH Bibliography
yes
Citations
Altmetric
OPEN ACCESS
Data
Rights / License
Abstract
Despite the importance of ambitious policy action for addressing climate change, large and systematic assessments of public policies and their design are lacking as analysing text manually is labour-intensive and costly. POLIANNA is a dataset of policy texts from the European Union (EU) that are annotated based on theoretical concepts of policy design, which can be used to develop supervised machine learning approaches for scaling policy analysis. The dataset consists of 20,577 annotated spans, drawn from 18 EU climate change mitigation and renewable energy policies. We developed a novel coding scheme translating existing taxonomies of policy design elements to a method for annotating text spans that consist of one or several words. Here, we provide the coding scheme, a description of the annotated corpus, and an analysis of inter-annotator agreement, and discuss potential applications. As understanding policy texts is still difficult for current text-processing algorithms, we envision this database to be used for building tools that help with manual coding of policy texts by automatically proposing paragraphs containing relevant information.
Permanent link
Publication status
published
External links
Editor
Book title
Journal / series
Volume
10 (1)
Pages / Article No.
896
Publisher
Nature
Event
Edition / version
Methods
Software
Geographic location
Date collected
Date created
Subject
Organisational unit
09550 - Schmidt, Tobias / Schmidt, Tobias
Notes
Funding
190936 - Uncovering policy designs: A training dataset for future automated text analysis (SNF)