dc.contributor.author
de Rassenfosse, Gaétan
dc.contributor.author
Seliger, Florian
dc.date.accessioned
2021-02-03T13:19:33Z
dc.date.available
2021-01-17T09:15:51Z
dc.date.available
2021-02-03T13:19:33Z
dc.date.issued
2021-02
dc.identifier.issn
2352-3409
dc.identifier.other
10.1016/j.dib.2020.106615
en_US
dc.identifier.uri
http://hdl.handle.net/20.500.11850/463084
dc.identifier.doi
10.3929/ethz-b-000463084
dc.description.abstract
We present a general method for imputing missing information in the Worldwide Patent Statistical Database (PATSTAT) and make the resulting datasets publicly available. The PATSTAT database is the de facto standard for academic research using patent data. Complete information on patents is essential to obtain an accurate picture of technological activities across countries and over time. However, the coverage of the database is far from complete. Our data imputation method exploits detailed institutional knowledge about the international patent system, and we codify it in a SQL algorithm. We provide two datasets related to the imputation of missing country codes and missing technology classifications. We also release the algorithm, which can be easily adapted to impute other pieces of information that are missing in PATSTAT.
en_US
dc.format
application/pdf
en_US
dc.language.iso
en
en_US
dc.publisher
Elsevier
en_US
dc.rights.uri
http://creativecommons.org/licenses/by/4.0/
dc.subject
Missing data
en_US
dc.subject
Patents
en_US
dc.subject
PATSTAT
en_US
dc.subject
Imputation
en_US
dc.subject
PostgreSQL
en_US
dc.title
Imputation of missing information in worldwide patent data
en_US
dc.type
Journal Article
dc.rights.license
Creative Commons Attribution 4.0 International
dc.date.published
2020-12-05
ethz.journal.title
Data in Brief
ethz.journal.volume
34
en_US
ethz.pages.start
106615
en_US
ethz.size
9 p.
en_US
ethz.version.deposit
publishedVersion
en_US
ethz.identifier.wos
ethz.publication.place
Amsterdam
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02120 - Dep. Management, Technologie und Ökon. / Dep. of Management, Technology, and Ec.::02525 - KOF Konjunkturforschungsstelle / KOF Swiss Economic Institute
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02120 - Dep. Management, Technologie und Ökon. / Dep. of Management, Technology, and Ec.::02525 - KOF Konjunkturforschungsstelle / KOF Swiss Economic Institute::06333 - KOF FB Innovationsökonomik / KOF Innovation Economics
en_US
ethz.leitzahl.certified
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02120 - Dep. Management, Technologie und Ökon. / Dep. of Management, Technology, and Ec.::02525 - KOF Konjunkturforschungsstelle / KOF Swiss Economic Institute::06333 - KOF FB Innovationsökonomik / KOF Innovation Economics
en_US
ethz.leitzahl.certified
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02120 - Dep. Management, Technologie und Ökon. / Dep. of Management, Technology, and Ec.::02525 - KOF Konjunkturforschungsstelle / KOF Swiss Economic Institute
en_US
ethz.date.deposited
2021-01-17T09:15:58Z
ethz.source
FORM
ethz.eth
yes
en_US
ethz.availability
Open access
en_US
ethz.rosetta.installDate
2021-02-03T13:19:43Z
ethz.rosetta.lastUpdated
2022-03-29T05:04:17Z
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=Imputation%20of%20missing%20information%20in%20worldwide%20patent%20data&rft.jtitle=Data%20in%20Brief&rft.date=2021-02&rft.volume=34&rft.spage=106615&rft.issn=2352-3409&rft.au=de%20Rassenfosse,%20Ga%C3%A9tan&rft.au=Seliger,%20Florian&rft.genre=article&rft_id=info:doi/10.1016/j.dib.2020.106615&