Data
I share key data and algorithms from several papers to encourage reproducibility and follow-on research. If you cannot find what you are looking for, just contact me.
- Comprehensive dataset of patent citations
A fully open source project in which we extract, classify and enrich citations found in patent documents. Includes citations to norms and standards, wikipedia pages, databases, etc.
- The GitHub page containing all necessary replication data and algorithms. - Patents arising from U.S. government funding
- The companion paper in PLoS ONE (open access).
- The 3PFL dataset on Zenodo. - Geocoding of worldwide patent data
- The project website.
- The companion paper in Scientific Data.
- The main dataset on Harvard dataverse (+ bridge table).
- Some source code (1, 2) on Florian Seliger's GitHub.
- A nice use case by Topi Jukanov on the most innovative square kilometers in Europe. - Getting started with PATSTAT Register
- The companion paper in Australian Economic Review (open access).
- The MySQL source code on GitHub. - An introduction to the PATSTAT database with example queries
- The companion paper in Australian Economic Review (open access).
- The test database in MS Access as well as the MySQL source code - The worldwide count of priority patents: A new indicator of inventive activity
- The companion paper in Research Policy.
- Count of priority filings by country of inventor, 1980-2011 (Excel file). The data is visible in Google Public Data.
- Raw patent data with recovered country of inventor (CSV file to upload into Patstat)
(April 2011) (October 2012) (October 2013)
- MySQL source code to recover missing data (SQL file) - see also the FAQ. - On the price elasticity of demand for patents
- The companion paper in Oxford Bulletin of Economics and Statistics.
- Dataset on patent fees at the EPO, JPO and USPTO, 1980-2007 (Excel file).