The Data Feed changefiles show all the changes in the Unpaywall database over time. They are provided for subscribers to the Unpaywall Data Feed. Files use the same schema as the REST API and database snapshot. This list is also available via a JSON endpoint for programmatic access.
Each changefile has a timestamp in its filename that tells you the most recent update it contains.
You can use the snapshot and changefiles together to keep your copy up to date by following these steps:
Content-Disposition
header if you download it with a utility like curl
:
Content-Disposition: attachment; filename="unpaywall_snapshot_2021-08-04T083001.jsonl.gz"
GET api.unpaywall.org/feed/changefiles?api_key=YOUR_API_KEY&interval=INTERVAL
Description | Provides a JSON object containing a list of changefile attributes and URLs. |
Accepts |
The difference in overlap behavior is an implementation detail that shouldn't affect your import process; each row in a changefile should overwrite the corresponding record in the dataset regardless. |
Returns | An object containing an array of changefiles: { "list": [ { "date": string (YYYY-MM-DD), "filename": string, "filetype": string ("jsonl", or "csv"), "last_modified": string (YYYY-MM-DDThh:mm:ss) "lines": integer, "size": integer, "url": string }, … ] } |
Example | https://api.unpaywall.org/feed/changefiles?api_key=YOUR_API_KEY&interval=day |