This repository has been archived by the owner on May 30, 2021. It is now read-only.

COVID-19 Coronavirus data scraped from government and curated data sources.


covidatlas/coronadatascraper

THIS PROJECT IS BEING REPLACED BY Li

This project is being replaced by Li, the next-generation serverless crawler for COVID-19 data.

The reasons for the switchover are documented in Issue 782.

We are not actively accepting PRs for this repository, but are still using this repo to track issues.

Scraper code written for this project is not compatible with Li; see that project for examples. We have some helpers to assist in migrating code; see this document.

Thanks very much!


coronadatascraper

A crawler that scrapes COVID-19 Coronavirus data from government and curated data sources.

This project exists to scrape, de-duplicate, and cross-check county-level data on the COVID-19 coronavirus pandemic.

Every piece of data includes GeoJSON and population data, cites the source from which the data was obtained, and includes a rating of the source's technical quality (completeness, machine readability, best practices -- not accuracy).
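As a rough illustration of the de-duplication and cross-checking described above, here is a minimal Python sketch. It is not this project's code: the `Record` class, its field names, and the `dedupe` and `disagreements` helpers are all hypothetical, GeoJSON is left out for brevity, and the rule of preferring the higher-rated source is an assumption made for the example. See the Data Fields documentation for the real field definitions.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Set, Tuple

# (country, state, county) identifies a location in this sketch.
LocationKey = Tuple[str, str, str]


@dataclass(frozen=True)
class Record:
    """One scraped data point. All field names here are hypothetical."""
    country: str
    state: str
    county: str
    cases: int
    population: int
    source_url: str  # citation: where the figure was obtained
    rating: float    # technical quality of the source (assumed 0..1), not accuracy


def dedupe(records: Iterable[Record]) -> List[Record]:
    """Keep one record per location, preferring the higher-rated source."""
    best: Dict[LocationKey, Record] = {}
    for rec in records:
        key = (rec.country, rec.state, rec.county)
        current = best.get(key)
        if current is None or rec.rating > current.rating:
            best[key] = rec
    return list(best.values())


def disagreements(records: Iterable[Record]) -> List[LocationKey]:
    """Cross-check: list locations whose sources report different case counts."""
    counts: Dict[LocationKey, Set[int]] = {}
    for rec in records:
        key = (rec.country, rec.state, rec.county)
        counts.setdefault(key, set()).add(rec.cases)
    return [key for key, values in counts.items() if len(values) > 1]
```

Given two records for the same county from different sources, `dedupe` keeps the one with the higher rating, while `disagreements` flags the county if the two case counts differ.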

Where's the data?

https://coronadatascraper.com/

How often is it updated?

We upload fresh data every day at around 9 PM PST.

How do I use this data?

Read the Data Fields documentation for details on exactly what each field in the dataset means.

How can I run the crawler locally?

Follow our Getting Started guide to get the project running on your local machine.

Contributing

NOTE: This project is being replaced by Li, the next-generation serverless crawler for COVID-19 data.

You can contribute to this project in two big ways:

Contribute to the project core

Check the Issues for tasks we need done. If you are new to open source, look for the "Good first issue" label.

Contribute a source

Contributions for any place in the world are welcome. See the community-curated list of verified data sources to find a new data source to add, and be sure to update the "Scraped?" column when you do.

To help you contribute a new source, please read the Sources and Scrapers guide before you start!

Before you send a pull request with your scraper, run the scraper using the instructions in the guide to make sure the data is valid.

License

This project is licensed under the permissive BSD 2-clause license.

The data produced by this project is public domain.

This project uses data from ISO-3166 Country and Dependent Territories Lists with UN Regional Codes under the Creative Commons Attribution-ShareAlike 4.0 International License.

Attribution

Please cite this project if you use it in your visualization or reporting.

Data obtained from Corona Data Scraper