
Datasets
Cumulative and individual case data:
-
Johns Hopkins data repository: https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data
-
Individual case data from Singapore: https://co.vid19.sg/cases
-
Summary of case lists/line lists for different countries: https://github.com/midas-network/COVID-19/wiki/Data-catalog#cases
-
Case numbers for different countries: https://www.worldometers.info/coronavirus/
-
Real-time fetching of data: https://github.com/datasets/covid-19
-
Another global case summary: https://coronavirus-data.com/
-
Global repository overview: https://github.com/stevenliuyi/covid19
-
Case numbers Switzerland: https://github.com/openZH/covid_19 and https://www.corona-data.ch/
-
Individual case data India: https://docs.google.com/spreadsheets/d/e/2PACX-1vSc_2y5N0I67wDU38DjDh35IZSIS30rQf7_NYZhtYYGU1jJYT6_kDx4YpF-qw0LSlGsBYP8pqM_a1Pd/pubhtml
-
Visualising JH data: https://rpruim.github.io/ds303/S20/hw/covid-19/covid-19.html
-
Case data/SIR models Brazil: http://covid19br.org/ and https://github.com/cidacslab/Mathematical-and-Statistical-Modeling-of-COVID19-in-Brazil
Data on test numbers in different countries:
-
https://ourworldindata.org/coronavirus (CSV files under “Current COVID-19 test coverage estimates”)
Please note that the number of confirmed cases also depends on the underlying sampling process (i.e., infection tests). You may decide to incorporate the “tests per capita” in your models to get better predictions and statistics of estimates. For more information on unreported cases, see Li, Ruiyun, et al. "Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (COVID-19)." Science (2020).
Please also note that behavioral changes (e.g., no handshakes, social distancing, quarantine, etc.) influence the spreading parameters and outbreak dynamics.
Local interactions, social networks, and travel activities also have a large influence on the resulting spreading dynamics (see e.g., https://www.nytimes.com/interactive/2020/03/22/world/coronavirus-spread.html).
Mobility data:
-
Mobility data Italy: https://github.com/pcm-dpc/COVID-19
-
Mobility estimates for different regions: https://citymapper.com/CMI
-
Mobility data Switzerland: https://statistik.zh.ch/internet/justiz_inneres/statistik/de/aktuell/mitteilungen/2020/covid_mobilitaetsverhalten.html
-
Additional mobility data Switzerland: https://ivtmobis.ethz.ch/mobis/covid19/reports/mobis_covid19_report_2020-06-04.html
It might be important to account for spatial and surface-contact effects (see van Doremalen, Neeltje, et al. "Aerosol and surface stability of HCoV-19 (SARS-CoV-2) compared to SARS-CoV-1." medRxiv (2020)). Mean-field models and representative-agent models may lead to inaccuracies.
Kaggle data:
We also recommend UCLA's COVID-19 dataset clearinghouse as an excellent summary of various datasets and literature.
Excess mortality data:
-
Historical and current mortality data Italy: https://www.istat.it/it/archivio/240401
-
Mortality monitor Spain: https://momo.isciii.es/public/momo/dashboard/momo_dashboard.html\#datos
-
Historical and current mortality data Switzerland: https://www.bfs.admin.ch/bfs/de/home/statistiken/gesundheit/gesundheitszustand/sterblichkeit-todesursachen.html
-
Death statistics England/Wales: https://www.ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/deaths/datasets/weeklyprovisionalfiguresondeathsregisteredinenglandandwales
-
National Center for Health Statistics Mortality Surveillance System: https://gis.cdc.gov/grasp/fluview/mortality.html
Government response data:
1. https://covidtracker.bsg.ox.ac.uk