chrism0dwk/covid19uk

covid19uk: Bayesian stochastic spatial modelling for COVID-19 in the UK

This Python package implements a spatial stochastic SEIR model for COVID-19 in the UK, using Local Authority District level positive test data, population data, and mobility information. Details of the model implementation may be found in doc/lancs_space_model_concept.pdf.
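As a rough illustration of what "spatial stochastic SEIR" means, the sketch below simulates a discrete-time chain-binomial SEIR process over a handful of regions. It is not the package's implementation, which is described in the concept document. The function names, the `mixing` matrix, and the parameters `beta`, `sigma`, and `gamma` are assumptions made for this example only.

```python
import math
import random


def binomial(rng, n, p):
    """Draw from Binomial(n, p) by summing n Bernoulli trials."""
    return sum(1 for _ in range(n) if rng.random() < p)


def seir_step(state, mixing, beta, sigma, gamma, dt, rng):
    """Advance every region by one time step of length dt.

    state  : list of [S, E, I, R] counts, one entry per region
    mixing : mixing[i][j] is a hypothetical weight with which
             infectious people in region j act on region i
    """
    totals = [sum(region) for region in state]
    prevalence = [region[2] / n if n > 0 else 0.0
                  for region, n in zip(state, totals)]
    new_state = []
    for i, (s, e, inf, r) in enumerate(state):
        # Force of infection on region i, summed over all regions j
        # (mixing[i][i] gives the within-region contribution).
        force = beta * sum(mixing[i][j] * prevalence[j]
                           for j in range(len(state)))
        # Convert rates to per-step transition probabilities.
        p_se = 1.0 - math.exp(-force * dt)
        p_ei = 1.0 - math.exp(-sigma * dt)
        p_ir = 1.0 - math.exp(-gamma * dt)
        # Sample the number of individuals making each transition.
        n_se = binomial(rng, s, p_se)
        n_ei = binomial(rng, e, p_ei)
        n_ir = binomial(rng, inf, p_ir)
        new_state.append(
            [s - n_se, e + n_se - n_ei, inf + n_ei - n_ir, r + n_ir]
        )
    return new_state


def simulate(state, mixing, beta, sigma, gamma, dt, n_steps, seed=0):
    """Return the list of states visited, including the initial one."""
    rng = random.Random(seed)
    trajectory = [state]
    for _ in range(n_steps):
        state = seir_step(state, mixing, beta, sigma, gamma, dt, rng)
        trajectory.append(state)
    return trajectory
```

Calling `simulate([[990, 0, 10, 0], [500, 0, 0, 0]], [[1.0, 0.1], [0.1, 1.0]], beta=0.5, sigma=0.25, gamma=0.2, dt=1.0, n_steps=30)` would seed an outbreak in the first of two made-up regions and let the off-diagonal mixing weights carry it into the second.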

Workflow

This repository contains code that produces Monte Carlo samples of the Bayesian posterior distribution given the model and case timeseries data from coronavirus.data.gov.uk, implementing an ETL step, the model itself, and associated inference and prediction steps.

Users requiring an end-to-end pipeline implementation should refer to the covid-pipeline repository.

For development users, the recommended package management system is poetry. Follow the instructions in the poetry documentation to install it.

From a bash-like command line, clone the covid19uk repo and install its dependencies:

git clone <path to this repo>
cd covid19uk
poetry install

To run the various algorithms, a general configuration file must be specified. The file example_config.yaml provides an example: it runs the model on a month's worth of publicly available COVID-19 case data from the 11 Northern Irish Local Authority Districts. The configuration file specifies the location of the raw data, which is assembled into a NetCDF4 file:

mkdir results
poetry run python -m covid19uk.data.assemble example_config.yaml results/inferencedata.nc

The inference algorithm may then be run on the assembled data:

poetry run python -m covid19uk.inference.inference \
    -c example_config.yaml \
    -o results/posterior.hd5 \
    results/inferencedata.nc

The resulting HDF5 file results/posterior.hd5 contains the posterior samples.
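The assemble and inference steps can also be driven from a short Python script, which may be convenient when rerunning them with different configuration files. The sketch below only wraps the two module invocations shown in this section. The helper names `build_commands` and `run_pipeline` are hypothetical and not part of the package, and the sketch assumes the assembled data is read from the same `results/inferencedata.nc` path that the assemble step writes.

```python
import subprocess
from pathlib import Path


def build_commands(config="example_config.yaml", results_dir="results"):
    """Return the assemble and inference commands as argument lists."""
    results = Path(results_dir)
    data_file = results / "inferencedata.nc"
    posterior_file = results / "posterior.hd5"
    prefix = ["poetry", "run", "python", "-m"]
    # Step 1: assemble the raw data into a NetCDF4 file.
    assemble = prefix + ["covid19uk.data.assemble", config, str(data_file)]
    # Step 2: run inference on the assembled data.
    inference = prefix + [
        "covid19uk.inference.inference",
        "-c", config,
        "-o", str(posterior_file),
        str(data_file),
    ]
    return [assemble, inference]


def run_pipeline(config="example_config.yaml", results_dir="results"):
    """Create the results directory, then run both steps in order."""
    Path(results_dir).mkdir(parents=True, exist_ok=True)
    for command in build_commands(config, results_dir):
        # check=True stops the pipeline if a step exits with an error.
        subprocess.run(command, check=True)
```

Calling `run_pipeline()` from the repository root would then be equivalent to running the shell commands above one after the other.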

COVID-19 Lancaster University data statement

Data contained in the data directory is all publicly available from UK government agencies or previous studies. No personally identifiable information is stored.

ONS: Office for National Statistics

PHE: Public Health England

UTLA: Upper Tier Local Authority

LAD: Local Authority District

Example data files

  • data/c2019modagepop.csv: local authority population data for the UK, taken from the ONS prediction for December 2019. The local authorities [City of Westminster, City of London] and [Cornwall, Isles of Scilly] have been aggregated to meet commute data processing requirements.
  • data/mergedflows.csv: an inter-local-authority mobility matrix, taken from UK Census 2011 commuting data and aggregated up from Middle Super Output Area level (respecting the aggregated LADs as above).
  • data/UK2019mod_pop.gpkg: a GeoPackage containing UK Local Authority District (2019) polygons together with population and areal metrics.
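
To illustrate the kind of aggregation described for the mobility matrix, the sketch below sums origin-destination flows from fine zones up to coarser areas using a lookup table. It is not the code used to produce data/mergedflows.csv. The function name, the data layout, and the zone codes in the usage note are made up for this example.

```python
from collections import defaultdict


def aggregate_flows(flows, zone_to_area):
    """Sum origin-destination flows from fine zones up to coarser areas.

    flows        : dict mapping (origin_zone, destination_zone) -> count
    zone_to_area : dict mapping each zone code to its parent area code
    """
    aggregated = defaultdict(int)
    for (origin, destination), count in flows.items():
        # Relabel both ends of the flow with their parent area.
        key = (zone_to_area[origin], zone_to_area[destination])
        aggregated[key] += count
    return dict(aggregated)
```

With the hypothetical lookup `{"a1": "A", "a2": "A", "b1": "B"}`, flows from `a1` and `a2` to `b1` are added into a single `("A", "B")` entry. Merged districts can be handled the same way, by mapping the zones of both districts to one shared area code.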