The COVID-19 pandemic has generated an enormous demand for data in India and around the world. Policymakers responding to the crisis need data on the spread of the disease, the healthcare resources at their disposal and the economic hardships being faced by the population. Epidemiologists need data to parameterize their models and estimate the dangers. Economists need data to assess the tradeoffs between policy alternatives.
DDL COVID India comprises an administrative data backbone with local estimates of health system capacity and local economic conditions. All data will be provided with consistent location identifiers (state, district, etc.) to allow for data to be easily merged and analyzed together. We'll supplement what's here with additional information like real-time COVID-19 case data and further demographic and economics data; we have many variables and data sources we've identified and targeted for inclusion, but are limited by manpower.
Moving forward, this effort will integrate three types of data in a common geographic frame: (i) baseline data on health, economic conditions, demographics, and state capacity; (ii) surveys collecting data on the rapidly evolving conditions on the ground; and (iii) real time data on cases, policy responses, etc.
For the source code, please see the GitHub repository. Please cite these data as described in the GitHub repo.
|Public Hospital Capacity
|Facilities, doctors, and beds. Sources: 2011 Population Census and DLHS-4 (2012-14).
|District, Subdistrict (PC only)
|Public and private hospital employment from 2013 Economic Census. Can estimate private system beds based on public employment:bed ratios.
|Predicted COVID-19 mortality rates
|Predictions based strictly on local age distributions, which create substantial risk differences across locations.
|Keys linking current districts to 2011 Population Census districts, which are the basis of many datasets
|Quantity of arrivals and prices in terms of maximum, minimum and mode price traded for a specific commodity.
|District-level short-term and long-term in and outmigration.
|2011 Population Census (most recent) population, density, literacy rate, urbanization
|State / District / Subdistrict / Town / Village
|Child immunisations, maternal health, hospitalisations and lab testing among others across all years of available data.
|Fourth round of the National Family Health Survey (NFHS-IV) with district level health infrastructure.
|Cumulative vaccination counts, vaccination dose type counts, and other vaccination-related data.
|Total deaths by district and state, aggregated by month and year.
While some of the input data cannot be shared in its raw format, all processing and data construction steps are reproducible. You can also directly access the data files here:
Metadata describing each component of DDL COVID India the dataset and how it was built are available in the GitHub repository:
© Development Data Lab, 2024