Introduction
By late 2019 an outbreak of COVID-19, caused by SARS-Cov-2 virus, started in the region of Hubei (China), more specifically in the city of Wuhan. Since then, the disease has spread all over the world, being declared pandemic by the World Health Organization on 2020 March 11th. The rapid propagation of the virus around the world has stressed the health systems of many countries to their limit. In Europe, the most affected countries in terms of number of detected cases, hospitalizations and deaths are, to the date, Italy and Spain. This unprecedented situation for the public health systems of these countries forced decision makers to act very quickly in order to minimize the impact of the disease and to avoid collapse. In Spain, the emergency state was declared on 2020 March 14th and was hardened with mandatory home confinement except for vital sectors workers (including health professionals, food supply, etc.) on March 30th. Under this situation, it is urgent to assess the efficiency of social distancing and other non-pharmaceutical interventions undertaken to control the pandemics of COVID-19.1,2 One of the main challenges in evaluating the impact of these actions is that data are only partially available, as many of the cases are asymptomatic or with mild symptoms,3 and shortage of testing kits prevent testing all patients with possible COVID-19 symptoms. Therefore, the number of cases might be severely underestimated. This issue is common in epidemiology and several methods have been recently proposed to address it under specific circumstances.4,5 Another issue is that, when no massive viral testing of the population is performed, new cases are only detected once symptoms appear, at a variable delay after contamination. This complicates the assessment of the impact of interventions to reduce/slow down the spread of a virus.
Here, we aimed at evaluating the impact of the COVID-19 related non-pharmaceutical interventions undertaken in Spain over the basic reproduction ratio R0, considering three periods of time in 2020: no intervention (until March 13th), emergency state (March 16-30th and April 13th-15th) and mandatory confinement (March 31st to April 12th). We inferred retrospectively the number of contamination in each Spanish autonomous community from the patterns of hospitalizations, deaths and detected cases. Analyses were performed with data up to April 15th 2020.
Methods
We modelled the dynamics of infection with a discrete-time SIR (Susceptible-Infected-Recovered) model with time-varying R0 defined for each region r and day d. We also assumed the stock of susceptible population S(r,d) is almost constant through time at S(r, d) ≈ Nr, (where Nr is the size of the total population in the region; i.e. a small percentage of the population is contaminated) as is probably valid up to now. The dynamics of infected people I(r, d) an region rat day d thus varies as:
where γ is the recovery rate of the infection and R0 is a stochastic variable whose expected value is defined by the ongoing social distancing measures (no measure, state emergency or mandatory confinement). Those three expected values are parameters that are estimated from the reports. Formally, the priors over R0 for each region are defined as independent Gaussian processes. The prior mean is defined by the distancing measures, while the prior covariance is squared exponential covariance K with variance σ2=0.12 and length l=1 day.6 Such prior allows to capture differences in the spread between different regions as well as temporary change of R0 within a region (Gaussian processes enforce that these fluctuations are smooth in time). The variance term σ2 was set so it is unlikely that the infected population goes down in a single day by a proportion larger than the recovery rate γ. Based on the mean 20 days of contagious before recovery (duration of viral shedding7), we set γ=0.05day-1. We assumed that the value of infected people at the beginning of the period studied (20th of February 2020) was drawn from a log-normal distribution with mean xir and variance 1, where xi is a parameter specific to each autonomous community (estimated from the data). This variability allowed us to capture initial variations in the spread of the epidemics at the beginning of the period of study.
The number of new cases per day is
with
There is a debate about the reliability of these data. It is believed that hospitalizations report is the most reliable of these indicators,11 as many cases go undetected (some patients are asymptomatic or suffer mild symptoms; saturation of health systems have led to testing only more severely affected patients in some autonomous community), and some deaths are not integrated in the official count because of the lack of viral charge testing. There is indeed a large variability between the fraction of hospitalizations per reported case between autonomous communities: it is 31% in Galicia but 82% in Comunidad de Madrid. This difference is more likely to be due to differences in detecting and reporting cases rather than in the true proportion of infected people requiring hospitalization. It is also more difficult to assess reliably the number of cases from death reports as the latency from infection to death is long and can be very variable across individuals.
Parameters of the model, including the value of R0 in the different conditions, were estimated from the data using an expectation-maximization algorithm (see Appendix A for details). We ran three alternative analyses: the principal analysis estimated model parameters using hospitalizations reports; two control analyses were run using either detected cases or deaths reports, which are two less reliable indicators.
Results
We present the number of COVID-19 cases in all 19 autonomous communities inferred from the pattern of hospitalizations in Figure 2 (see also Figure I in the online Appendix for breakdown by autonomous community). The estimated cases display a sharp increase until the lockdown followed by a plateau, and then a decrease. We estimated the R0 before state of emergency, during state emergency and during enforced lockdown. R0 was found to drop from 5.89 (95% confidence interval ENT#091;95%CIENT#093;: 5.46-7.09) to 1.86 (95%CI: 1.10-2.63) after state of emergency, and down to 0.48 (95%CI: 0.15-1.17) after full lockdown. We estimate a total number of 0.871 million COVID-19 cases in Spain by April 15th 2020, including 0.294 active cases, 0.559 recovered and 0.018 deceased (see Fig. 3 for breakdown by autonomous community).
Estimates were similar when we used detected cases rather than hospitalizations. Using detected case reports, we estimated the R0 to be 6.91 (95%CI: 6.75-7.39) before the state of urgency, 2.22 (95%CI: 1.92-2.74) during the state of emergency and 0.85 (95%CI: 0.5-1.05) during the full lockdown. The estimate for the cumulative number of cases was 0.823 million overall in Spain (0.351 million active).
Because deaths occur after a long and variable interval after contamination, we could not reliably estimate the value of R0 separately for the state emergency and full lockdown measures, so we simply estimated R0 before and after declaring the state of emergency. We estimated the R0 to be 6.48 before the lockdown (95%CI: 5.5-7.51), and 0.49 afterwards (95%CI: 0.16 1.57). The estimate for the cumulative number of cases was 2.82 millions overall in Spain (0.72 millions active).
Finally, we assessed how much our results depended on some of our assumptions, using hospitalization reports. We estimated the parameters using a different pattern of delays of notification for hospitalizations (50% reported on the same day; 20% on the next day; 20% two days later; 10% three days later). This change in delays induced a relatively small change in estimated R0: 5.60 before the state of emergency (95%CI: 4.77-7.05), 1.66 (95%CI: 1.14-2.55) during the state of emergency and 0.60 (95%CI: 0.13-1.35) during the full lockdown. The assumed probability of hospitalizations had no impact at all on the estimated R0, and had an inversely proportional influence on the estimated number of cases: assuming 7.5% of hospitalizations instead of 15% would lead a twofold increase in the estimated number of cases.
Discussion
We found similar estimates of reproducibility number and the proportion of the Spanish population contaminated by the new coronavirus, whether they were estimated from hospitalization numbers or detected cases. Both estimates from case reports and hospitalizations suggest that only mandatory quarantine achieved R 0 lower than 1, while R 0 during state of emergency before non-essential services were shut down was estimated to be well beyond 1. This predicts that the opening of the non-essential services by April 13 may lead to a new surge of cases. Based on this empirical study, mandatory confinement is the only state-wise measure that effectively reduces the number of contaminations.
Estimates based on death reports differed considerably from those based on either hospitalizations or detected cases. This suggests that some of the assumptions and data our modeling is based on may not be accurate (although commonly used in previous studies), and shows that this can induce very large biases in the estimation of the propagation of the new coronavirus in Spain. A crucial pre-requisite for the reliability of our estimates is that the proportion of events do not vary in time. That was likely not the case neither for detected cases, as the testing policy evolved during the period of study, nor for deaths, as the saturation of health systems may have led to higher death tolls and reporting of deaths in retirement homes evolved. As the proportion of hospitalized cases is believed to be more stable across time, we believe that estimates based on the latter are more reliable than estimates based on either reported cases or deaths. Below, we further comment on the results obtained with the hospitalization reports.
Our approach is similar to a study by an Imperial College team published last week inferring the impact of non-pharmaceutical measures (including lockdown) on propagation of the new coronavirus in 11 European countries.10 Both studies rely on fitting a model of infection dynamics to observed data (here hospitalizations). This contrasts with other approaches based on fitting a curve to the observed time series (e.g., for patterns of fatalities in United States12 or patterns of cases in China13), or to model simulations studies that capture how the pattern of contacts in different scenarios (with or without social distancing measures) affect virus propagation.14-16,19 Other studies have also estimated the number of cases from the reported deaths, assuming a fixed duration from infection to death.17
Our modeling approach included stochasticity in the reproducibility number in each area. This notably allows to capture distinct trajectories of infection in different areas. We noted a negative correlation across autonomous communities between the proportion of infected people at the onset of state of emergency and the subsequent increase in infection (Fig. 4, Pearson coefficient: r=−0.46, p=0.048). In other words, R0 at the state of emergency was smaller in the regions with more cases. This could be due to a series of factors. First, the communities with the largest proportion of cases could start developing herd immunity, hence limiting the propagation of the infection (which seems unlikely given that in the most affected autonomous communities, only a few percent have been infected according to our estimations). It could also be the result of local policies taken before the national lockdown in the most affected autonomous communities, or a better compliance of lockdown and social distancing in most affected areas. Another factor could be the migration from the most affected (especially Madrid) to less affected region before lockdown was implemented,18 or some distortions in the reporting (under-reporting) of cases in saturated health systems.
Our study has several important limitations. First, as noted previously, it is not clear how reliable is the data the modeling is based on both epidemiological reports, and infectious estimate probability and latency of symptoms, case detection, hospitalization and death. It should be stressed that the unreliability of epidemiologic data (with changes of criterion of inclusion along time and between autonomous communities) induces important biases which impede an accurate estimation of the impact of political measures on the propagation of the new coronavirus. A faster and more reliable tracking of the epidemics could be performed if cases were reported in a systematic way dated by the onset of symptoms rather than detection, as the incubation period has been well characterized.
Second, the model captures the number of new cases each day as a proportion from the pool of infected people in the same area. It does not take into account how the age distribution in each area affects that each infection leads to hospitalizations or death, nor how the probability of infecting depends on the days from infection.10 Nor does it take into account mobility between regions, whose impact is believed to be more important at the initiation of the epidemics. It is also worth noting that in a locked down environment where most contacts are compartmentalized in households, it is possible that at beginning the virus continues to spread rapidly within households, but less so between households. As immunity was not taken care here, R0 may decrease significantly more without further policies, after this first wave of within-household contamination is over. Finally, we only modelled the impact of lockdown, not of other measures which were taken too close apart (banning public events, closing schools, etc.), and simultaneously in most regions, so it is not possible here to disentangle their effects precisely. The analysis also did not take into account other changes that have occurred during this period, such as measures at an individual level or at the level of companies and local institutions (usage of masks, hands washing, etc.).
In conclusion, the greatest interest should be focused on the trends in R0 found in this work, which show drastic successive reductions after the implementation of the state of urgency and forced quarantines.
What is known about the topic?
The late 2019 Covid-19 disease outbreak has put the health systems of many countries to the limit of their capacity. The objective of this work is to evaluate the impact of the measures undertaken in Spain to deal with the pandemic.
What does this study add to the literature?
Using hospitalization reports, we estimate that the basic reproduction number fell from 5.89 before the state of emergency to 0.48 after full lockdown was implemented. Our work show as well the importance of relying on update and reliable epidemiologic data to evaluate precisely the impact of public health policies on the outbreak.