key: cord-0556158-cu8cy0xq authors: Cacciapaglia, Giacomo; Cot, Corentin; Sannino, Francesco title: Multiwave pandemic dynamics explained: How to tame the next wave of infectious diseases date: 2020-11-25 journal: nan DOI: nan sha: 8af9a612fff391e70b703f03a408558da5535ac1 doc_id: 556158 cord_uid: cu8cy0xq Pandemics, like the 1918 Spanish Influenza and COVID-19, spread through regions of the World in subsequent waves. There is, however, no consensus on the origin of this pattern, which may originate from human behaviour rather than from the virus diffusion itself. Time-honoured models of the SIR type or others based on complex networks describe well the exponential spread of the disease, but cannot naturally accommodate the wave pattern. Nevertheless, understanding this time-structure is of paramount importance in designing effective prevention measures. Here we propose a consistent picture of the wave pattern based on the epidemic Renormalisation Group (eRG) framework, which is guided by the global symmetries of the system under time rescaling. We show that the rate of spreading of the disease can be interpreted as a time-dilation symmetry, while the final stage of an epidemic episode corresponds to reaching a time scale-invariant state. We find that the endemic period between two waves is a sign of instability in the system, associated to near-breaking of the time scale-invariance. This phenomenon can be described in terms of an eRG model featuring complex fixed points. Our results demonstrate that the key to control the arrival of the next wave of a pandemic is in the strolling period in between waves, i.e. when the number of infections grow linearly. Thus, limiting the virus diffusion in this period is the most effective way to prevent or delay the arrival of the next wave. In this work we establish a new guiding principle for the formulation of mid-term governmental strategies to curb pandemics and avoid recurrent waves of infections, deleterious in terms of human life loss and economic damage. various countries 7 . The approach is based on the analysis of the time evolution of the total number of infected cases and the symmetries that this epidemic curve reveals, allowing to extract reliable information from the data independently on the specific conditions met in each country. In fact, all the elements that can influence the velocity of the disease spreading are included in a single parameter, which contains the effect of local conditions, non-pharmaceutical interventions and socio-demographical characteristics. The eRG, therefore, can provide complementary information to studies that analyse in detail the effect of various measures [14] [15] [16] [17] [18] . As an example, the eRG framework has been used to study the effect of mobility reduction in Europe and the US during the first wave 19 , highlighting a universal time-frame of 2-4 weeks before an observable effect can be detected in the virus diffusion. For comparison, detailed studies of the mobility in the US 20 have been able to identify the locations and events that foster the infection of new individuals and ignite hotspots. In this work we focus on the total number of infected cases, as this is the most reliable tracker of the time-evolution of the pandemics. In fact, other data, like the number of deaths and of hospitalisations, depend on factors like the age distribution and medical pre-conditions of the infected individuals, which can influence the delay between the infection and the time-stamp in the data. The master multiwave equation for the time-evolution of the total number of infected cases I j (t) in a region j reads: where the first term on the right-hand side is a generalisation for w + 1 consecutive waves of the CeRG equations 8 and the second term contains the interactions between regions 7 . Here, we will always consider the number of cases per million inhabitants in order to compare different regions. In the master equation, most of the parameters are explicitly independent on the normalisation, as I j (t) always appears divided by the total number of cases at the end of the first wave, A j : the only exception is the interaction term, which also depends on the population of the regions (n m j measures the population of region-j in millions). The parameters γ j measure the effective velocity of the virus in each regions, and can be associated to an effective infection rate. This parameter can be eliminated from the equation by measuring the time in terms of a region-dependent scale, τ j = γ j t, once the couplings k jl are also rescaled: γ j can therefore be interpreted as a local time-dilation, characteristic of each region and taking into account all the non-pharmaceutical measures and local conditions in each region. These parameters can be extracted from the data at the beginning of the epidemic diffusion in each region, independently on the normalisation of the number of cases, which is very sensitive to the testing strategies 21 changing during the pandemic. More details on the equations, and on the meaning of other parameters can be found in the supplementary material. The master equation (1) encodes the multi-wave pattern in two ways: in the first term, the parameters δ j,ρ destabilise the fixed points at I j (τ * ρ ) = A j /ζ ρ ; in the second term, the interactions with other regions, or with an external source, can also destabilise the system and drive a new growth of I j towards the next fixed point. In fact, for δ j,ρ = 0, the number of infected will grow until I j (τ * ρ ) = A j /ζ ρ , where the growth stops because of the vanishing of the beta function. This is a steady-state, independent of time, which signals the end of the infection. For δ j,ρ < 0, the zero is moved on the complex plane and cannot be reached, thus driving an endemic state with linear growth, which we call strolling in honour to the application of this formalism in high energy physics 8 . The second term has been used 13 to predict the European second wave of September 2020, where k jl was associated to an estimated number of travellers between each country. In general, both effects are expected to be present: as we will see, the instability due to δ provides a maximal delay for the arrival of the next wave, which is directly related to the number of new cases recorded during the strolling period between waves. The presence of a large interaction can induce an early arrival of the new wave. Thus, effective measures to prevent and control the next wave of a pandemic like COVID-19 can only go via a strict control of the number of cases inside the country or region, combined to effective tracking of new infected individuals traveling in. This is precisely the strategy followed by China, Vietnam and New Zealand, leading to an early extinction of the disease and the absence of a second wave. The CeRG multiwave model, corresponding to (1) without the interaction terms, can be used to effectively describe a pandemic episode in multiple subsequent waves. To illustrate the model, in the top panels of Fig. 1 we show some features of the solutions for an isolated region with only two waves, corresponding to w = 1 and δ 1 = 0. The latter condition ensures that the epidemic is extinguished after the second wave. All results are shown for number of cases normalised to the first wave, and expressed in the local time τ = γt. Panel a) shows three solutions (solid lines) together with the corresponding new cases (dashed lines), for three values of p 0 . We can clearly see the two-wave structure emerging in the solution, the fact that the second peak tends to be flatter than the first (for ζ 1 = 1/2). Furthermore, larger values of the exponent p 0 tend to delay the second peak and flatten it. An important factor in controlling the arrival of a future wave is the number of new cases during the intermediate strolling Table 1 ). In panel d) we show the value of the geographical uniformity indicator as defined in the text for a sample of countries, showing that the virus is more equally spread in the various regions during the second wave, in most cases. phase, which we encode in the parameter normalised to the total number of infected cases after the first wave, A, and expressed in terms of the local time. This is a crucial parameter in controlling the timing of the second wave, as illustrated in panel b) of Fig. 1 where we show the peak delay ∆τ peak as a function of S t . For S t → 1, the delay goes to zero as the two waves merge into a single one, while it grows for smaller values following a power law that depends on p 0 . Remarkably, for small enough S t , the peak delay saturates to a finite value, indicating that the second wave cannot be pushed away as it seems inevitable as long as a strolling period is present. The peak delays also depend strongly on ζ 1 , which encodes the height of the second peak, being more delayed for decreasing ζ 1 : for values above 0.8, however, the second wave becomes too small, thus the solution looses physical relevance. Comparing the predictions of the CeRG multiwave model to data is not an easy task: in fact, the number of detected infections, collected via the positivity of the tests done in each country, depends crucially on the number of tests done each day 21 and on the specific testing policy adopted over time. For instance, during the first wave, many countries did fewer tests while focusing on hospitalised cases, while a more extensive testing campaign occurred starting in the summer months. As a consequence, quantities that depend on the number of cases, like the delay between peaks, cannot be computed accurately as a bias between two waves is present in the data. We, therefore, will apply the model to the second wave and following ones. As an example, in panel c) of Fig. 1 we show the data for Japan compared to a scenario based on the CeRG multiwave model. Japan is an ideal candidate for this model since, being an island, the frontiers can be well-controlled, and one can consider the country as an isolated system. Furthermore, the second wave has already ended, followed by a two-month period of strolling with around 450 new infected cases detected each day. The CeRG model, shown in blue, provides a good quantitative and (6) 2020-08-26 0.7 2020-11-25 Table 1 . Numerical results for the eRG fit and the CeRG forecast for the next wave. Columns 2 to 6 contain the results for the fit of the first two waves using the eRG model 6 , where A are indicated in number of cases per million inhabitants, γ eRG is given in inverse days, and the new parameter I strol indicates the number of daily new cases per million inhabitants obtained by fitting the strolling period with a linear growth. The values highlight that the second wave typically features a smaller infection rate and a larger number of infected cases. In the last three columns we show the expected second and third peak dates (within a one week error) and the expected value of I strol . Except for Australia, South Africa, Bolivia, Saudi Arabia, Japan and South Korea, we fix S t = 0.01, p 0 = 0.55, p 1 = 0.6 and ζ 1 = 1/2, while γ and A 0 are chosen to fit the second wave (first for South Africa, Bolivia, Saudi Arabia). For countries currently in the strolling period, all the parameters of the CeRG model are fitted. qualitative description of the data, and predicts that the third wave will peak at the beginning of December and be slightly higher than the second (ζ 1 = 0.4). Between the first and second waves, instead, no significant strolling was observed. This scenario could be interpreted in the following way: after the first wave, the virus diffusion was strongly limited by the enacted measures. However, new infected cases may have entered the country from abroad and/or spontaneous emergence of local hotspots inside the country (parametrised by the k-interactions). After the second wave, the virus kept spreading geographically within the population triggering at a later stage the third wave. The latter phase can be described by the CeRG model, while the transition between the first and second wave is due to external interactions. We do not attempt to unify the three waves because of the bias in the counting. Testing the geographical diffusion of the virus in each country can provide useful indications on the mechanism behind each wave. For this purpose, we define a uniformity indicator, χ 2 , which encodes how far is the distribution of new cases in regions of the country from a completely flat one. Smaller values of χ 2 indicate a more uniform distribution. For Japan, we considered the new cases in the various prefectures (where we exclude Okinawa for the geographical distance from the mainland) during the first and second waves, as shown in red in Fig.1 d) . The second wave has a larger χ 2 , which could be interpreted as more localised diffusion due to hotspots or travellers returning to their home cities. We also report the indicator for the month of November, which we would expect to be reduced if the strolling plays an important role in creating the third wave. The result is too preliminary, as Japan is still far from the peak of the third wave and the indicator is found to be minimised at the peak. Having validated the CeRG approach, we can now apply it to understand and predict the next wave in various regions of the World. As a caveat, we should recall that the interactions between regions and the presence of hotspots can also affect the results and anticipate the insurgence of the next wave. Europe: Most of the European countries are being hit by a second wave of the COVID-19. A general trend we observe is that the infection rate during the second wave is smaller than the one of the first wave, as shown by the values of γ eRG in Table 1 . The comparison is done by fitting the two waves independently by use of the eRG model. Moreover, the expected cumulative number of infected cases during the second wave is much larger than that for the first wave, even taking into account the higher number of tests performed during the second wave. The emergence of the second wave was explained as arising from the interactions between countries 13 , nevertheless the presence of strolling between the two waves, shown in terms of the number of new cases per million inhabitants I strol , indicates that both effects participated. The decrease of the geographical uniformity indicator, shown in Fig. 1 d) , indicates that the strolling had an important role in diffusing the virus across the countries (for England, the uniformity was present since the first wave). The geographical diffusion may also be the reason behind the fact that the second wave has infected a larger portion of the population. The CeRG model cannot be applied straightforwardly to the first two waves, due to the uncertainty in their relative normalisation, while it can be used to estimate when a third wave will hit. The result is shown in the last three columns of Table 1 , and in Fig. 2 Table 1 shows that a wide range of peak timings are expected, ranging from March to November 2021, where we associated an error of 1 week to the projection due to a variation of 10% in the infection rate. These results show clearly that controlling the infection rates and reducing the level of strolling after the end of the second wave are keys to delaying the next wave. Another element that should be included is the number of travellers across various countries 13 , which can help propagate the wave from country to country, thus affecting the ones with pronounced delayed projections. The US: The US has already seen two waves in April and July-August, and is undergoing a third. However, the first two waves are geographically distinct, with the episode in April mainly involving New York and New England, and the second spreading all over the remaining states. This is well illustrated by the geographical uniformity indicator in Fig. 1 d) , which sharply drops between the two episodes. The third point, based on the data of November, is still preliminary and will decrease as the third wave approaches its peak. To analyse the evolution of the COVID-19 epidemic in the US, a dedicated study which takes into account sub-regions is required. The uniformity analysis suggests that the first two episodes should be described in terms of interactions between states, while the third one may be originated by the strolling. Results of this analysis and In the bottom panels we show two sample countries from other regions of the World. In both cases, currently the epidemic is in the strolling regime after the first wave, indicating an imminent restart of the epidemic. In cases with an ongoing strolling, S t is fitted to reproduce the data. projections for the future waves in the US will be presented in a separated article 22 . Other countries: We included in our analysis a selection of countries for other regions of the World, selected in order to represent all continents. Note that we retained only countries for which the multi-wave analysis is best explained in terms of the CeRG model, i.e. where diffusion of the virus in sub-regions do not produce features that would require a multi-region analysis. The latter situation can be tackled within coupled CeRG equations, but this analysis goes beyond the scope of the present work. In Table 1 we show the results for the selected countries, also illustrated in the bottom row of Fig. 3 for South Africa and Saudi Arabia. In most selected cases, the country is in the strolling regime, following the end of the first or second wave, thus allowing us to tune the CeRG parameters to reproduce the strolling and give a more reliable forecast for the following wave. In some cases, like South Africa, the high level of strolling indicates that a new wave is imminent. As we are not trying to perform a fit, due to the many uncertainties in the social distancing and testing policies, the scenario we present should be considered as a probable one. Yet, it should be noted that the model does not leave much room to modify the expected number of total infected cases during the future wave nor change the timing substantially, without drastic pharmaceutical or non-pharmaceutical interventions We provide a mathematical understanding of the wave pattern for pandemics, like the COVID-19 one. The approach is employed to forecast the timing of a future wave based on the number of new infections during the intermediate endemic phase. The timing of the new wave is related to a newly introduced parameter, S t , that can be easily deduced from the cumulative number of infected cases. We studied several countries in different regions of the World and, in absence of any pharmaceutical interventions, we estimate the timing of the next wave of infections. We found countries where a new wave will start in December 2020, like South Africa, and countries where it could start as late as October 2021, like in France. Our predictions will be affected by the border control regulations with the generic effect of inducing an early increase in the number of infections. Our understanding of the wave structure of the COVID-19 pandemic draws the attention to the inter-wave strolling period. We discover that controlling the number of new infections during the strolling period is necessary to delay the beginning of a future wave. This amounts to imposing social distancing measures and break potential chains of infections after the end of the wave in order to keep S t as low as possible. Delaying the next wave is crucial in order to have enough time to realise an effective vaccine campaign. Our results can effectively guide policymakers to time (non)pharmaceutical interventions to delay or reduce the impact of future COVID-19 waves. Until now, most measures are taken when the number of new infected cases is already grown substantially. At this point in time one can only contain the wave not avoid it with serious impact for the loss of human lives as well as the economy. We prove that intervening during the strolling period of endemic diffusion is essential to delay or avoid a new wave while buying time for pharmaceutical interventions, like an effective vaccine campaign. More specifically, to maximise the delay, the strolling parameter must be kept small, S t ≈ 10 −5 for an optimal use of the enacted measures. In most countries, this implies that the number of new cases at the end of the wave should be kept at the level of a few units per day. This effect can be achieved by keeping or introducing new measures after the end of the wave, in the form that is more appropriate for the local conditions. In the original eRG approach 6 , rather than the number of cases, it was used its natural logarithm α(t) = ln I(t). For a single wave pandemic this provides a good fit to the data. To describe multiple waves 8 , it is better to use the eRG directly for the cumulative number of total cases rather than its log. The derivative of I(t) with respect to time is interpreted as the beta function of an underlying microscopic model. In statistical and high energy physics, the latter governs the time (inverse energy) dependence of the interaction strength among fundamental particles. Here it regulates infectious interactions. Thus, the dictionary between the eRG equation for the epidemic strength I(t) in an isolated region of the world 6 and the high-energy physics analog is whose solution, for 2p = 1, is a familiar logistic-like function This solution can effectively describe one wave of the epidemic, and is related to the SIR model via time-dependent parameters 23 . The dynamics encoded in Eq. (3) is that of a system that flows from an Ultra-Violet fixed point at t = −∞, where I = 0, to an Infra-Red one where I = A, with A being the total number of infected cases at the end of the wave. The parameter γ is an effective infection rate, characterising the diffusion slope. Both the equation and the solution show that γ can be absorbed in a redefinition of time, τ = γt, so that γ can also be interpreted as a time-dilation factor, which differs by region and depends on external factors (social, demographic and relative to non-pharmaceutical interventions) which can slow down or accelerate the diffusion of the virus. Finally, b shifts the solution in time to match the beginning of the epidemic in each region, as shown by the equivalence b → 1 with γt → γt − ln b. Further details, including what parameter influences the flattening of the curve and location of the inflection point and its properties can be found in 24 and in 6 . The presence of a truly interacting fixed point at large times predicts that the total number of cases approaches a constant value, while the new cases drop to zero. This feature can be associated to the system approaching a time scale-invariant state. Most regions affected by COVID-19, however, show a slightly different dynamics: in fact, the infection stabilises to an endemic period of constant growth, where the total cases continue growing linearly. This region can be associated to a breaking of the time scale-invariance 8 , which does not allow the system to approach the fixed point and generates the strolling regime of linear growth. To account for this feature, the beta function in (3) can be extended as follows: where we have used normalised variable ι = I/A and τ = γt. The two new parameters δ and p are real numbers, with p positive. For negative δ , the second factor has complex zeros, so that the system cannot approach a UV fixed point but, for small |δ |, it lingers near the would-be fixed point ι ≈ 1 for a long time, inversely proportional to |δ |. This feature can explain the origin of the strolling 8 . In this work, we will use the new Complex eRG (CeRG) approach to describe multi-wave dynamics. The solution of Eq. (5) still encodes a single wave, followed by a strolling period. The latter signals an instability of the system, and precedes a new exponential increase of new cases. The mathematical model can be extended to describe multiple waves by endowing the CeRG beta function with new (quasi) zeros as follows: with ζ ρ < 1, 0 < −δ ρ 1 and p ρ > 0. We have that w + 1 is the total number of pandemic waves, and that the diffusion will stop if δ w = 0, with a total number of infected cases equals I(∞) = A/ζ w . For a case with two waves, w = 1, the value of ι at the two peaks (where the new infected cases reach a local maximum) can be determined by the zeros of the derivative of the beta function, ∂ β multiwaves /dι = 0. Setting δ i = 0 for simplicity (and because their value is very small and numerically irrelevant), we find From the definition of the beta-function, the time between the two peaks can be computed as follows: Note that the dependence on δ 1 is negligible, because the domain of the integral is always far from the second set of complex zeros. The value of δ is directly related to the number of new cases during the strolling, given by Interactions among different regions of the world can be taken into account by adding to the beta function for each country j the following interaction term 7 : where n m j is the population of region-j in millions. The matrix k jl is proportional to the numbers of travellers between each of the regions considered in the interaction. We first fit the first and second wave (if reached) by use of independent solutions to the eRG framework. This exercise will allow us to compare the two waves, in particular the infection rates γ eRG , which are independent on the normalisation of the cumulative number of infected. We use open-source data from the online repository Ourworldindata.org. We first identify the first wave, within the dates indicated in Table 2 , and fit the data by use of a simple logistic function: We then fit the data after the end date of the first wave (having subtracted the final number of cases) with a second independent logistic function plus a linearly growing term, which characterises the strolling period. If the second wave is absent, we simply fit the linear strolling. It is well known 6 that the fit is optimal only after the peak is achieved, so we tune the fit to reproduce the expected peak in some cases. In two cases we take into account spurious features in the data: for France, a sudden drop in the number of new cases can be observed after November 8, which is accountable by a decrease in the number of tests plus a reduced positivity rate; for Spain, the initial growth is due to a hotspot in Catalunya, followed by the spread to the rest of the country. Conservatively, we only fit the second higher sub-peak. As a second step, we tune the CeRG model, without interactions, to reproduce the last wave in order to forecast the arrival of the next one. The beginning of the wave (after subtracting any initial data) allows to determine γ, while A 0 and p 0 are determined by the end of the wave and the peak region. For countries where the second wave is still ongoing, we fix p 0 = 0.55, while its value is fitted in other cases. Furthermore, we fix the strolling parameter S t = 0.01, or fit it to the data for countries that are currently in the strolling regime. The parameters obtained by this tuning are reported in Table 2 , and have been used to obtain the peak dates and in the plots. To investigate potential differences between waves due to the geographical spread of the virus, we define an uniformity indicator for each country by studying the number of new cases in various regions over a period of time around the peak. The indicator is inspired to a χ 2 -variable, measuring how far is the actual distribution of cases compared to a perfectly uniform distribution, and it is defined as follows : where I ri (∆t) is the number of new infected cases in the region ri over a time interval ∆t, and I r (∆t) indicates the mean over the whole set of regions. We normalise the parameter to the number of regions in order to obtain comparable values. The smaller is χ 2 , the more uniform is the diffusion of the virus. We also noted that the value of the uniformity indicator reaches local minima at the peak of each wave, thus the values reported for starting waves are larger than the final one. Table 2 . Parameters for the CeRG forecast for the third waves. To compute the values, we only used open-source data available on various online repositories. For Japan, we obtained the local data for the 46 prefectures from Toyokeizai.net. In the analysis we omitted Okinawa because of its distance from the mainland. For Germany (16 Landen) , Italy (20 regions), Spain (17 regions) and England (9 regions), we obtained data collected in periods of about 4 weeks from Citypopulation.de. For France, as the local data on the new number of infected is not available before May 13, 2020, we used information for the number of hospitalisations in the 94 continental departments from Data.gouv.fr. Finally, the data for each state of the US is obtained from Ourworldindata.org. For Japan and the US, the result for the third wave overestimates the one at the peak, which is not reached yet. The prediction corresponds to S t = 0.01 for Canada, while the strolling parameter is fitted to the data for Bolivia, South Korea and Australia, and is compared to the data (adjourned to November 23) and the eRG fit from influenza: The mother of all pandemics A contribution to the mathematical theory of epidemics Coupled disease-behavior dynamics on complex networks: A review Statistical physics of human cooperation Coupling dynamics of epidemic spreading and information diffusion on complex networks Renormalization Group Approach to Pandemics: The COVID-19 Case Interplay of social distancing and border restrictions for pandemics (COVID-19) via the epidemic Renormalisation Group framework Evidence for complex fixed points in pandemic data Forecasting covid-19 Early spread of covid-19 in romania: imported cases from italy and human-tohuman transmission networks Preliminary prediction of the basic reproduction number of the Wuhan novel coronavirus 2019-nCoV How the pandemic might play out in 2021 and beyond Second wave covid-19 pandemics in europe: A temporal playbook Effect of non-pharmaceutical interventions for containing the covid-19 outbreak in china Estimating the effects of non-pharmaceutical interventions on covid-19 in europe The effect of travel restrictions on the spread of the 2019 novel coronavirus (covid-19) outbreak The effect of control strategies to reduce social mixing on outcomes of the covid-19 epidemic in wuhan, china: a modelling study Time, space and social interactions: exit mechanisms for the covid-19 epidemics Acknowledgements G.C. and C.C. acknowledge partial support from the Labex-LIO (Lyon Institute of Origins) under grant ANR-10-LABX-66 (Agence Nationale pour la Recherche), and FRAMA (FR3127, Fédération de Recherche "André Marie Ampère"). This work has been designed and performed conjointly and equally by the authors. G.C., C.C. and F.S. have equally contributed to the writing of the article. The authors declare no competing interests. All data used in this work are obtained from open-source repositories: Ourworldindata.org, Worldometer.info, Citypopulation.de, Data.gouv.fr, Toyokeizai.net. A γ δ 1 p 0 p 1 ζ 1 1st wave start 1st wave end France In Figs 4 and 5 we provide a visualisation of the CeRG forecast for the next wave for the countries not shown in the main text.