key: cord-0576546-n4225rcd authors: Martelloni, Gabriele; Martelloni, Gianluca title: Analysis of the evolution of the Sars-Cov-2 in Italy, the role of the asymptomatics and the success of Logistic model date: 2020-04-05 journal: nan DOI: nan sha: 7a7754b76163543eddb0d2471eeac9759b73a418 doc_id: 576546 cord_uid: n4225rcd In this letter we study the temporal evolution of the Sars-Cov-2 in Italy. The time window of the real data is between February 24 and March 25. After we upgrade the data until April 1.We perform the analysis with 4 different model and we think that the best candidate to describe correctly the italian situation is a generalized Logistic equation. We use two coupled differential equations that describe the evolution of the severe infected and the deaths. We have done this choice, because in Italy the pharyngeal swabs are made only to severe infected and so we have no information about asymptomatic people. An important observation is that the virus spreads between Regions with some delay; so we suggest that a different analysis region by region would be more sensible than that on the whole Italy. In particular the region Lombardia has a behaviour very fast with respect to the other ones. We show the behaviour of the total deaths and the total severe infected for Italy and five regions: Lombardia, Emilia Romagna, Veneto, Piemonte, Toscana. Finally we do an analysis of the peak and an estimation of how many lifes have been saved with the LockDown. In this letter we study the temporal evolution of the Sars-Cov-2 in Italy. The time window of the real data is between February 24 and March 25. After we upgrade the data until April 1.We perform the analysis with 4 different model and we think that the best candidate to describe correctly the italian situation is a generalized Logistic equation. We use two coupled differential equations that describe the evolution of the severe infected and the deaths. We have done this choice, because in Italy the pharyngeal swabs are made only to severe infected and so we have no information about asymptomatic people. An important observation is that the virus spreads between Regions with some delay; so we suggest that a different analysis region by region would be more sensible than that on the whole Italy. In particular the region Lombardia has a behaviour very fast with respect to the other ones. We show the behaviour of the total deaths and the total severe infected for Italy and five regions: Lombardia, Emilia Romagna, Veneto, Piemonte, Toscana. Finally we do an analysis of the peak and an estimation of how many lifes have been saved with the LockDown. In early December, or reasonably during November, the Sars-Cov-2, or commonly coronavirus, appear in Wuhan, China. Coronaviruses are a large family of viruses known to cause diseases ranging from the common cold to more serious diseases such as Middle Eastern Respiratory Syndrome (MERS) and Severe Acute Respiratory Syndrome (SARS). Coronaviruses were identified in the mid-1960s and are known to infect humans and certain animals (including birds and mammals). The primary target cells are the epithelial cells of the respiratory and gastrointestinal tract and seven Coronaviruses are identified to be able to infect humans. They can cause common colds but also serious lower respiratory tract infections. The virus causing the current coronavirus epidemic has been called "severe acute respiratory syndrome coronavirus 2" (SARS-CoV-2). This was announced by the International Committee on Taxonomy of Viruses (ICTV) which deals with the designation and naming of viruses (i.e. species, genus, family, etc.). The name is indicated by a group of experts specifically appointed to study the new coronavirus strain. According to this pool of scientists, the new coronavirus is the brother of what caused Sars (SARS-CoVs), hence the chosen name of SARS-CoV-2. The disease caused by the new Coronavirus has a name: "COVID-19" (where "CO" stands for corona, "VI" for virus, "D" for disease and "19" indicates the year in which it occurred). The Oms Director-General Tedros Adhanom Ghebreyesus announced it on February 11, 2020, during the extraordinary forum dedicated to the virus. The appearance of new pathogenic viruses for humans, previously circulating only in the animal world, is a widely known phenomenon (called spill over) and it is thought that it may also be at the basis of the origin of the new coronavirus (SARS-CoV-2). The scientific community is currently trying to identify the source of the infection. On December 31, 2019, the Municipal Health Commission of Wuhan (China) reported to the Oms a cluster of cases of pneumonia of unknown etiology in the city of Wuhan, in the Chinese province of Hubei. On January 9, 2020, the Chinese Center for Disease Prevention and Control (CDC) reported that a new coronavirus (initially called 2019-nCoV and now called SARS-CoV-2) has been identified as the causative agent and has been rendered publishes the genomic sequence. On January 30, 2020, the World Health Organization declared that this epidemic represents an international public health emergency. Oms on March 11, 2020 declared that COVID-19 can be defined as a pandemic. After notification of the epidemic by China, Italy immediately recommended postponing unnecessary flights to Wuhan and, subsequently, with the spread of the epidemic, to all of China. Consequently, the latter has canceled all flights from Wuhan. On 30 January, the Minister of Health ordered the suspension of air traffic with the People's Republic of China, including the Special Administrative Regions of Hong Kong and Macao, on 30 January. The measure also applies to Taiwan. In consideration of the Oms declaration of "International public health emergency", on January 31, 2020 the Council of Ministers declared a state of emergency as a consequence of the health risk associated with Coronavirus infection. This disease does not save Italy that has become a protected area with the DPCM signed on the evening of 9 March by the Prime Minister, Giuseppe Conte, who has extended the restrictive measures already applied for Lombardy and the 14 northern provinces most affected by the coronavirus infection to the whole national territory. The new action comes into force on March 10 and will take effect until April 3. Among the main innovations: it limits the movement of people, blocks sporting events, suspends teaching activities in schools and universities throughout the country until April 3. With the new ordinance of 22 March 2020 issued by the Minister of Health and the Minister of the Interior, from 22 March people are prohibited from moving or moving with public or private trasportation in a municipality other than that in which they are located, except for proven work needs, of absolute urgency or for health reasons. [1] In this scenario let's try to analyze the time evolution of the Sars-Cov-2 in Italy with growth patterns. Many growth models have been very recently applied to study the Covid-19 infection [2] [3] [4] [5] [6] [7] [8] [9] . Two very famous examples of these models are described by the Logistic [10] and the Gompertz [11] equations. The Logistic law has been applied in population dynamics, in economics, in material science and in many other sectors, while the Gompertz one describes tumor growth, kinetics of enzymatic reactions, oxygenation of hemoglobin, intensity of photosynthesis as a function of CO2 concentration, drug dose-response curve, dynamics of growth, (e.g., bacteria, normal eukaryotic organisms). The Logistic behaviour assumes that growth stops when maximum sustainable population density is reached through the carrying capacity K that depends on the environmental conditions. For example the ordinances of the Prime Minister G.Conte, the people's hygiene habits are encoded in the carrying capacity K. Whereas the Gompertz equation takes into account the aging of the population through a lower reproductive capacity over time; not knowing yet the characteristics of the virus we thought it could be a correct choice. Now we describes the regional and national choices to contain the epidemic and and consequently our strategy. At first in Italy, pharyngeal swabs were initially made only at seriously ill people. This choice, extremely correct in safeguarding the hospital structure (at least apparently), is wrong for a study of the disease data. We explain better: if we believe that the Case Fatality Ratio (CFR) is 1% [12] the number of deaths is the only data extremely relevant. From the extrapolation of total number of infected [15] we observe that the number of infected detected is only 10% of the total. This means that in Italy about 90% of the sick are not detected. Having no information, therefore, on asymptomatic and infected with mild symptoms, we decided to study the population of the severe infected with 2 equations: one for the severe infected and another coupled for the deaths, obviously the latter are related to the former. So we implement a Logistic model that take into account a possibility to consider the contribute of asymptomatic people in the evolution of the Sars-Cov-2 by means of additional constant and an equation of proportionality between infected and deaths. We consider also an interpolating between Logistic and Gompertz Model. We show that the first scenario is more optimistic and faster respect to the second one. Finally we observe the data of each region separately, noting that is present a delay in the evolution of the virus between the various regions and that each region has its own trend. In particular, Lombardia has the fastest trend and Veneto the slowest one. We conclude with the observation that Veneto's strategy of performing swabs even to asymptomatic potential seems to have successful, as can be seen from the estimate of the constant term of the logistic model. A possible scenario of the evolution of the evolution of Sars-Cov-2 virus in Italy is described by the logistic differential equation. The model is represented by the following equation where I(t) is the number of infected and the time t is the integration variable. The constant r 0 is the growth rate of this particular population (the Sars-Cov-2) and K is the carrying capacity. The growth rate r 0 is typical of the population because it is linked to the reproductive mechanism of the virus while the carrying capacity K depends on the environmental conditions. For example the government action of the Prime Minister G.Conte (Locked down(LD)), the people's hygiene habits are encoded in the carrying capacity K. However we change this equation adding a constant term A that represents the contribution of asymptomatic people. We explain our idea for the Italian situation. Studying [12] we assume that the CFR is 1%. Consider, for example, the situation of the March 19 in Italy • 3405 deaths, • 41035 infected (severe cases, which is presented the necessity to verify the positivity to avoid the collapse of the healthcare facility), • 340500 estimated data of deaths + severe, symptomatic and asymtomatic cases. This means that we have a bath of 300000 infected that can produce severe cases. We consider for simplicity that this growth rate is constant. The exact solution for this equation is well-known and the limit for t− > ∞ represents the maximum possible number for this population. After the March 23 we think it will be necessary recalibrate A with the data of the Civil Protection. Why? Because 14 days after the LD the incubation period is over and we suppose that this action destroys possible places for the propagation of the virus. Finally we have where A LD < A has the meaning that the LD restricts the possibility of contagion through asymptomatic in individual houses or just on the job places. After 11 days the possibility to have symptoms are 2, 5% [13] then also March 20 is a good candidate to impose with the data A LD < A. We can also imagine the possibility that the LD would be so good that A LD < 0. We remark that the exact date of the decrease in the infected is so difficult because the results of the test is not always referred to the same day but we can have a delay for the incredible number of pharyngeal swabs. In conclusion we think that a day between March 20 and March 23 is a good candidate to the drop in infections. [16] Now we consider the second population of this model, the deaths. We couple the infected I with the deaths D in the following equation: After a simple integration we obtain where t 1 is the first integration extreme, reasonably the February 24. So we fix to have the same growth for the severe cases and the deaths; t d represents the interval between the appearance of symptoms and the deaths. From [14] we estimate t d = 13 days, but we observe from numerical aspects that t d = 4. A brief comment: this reflects the italian choice to do swabs only to severe cases, and not at the very beginning of the disease. The second scenario of the evolution of Sars-Cov-2 virus in Italy is described by the Gompertz differential equation. The model is represented by the following equation where I(t) is the number of infected and the time t is the integration variable. The constant r 0 is the growth rate of the virus Sars-Cov-2 and K is the carrying capacity. This model takes into account the aging of the population through a lower reproductive capacity over time, in other words we consider the virus less contagious over time. The solution of the previous equation give us the Gompertz law As in the logistic study we compute the limit or t− > ∞ and we obtain this represents the asymptotic term of the population, defined by the resources available in the environment, in some sense also in this model we can encoded the human external action. We choose to implement also in this model a constant term that we think that describes the role of the asymptomatic people. So we study numerically this generalized Gompertz equation From the study of the data, as we will see later in this letter, it's clear that the logistic scenario is very optimistic respect to the natural disaster described by the Gompertz model. Our idea is to create a model with a new parameter that enables to interpolate between the two regimes. So we take into account the following differential equatioṅ where r 0 , K l are respectively the growth rate and the carrying capacity of the Logistic model, and r 0 , K g for the Gompertz one. Obviously the time t is always the integration variable. The parameter q has range between 0 and 1, and obviously for q=0 we obtain the Gompertz behaviour and for q=1 the Logistic one. From the numerical simulations, performing 200 stochastic runs of the Gillespie direct method algorithm applied to the logistic equation, we obtain the following parameters for the Italy situation described by fig.1 : and K 1 = 0.14 and t d = 4. The errors are estimated at 5 σ, or about 5%. With the interpolating model at 21 March in Fig. 2 we observe how the situation could become particularly dramatic, especially as regards the duration of the epidemic (the curve becomes flat in late May). However, it becomes comforting with the data of March 25; the two graphs overlap and from the numerical simulations we find q = 1. This means that environmental factors, or LD, prevail over virus growth factors. 15 days after the LD we therefore started to see concrete effects, and this is compatible with the incubation times of the virus. It therefore seems obvious to study the evolution of the virus with a Logistic model when a Government applies LD. In conclusion, we expect around 111,000 diagnosed patients and around 16,000 deaths. This, considering a CFT of 1.2%, would lead to a total of about 1334,000 infected. In Fig. 3 we see that the peak is near 21-22 March and the end of the epidemic at the early May. In this section we update the Italy situation according to the calibration performed using the real data of Civil Protection at 27/03/2020. In Fig. 4 we report the previsions of infected and deaths: with the assimilation of two new data for the calibration of the model, the total number of infected goes to about 124800 and the deaths to about 17800. Observing Fig. 5 , we can note how the curve of daily total infected changes from day to day according to the acquisition of a new data, but the right tails of the considered curves are similar. This suggests that the end of contagion is confirmed, according to our study, at the end of April or early May. Finally, in Fig. 6 we show the correlation, according to the delay, between the temporal series of total infected and deaths. The plot represents the real data of deaths and the real ones of the total infected shifted of 4 days and rescaled according to the minimum difference between them, i.e., we search the best correlation between the deaths data and the infected shifted forward according to Eq.7 (the deaths are delayed compared to infected). By means of this analysis we found the correct delay and we calibrate the parameter K 1 of Eq.7 that represents the rate between the derivatives of infected and deaths (the value of K 1 , update at 27/03/2020, is 0.1429). 6 : Analysis of delay between infected and dead: for Italy delay is 4 days; the plot represent the real data of dead (blue circles) and the real data of total infected (red diamonds) shifted of 4 days and rescaled according to the minimum difference. We consider the following generalized Logistic equation where α is an experimentally parameter to fit better the initial power law climb. We show the trend of the severe infected in Fig.7, Fig.8 and in Fig.9 respectively with the α-model, the Logistic calibrated with data at 29 March and with the α-model calibrated at 1 April. We observe as fitting better the initial power law the estimated number of severe infected and deaths increases of 2%, so we choose to use the α-model to estimated the total number of severe infected and deaths. We obtain I(end) = 138500, D(end) = 19900, A brief comment, as we expect the growth coefficient r 0 is falling and the carrying capacity is obviously increased simply because the number of infected has grown. It is interesting that the contribution A of asymptomatics is also increased. We can explain this by thinking that asymptomatics remain the most important sources of contagion in the single houses of italian people during the LD. We observe as in a week after the peak the estimated number of severe infected and deaths are increased of 20%, but in the last 4 days the trend is in accordance with the data, both numerically and in forecasting the end of the epidemic, in particular the end date is stable from March 27. For this stability we decide to estimate the errors at 2,5 σ, i.e. 2, 5%. A possible marker of the presence of the peak could be the following quantity: where I(t i ) is the total number of infected at the day t i and S(t i ) is the total number of swabs at the same day. As we see in Fig. 10 P marker has a peak at 24 March, which is basically in agreement with our prediction. The graph of Fig. 10 has been reproduced with a moving average. To complete the peak picture we show the diagrams of the severe infected and the deaths for single day (Fig.11 e Fig.12) , that are in agreement with the delay time t d of 4-5 days stimated numerically. Considering the uncertainty in performing swabs, the possible delays in processing swabs and the end of the incubation time (10-14 days) the picture of the peak in the temporal window 21-24 March seems to be reasonable. As we see in Fig. 13 the two scenarios are very different. We compare the generalized Logistic curve at 1 April ( Fig. 8) with the Gompertz one calibrated with the data at 21 March ( i.e. with the data before the effect of the • 831,000 severe infected, • 119,000 deaths, • 11,900,000 total infected, • the end of epidemic at early September. This means that with LD we probably saved 100,000 human lifes! Let's start with the analysis of the single regions. If we show the data of the Civil Protection in Fig.14 it's evident as any region has a velocity of propagation of the virus different from each other. In particular, Lombardia is very fast and Veneto is much slow at the beginning of the spread of the disease (we observed how in each region the number of days in which the number of infections rises by an order of magnitude ranges from 10 days in Lombardia to 16 in Veneto.). Looking at the data and from the previous simple observation, it is spontaneous to think that an analysis of the individual regions is more sensible than a global analysis of Italy. This idea seems correct for two reasons: the virus arrives in the various regions at different times and each region has its different environmental and working characteristics, which mainly involves a different population density. After these considerations we therefore propose an analysis for the 6 regions mainly affected by the virus on 25 March. Unfortunately, the forecast of Lombardia shows us that 59% of future deaths are due to this region. The parameter A is the highest of all regions and this seems to confirm the interpretation that A is the contribution of asymptomatics. From the numerical simulations we obtain the following parameters and K 1 = 0.21 and t d = 5. (21) The region Emilia-Romagna will probably be the second most affected region in Italy. The geographical proximity to Lombardy and a large industrial activity could be the main causes. From the numerical simulations we obtain the and K 1 = 0.12 and t d = 2. (23) For Veneto we make the opposite consideration compared to Lombardia: the coefficient A is the lowest among the regions that performed the largest number of swabs. This data seems to be in agreement with the fact that Veneto has carried out a large number of swabs also with and K 1 = 0.06 and t d = 5. Yet another demonstration of our idea is contained in the K 1 value of the Veneto region, which is the lowest ever. and K 1 = 0.11 and t d = 7. A brief comment about the numbers of these regions: the contagion in these regions certainly started later than the other 3. Then it could be that in the initial stages of the contagion the coefficient r is greater (as it was in our previous simulations for Italy) and then slowly going down, as you can see from Fig. 14 Piemonte and Toscana are quite fast. We justify the largest r 0 parameter for Piemonte and Toscana compared to Lombardia. We will do a sensitivity analysis of the parameters later. These numbers are very significative, it is the signal that Lombardia is the carrier region of the spread of the virus in Italy. Finally as we see in Figs. 25-29 it is evident how the spread of the infection occurred at different times between the various regions, as the peaks are at different times. In particular, observing the tails of the curve, Toscana seems to be the region that will last come to zero infections, at 4-5 May. In summary, we studied the evolution of Sars-Cov-2 in Italy with Civil Protection data in the time window from 24 February to 25 March with the Logistic model. We upgrade the curves of severe infected and deaths until 1 April with a generalized Logistic model, that works very well at the early of the contagion. We provide around 138,5000 diagnosed patients and around 19,900 deaths on the whole Italian territory. This, considering a CFT of 1.2%, would lead to a total of about 1,658,000 infected at the end of coronavirus epidemic. We observe how the peak is more or less in correspondence of 21-24 March and the end towards the last days of April and early May as we see in Fig. 9 . We stress with the idea that a possible marker of the presence of the peak could be the quantity: This a quantity that has no fluctuations after passing its maximum value: for our opinion it is a great signal of stability. Another important result is the estimate of the delay time t d between the peak of the severe infected and the peak of the deaths which seems to be truly in agreement with the data of the Civil Protection. It is also interesting the comparison between the coefficient at 25 March and 1 April: • r 0 = 0.200, K = 110950 and A = 49, at 25 March, • r 0 = 0.173, K = 138400 and A = 79, at 1 April. As it is obvious, the coeffecient r 0 is falling during the LD and the the coeffecient K is rising. We stress that it is very interesting that the contribution A of asymptomatics is also increased. A possible interpretation is that asymptomatics remain the most important sources of contagion in the single houses of italian people during the LD. Finally we studied the evolution of the virus in the most affected regions. A delay is present in the appearance of the virus between the individual regions, so it seems more sensible to study them individually. We have noticed how Veneto's strategy in making swabs to as many people as possible has proved successful in containing the evolution of the virus. In the upgrading of the single regions with the data at 1 April, we observe explicitly the peak of the daily severe infected and we show the effective time delay of the comparison of the Sars-Cov-2 in every regions. Finally Tuscany will be the last region with zero infections, at 4-5 May. The last consideration about the importance of the LD: with a simulation with a Gompertz model, i.e. a model that predicts that the virus stops only because its infectivity propagation goes to zero, we estimate 119,000 deaths. With LD we have probably saved 100,000 human lifes! Obviously this project is on-going being the possibility that the data of the next few days may change the scenario, hopefully positive, maybe in a further 6-7 days following the government's even more restrictive measures on 22 March. * Electronic address: gabriele.martelloni@gmail.com † Electronic address: gianluca.martelloni@unifi Early Phylogenetic Estimate of the Effective Reproduction Number Of Sars-CoV-2 Emerging coronaviruses: Genome structure, replication, and pathogenesis Data analysis on Coronavirus spreading by macroscopic growth laws CoViD19: An Automatic,Semiparametric Estimation Method for the Population Infected in Italy Analysis and forecast of COVID-19 spreading in China A Poisson Autoregressive Model to Understand COVID-19 Contagion Dynamics, ssrnabstract-id=3551626 CDC COVID-19 Response Team, Severe Outcomes Among Patients with Coronavirus Disease 2019 (COVID-19) -United States How macroscopic laws describe complex dynamics: asymptomatic population and CoviD-19 spreading Notice sur la loi que la population poursuit dans son accroissement On the nature of the function expressive of the law of human mortality and a new mode of determining life contingencies Total number of infected is simply total number of deaths x 100 or if we believe that in Italy the CFR is 1, 2% (for the high number of ancient people) simply total number of deaths x We thank many colleagues for interesting discussions, in particular Andrea Marzolla, Domenico Seminara and Jacopo Viti, they were fundamental for correct observations and for providing us with sources from which to take inspiration. We also thank Pierluigi Blanc, S.O.C. Infectious Diseases 1 Santa Maria Annunziata Hospital, for stimulating discussions on technical subjects on which we had no knowledge.