key: cord-0732994-rgngy8fo authors: Macchiavelli, A. O.; Marti, G. V.; Marti, J. G. title: A Phenomenological Analysis of COVID-19 date: 2021-05-25 journal: nan DOI: 10.1101/2021.05.23.21249471 sha: a06929d5cb70d91771e96ca1bdcf50ac19f587f3 doc_id: 732994 cord_uid: rgngy8fo A phenomenological analysis of the time evolution of some COVID-19 data in terms of a Fermi-Dirac function is presented. In spite of its simplicity, the approach appears to describe the data well and allows to correlate the information in a universal plot in terms of non-dimensional or reduced variables Nr = N(t)/Nmax, and tr = t/{Delta}T, with N(t) being the total number of cases as a function of time, Nmax the number of total infected cases, and {Delta}T the diffuseness of the Fermi/Dirac function associated with the rate of infection. The analysis of the reported data for the first outbreak in some selected countries and the results are presented and discussed. The approach is also applicable to subsequent waves. Support of our framework is provided by the SIS limit of the SIR model, and simulations carried out with the SEICRD extension. The worldwide outbreak of COVID-19 has triggered a large number of studies. In fact, a search in medRxiv, bioRxiv [1], and arXiv[2] (across all fields) returns more than 8800, 2400 and 2900 articles respectively, that have been posted since the beginning of the year 2020 1 . It goes without saying that this impressive number of manuscripts clearly reflects the scientific urgency to understand the behavior of this novel virus and to develop and to assess several models that can shed light on its outbreak and make reliable predictions of its dynamical evolution. In this work, as a contribution on the topic of mathematical modeling, we would like to point 1 Numbers as of out that the available data [3, 4] for some selected countries could be described by a Fermi/Dirac (F/D) function, introduced in physics by Fermi and Dirac to describe the distribution of particles over energy states in systems consisting of many identical particles that obey the Pauli exclusion principle [5] [6] [7] . This type of function is also extensively used in nuclear physics to parametrize the nuclear potential (Wood-Saxon potential) in studies of structure and reactions [8] . Our phenomenological analysis of the first outbreak of the pandemic, presented in Sections II and III, provides a reasonable description of the data and the evolution of the pandemic without relying on any specific epidemiological model. An interesting finding is that the data follows a universal behavior based on dimensionless variables that allows, in a simple manner, to relate the response of the pandemic to measures adopted to counteract the spread of the virus. While we mainly focus our discussions on the first outbreak, the method can obviously be extended for the subsequent ones as briefly presented in Section IV. We make the conjecture that the data can be described by an analytical function of the F/D type: where N (t) is the cumulative number of infected cases, N max is the total number of cases, ∆T is the diffuseness of the F/D function and t 0 is a reference time when N (t 0 ) = N max 2 . The use of this type of function can be justified by considering the first derivative of Eq. 1: showing that the rate of infections at a given time is proportional to the product between the infected and non-infected cases. In Ref. [9] , the authors considered this type of function in an analysis of the early pandemic in Sweden. Furthermore, the so-called SIR (Susceptible, Infectious, Recovered) model [10] , widely used in epidemiology to study virus transmissions, consists of a system of three time-dependent variables: I(t), the number of total infected individuals at a given time, S(t) the number of individuals susceptible of contracting the infection, and R(t) the cumulative number of recovered individuals. In a closed system with a constant population size, N max , these variables satisfy the following equations: where λ is the transmission rate and γ the recovery rate. If the recovery rate is very low during the pandemic time interval (γ ≈ 0), we can approximate the infected cases by: This limit is known as the SIS (Susceptible -Infected -Susceptible) model with the solution above readily recognized in Eq. 2, with λN max = 1/∆T . In the SIR model the basic reproduction number, R 0 , is given at the outset of the epidemic by: λN max /γ = 1/(γ∆T ) (See the nice discussion in Ref. [11] ). If for a given population R 0 > 1 the infection will spread exponentially, while if R 0 < 1 the infection will progress slowly and eventually die out. The value of R 0 for COVID-19 is estimated to be between 2 and 4. An effective reproduction number, R, is often used to reflect the state of the epidemic at any given time. R = R 0 (1−P i ) where P i is the proportion of the population who is immune at that time [12]. As mentioned above, we focus our analysis on the first outbreak and start our discussion with data 2 from China 3 , shown in Fig. 1 . A fit of Eq. 1 reproduces well the empirical distributions for the cumulative and daily number cases given by Eq. 2. A similar procedure was applied to the data from a number of selected countries listed in Table I , including the city of New York in the USA, which will be further discussed in Section IV. The fit parameters are also given in Table I . . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted May 25, 2021. ; https://doi.org/10.1101/2021.05.23.21249471 doi: medRxiv preprint countries. Our interpretation is that the infection rate is progressively delayed further, as the pandemic progressed and the authorities implemented measures to confront and control it. Other parameters characterizing the time evolution are worth discussing. One follows from the first derivative of N (t) that has a maximum at its inflection point: N (t = t 0 ) = N max 2 which substituted in Eq. 2, gives the peak rate, P R(t 0 ) = N max 4∆T . Note that from the numbers in Table I and the relation γR 0 ≈ 4 P R(t 0 ) N max , we can estimate a value for γ ∼ 0.03/day. Measures of the early (asymptotic) behavior are the time, t 10 (t 90 ), when the number of cases reach 10% (90%) of the maximum, N 10 (N 90 ). It is simple to show that t 10 (t 90 ) ≈ t 0 − (+)2.2∆T . The parameter t 90 is also given in Table 1 and compared to that obtained from the empirical curves shown in the last three columns (emp). Deviations between the fits and the empirical data vary in the range of 1 to 20% for t 0 and t 90 , and from 1 to 5% for N 90 . The deviations on P R(t 0 ) are larger, 16-71%, likely due to systematic effects in the reporting of daily cases. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) It is more instructive to consider a linearized form of Eq. 1 and plot from which we can easily extract ∆T from the slope and t 0 from the intercept. The data are presented in this form in Fig. 2 and Fig. 3 for the countries in Table I . The F/D function suggests a universal behavior in terms of dimensionless (or reduced) variables. Introducing: all the data should lie in a common, universal curve. Remarkably this appears to be the case, as seen in Fig. 4 . In addition, in terms of these reduced variables Eq. 4 can be written as: thus, here, the universal curve is a straight line with slope -1 and 0 intercept. The data are presented in this linearized form in Fig. 5 . . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 25, 2021. ; https://doi.org/10.1101/2021.05.23.21249471 doi: medRxiv preprint . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. Although the dynamics of the pandemic seems to have a universal behavior, it is clear from Figs. 2-5 that the data for the different countries do not strictly align on a single straight-line. There are many factors affecting the number of daily infected cases. Among them, the most relevant are those closely related to the measures adopted by the authorities of the different countries in an effort to control the disease, to reduce the number of cumulative infected cases, and to "flatten the curve", keeping the P R below the available hospital resources at any given time to avoid overloading the system. Nevertheless, by discussing a few examples, we will show how we can still use the linearized reduced forms, to correlate the data and shed light on the dynamics of the epidemic. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. curve, meaning the epidemic was almost controlled. Among the countries selected in this study only China and Germany present this kind of behavior where the slope is ≈-1 and the intercept ≈ 0. We believe this is related to the fast measures taken by the chinese authorities to manage the spread of the disease (See Fig.1 in Ref. [13] ). In the case of Germany the adopted strategy of massive testing, social distancing, and isolation of infected people since the early beginnings, was very effective to control the pandemic [14]. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. such a change appears contradictory to the universality conjecture given that the diffuseness is determined by fitting the F/D function, and therefore should be unique. However, the linearized forms are very instructive in the sense that it allows to investigate in more detail the development of the evolution of the epidemics and how external factors might influence it. In what follows, we will show that the universal behavior can be recovered. One may argue that presenting the data in this form, provides a more sensitive way to highlight potential deviations due to limited reporting, incomplete statistics, etc. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. Finally, let us come back to the discussion about the loss of universality in our approach. To that effect, we consider a slightly different analysis of the data. From Fig. 7 the date at which the sudden change of slope occurs, t cross , is determined for each country. Now, each set can be separated into two sub-sets: the first one referred to as prior starting on Feb-20 to t cross , and the second one, post, from t cross to May-31, except for Brazil and Perú for which the analysis was extended to Sep-30 and Jul-20 respectively. Then, the same analytical F/D function is applied to each sub-set of data as presented in Figs. 8 and 9 . The analyses of both prior and post data sets show that the average straight lines have reduced slopes: 1.15 ± 0.23 (prior) and 1.11 ± 0.08 (post), and intercepts: −0.21 ± 0.48 (prior) and 0.08 ± 0.09 (post) very close to the universal values. Thus, it seems that the empirical data are amenable of a description in terms of the linearized equations discussed above and follow a universal pattern in both prior and post analyses. It emerges from the comparison of these two sets that the implemented pandemic policies, aimed at reducing the impact of the COVID-19, have been able to decrease the basic reproduction number R 0 by a factor of ≈ 3, as measured by the ratio of the prior and post slopes 1/∆T . Since R 0 ∼ 3, the reduction factor brings it down below the critical value of 1. This is shown, as an example, for the data reported in New York in Fig 10. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. Our objective here is to provide support of our simple phenomenological approach by comparing with the results of an established mathematical model, commonly used in epidemiological studies. In particular, we focus on the concept of the t cross value and the prior and post analyses that follow. In section I, we briefly presented the SIR model along with the system of differential equations governing the time evolution of the Susceptible, Infected and Recovered compartments. Ref. [16] presents a useful overview of this class of models, some of their theoretical properties and natural ways of extending them to more complex situations. Based on Refs. [16, 17] we thus modify the SIR model to account for individuals in the population that undergo different stages of the disease. We include an Exposed (E) compartment (those that have contracted the disease but cannot yet spread the virus), and two more compartments Critical (C) (those that need intensive care to model the overflow of Intensive Care Units (ICU) and Deaths(D) (cumulative deaths caused by the pandemic) and thus its name SEICRD. Further details are given in the Appendix. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 25, 2021. ; https://doi.org/10.1101/2021.05.23.21249471 doi: medRxiv preprint In section IV A, the time t cross was correlated to the implementation of policies that reduce R 0 . We introduce in the SEICRD model a function of the form: where the following desired properties are met: to simulate a change in the slope from R 0,start to R 0,end at t = t cross . The sharpness of the change is controlled by the parameter k. These additional parameters could either be fixed to suitable values, or fit to empirical data (See Appendix). Here we just present the case of New York City. The SEICRD simulated data are contrasted with the F/D analysis in Fig. 11 that shows a plot of the dimensionless variables ln((1 − N r )/N r ) vs. t r , to be compared with that of Fig. 10 . The similar behavior of the universal straight lines and relevant dates is remarkable, suggesting that in spite of its simplicity, the approach captures the main ingredients of a more sophisticated model. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 25, 2021. ; https://doi.org/10.1101/2021.05.23.21249471 doi: medRxiv preprint C. Extended analysis beyond May-31 Due to the large extent of the pandemic, it is important to demonstrate if our analysis could also be applied to the second outbreak already seen in most countries and, specifically, if the concept of universality discussed in Section IV is still valid. The following example describes the analysis applied to the data reported in the United Kingdom. An inspection of Fig. 12 a) allows us to identify two regions defined by a characteristic separation time t sep between the peaks of the outbreaks. Each full data set can be then separated into two subs-sets: the first one, referred as to Outbreak 1, from Feb-20 to Jul-18 (t sep for this case). The second subset, Outbreak 2, goes from Jul-19 until Nov-30. The same analytical F/D functions was then applied to each of the sub-sets giving the following parameters: N max 1 = 282292, t 0 1 = 65.6 days (with respect to Feb-20) and ∆T 1 = 13.9 days for set 1 and N max 2 = 1631357, t 0 2 = 109.3 days (with respect to Jul-18) and ∆T 2 = 16.7 days for set 2. The results are summarized in Fig. 12 b) the reduced straight lines have slope and intercept close to −1 and 0 the universal values. The prior and post behavior discussed in Section IV A is also present in both outbreaks. Perhaps not surprising, the example confirms that the universal behavior is observed for the two outbreaks and . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 25, 2021. ; https://doi.org/10.1101/2021.05.23.21249471 doi: medRxiv preprint anticipated to follow in subsequent ones. Of course, the same analysis can be easily applied to the extended data for other countries with similar results. As a final comment we note that the proposed formalism can be applied instead to the number of COVID-19 related deaths as a function of time, which could be a more robust indicator of the impact of the pandemic. We have presented a phenomenological analysis of the time evolution of COVID-19 in some selected countries, using an analytical function of the Fermi/Dirac (Wood/Saxon) type. In spite of its simplicity, it appears that the proposed approach describes the empirical data relatively well. The extracted characteristic parameters allow, especially in the early stages of the epidemic, to make reasonable estimates readily comparable with the facts, and without resorting to more complex epidemiological models. Notwithstanding, the formalism is supported by the solution of the SIS model, an approximation of the SIR model and its extensions. to correlate all the information in just one graph that reveals the universality of the phenomenon. Within our framework, we showed that the evolution of the epidemic can be easily tracked with changes in the values of the reduced slopes (prior and post analyses), reflecting measures and policies implemented to mitigate the spread of the disease. Although we focused our presentation on the first wave, we have shown that the phenomenology is also applicable to subsequent outbreaks. Finally, it did not escape us that it could be of interest to apply the formalism to the analysis of other viruses like influenza, which will be the topic of a follow-up work. We would like to thank Dr. J. Fernández Niello for his comments on the manuscript. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 25, 2021. ; Appendix: The extended SEICRD Model In this appendix we discuss in more detail the extended SEICRD model which was used in Section IV B to benchmark the phenomenological analysis. This model is a natural extension of the SIR and the SEIR models [16] . The extended SEICRD model (in short SEICRD model) is obtained by the addition of two extra compartments: Critical (C) and Death (D). The first one accounts for individuals that need intensive care during the course of the disease and allows us to model the overflow of Intensive Care Units (ICUs) and the impact on the fatality rates. Individuals can only enter this compartment from an infected (I) state, from which they follow the cycle to D or R. The second compartment will hold the cumulative deaths caused by the pandemic, with the assumption that only individuals from C can die. The system of differential equations readily follow: . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) We further consider two additional and relevant features: a) the modeling of limited resources for ICUs; and b) the modeling of a time dependent infection rate. The modeling of limited resources is done by a modification of the transition probabilities from C to D when the total number of patients in C is higher than the total number of ICUs available (N ICU ). When such conditions are met, we assume the transition probability between C and D is Finally, we introduce a time dependent infection rate into the model as a time dependent reproduction number R(t). Our goal here is to give a robust model to support our interpretation of the t cross value introduced in sub-sections IV A, IV B). Thus, we propose that R(t) is given by a function of the form: where the following desired properties are met: The sharpness of the change in slope, from R 0,start to R 0,end at t = t cross , is controlled by the parameter k. Following from the discussions above, the final system of differential equations is: R(t) = R 0,start − R 0,end 1 + exp(−k(t − t cross )) + R 0,end , λ(t) = R 0 (t)γ. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 25, 2021. At first glance, we have several parameters we have to deal with. Nonetheless, some of them that do not have a significant variation during the process of the pandemic can be fixed, while the rest have to be fitted to the available data. In the following list the whole set of parameters and how they are treated in the fitting procedure are given: The values for N ICU are not pandemic but country specific and were compiled from Refs. [18, 19] to get the most up-dated values. A Python-code [20] was developed to solve equations (A.4) and fit the results to the available cumulative deaths data reported for the different countries. While fitting the model to the available reported deaths, the obtained evolution of the infected compartment is likely to reflect the total numbers of cases more precisely than if derived from the positive tested ones. The fit results are summarized in Table II. We have discussed in Section IV A that the linearized form of Eq. 6, fitted to the SEICRD results for New York City, captures the main ingredients of the evolution of the pandemic as predicted by a more realistic, yet more complex, model. A similar conclusion can be drawn from the analysis . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) of the other countries. On the Quantization of the Monoatomic Ideal Gas, Rendiconti Lincei On the Theory of Quantum Mechanics Über den Zusammenhang des Abschlusses der Elektronengruppen im Atom mit der Komplexstruktur der Spektren Model studies on the COVID-19 pandemic in Sweden A contribution to the mathematical theory of epidemics Notes on R 0 The Mathematics of Infectious Diseases Estimating and Simulating a SIRD Model of COVID-19 for Many Countries, States and Cities International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity