key: cord-0823026-jxbw8wl2
authors: PRASAD, J.
title: A data first approach to modelling Covid-19
date: 2020-05-26
journal: nan
DOI: 10.1101/2020.05.22.20110171
sha: 073456afdd52e85655dc9a207c9c6448144f12ac
doc_id: 823026
cord_uid: jxbw8wl2

The primary data for Covid-19 pandemic is in the form of time series for the number of confirmed, recovered and dead cases. This data is updated every day and is available for most countries from multiple sources. In this work we present a two step procedure for model fitting to Covid-19 data. In the first step, time dependent transmission coefficients are constructed directly from the data and, in the second step, measures of those (minimum, maximum, mean, median etc.,) are used to set priors for fitting models to data. We call this approach a "data driven approach" or "data first approach". This scheme is complementary to Bayesian approach and can be used with or without that for parameter estimation. We use the procedure to fit a set of SIR and SIRD models, with time dependent contact rate, to Covid-19 data for a set of 45 most affected countries. We find that SIR and SIRD models with constant transmission coefficients cannot fit Covid-19 data for most countries (mainly because social distancing, lockdown etc., make those time dependent). We find that any time dependent contact rate, which falls gradually with time, can help to fit SIR and SIRD models for most of the countries. We also present constraints on transmission coefficients and basic reproduction number R0~ as well as effective reproduction number R(t). The main contributions of our work are as follows. (1) presenting a two step procedure for model fitting to Covid-19 data (2) constraining transmission coefficients as well as R0~ and R(t), for a set of most affected countries and (3) releasing a python package PyCov19 that can used to fit a set of compartmental models with time varying coefficients to Covid-19 data.

At present the world is going through an unprecedented crisis of pandemic Covid-19 caused by a novel form of coronavirus, named Sars-CoV-2 which was passed to the human from bats in the Wuhan city of China, some time in December 2019 [Org20a, Org20b, ea20h, ea20t, ea20v, ea20d, ea20r, ea20l] .

Till the middle of May 2020 the virus has reached almost all the parts of the world resulting in more than four million people infected and more than a quarter million deaths [Wor20] . The measures to contain the virus medically by developing a vaccine are going on war footing. However, the success is still expected to be a few years away [ea20f] . Till a fraction of the population develop (herd) immunity or the vaccine is ready, the only means to contain the pandemic are social measures (social distancing, contact tracing etc.,) and enhanced hygiene practices [ea06, ea20s, ea20p] .

Some of the most important problems related to Covid-19 research are (1) estimating the controlling parameters of the pandemic, (2) making short term predictions using mathematical-statistical modeling which can help in mitigating policies (3) simulating the growth of the epidemic by taking into account as many contributing effects as possible and (4) quantifying the impact of mitigation measures, such as lockdown etc [ea20j] .

Modeling Covid-19 pandemic with compartmental models of Kermack and McKendrick (for an introduction see [KR08, Li18, BC18] ) has been one of the most active problems in the recent times [ea20p, ea20a, ea20c, ea20e, FP20, ea20m, Oli20] . There have been alternative approaches also such as [RMI20] where statistical considerations are being taken into account for predictions. In one of the studies [FP20] it is argued that the data for the confirmed, recovered and dead, all three can easily fit a power law model with similar coefficients. The main attractive feature of these data driven approaches is that the complexity of the model being considered is determined by the data and not by theoretical expectations.

In the present work we follow a middle approach and fit two compartmental models, named SIR and SIRD with some modification, to the Covid-19 data. One of the main reasons to consider these models has been that the Covid-19 data is available only for the Susceptible, Infected, Recovered and Dead compartments (for the notations used here and other places in the present work see table (1)).

It may be true that a large fraction of the population which may be Exposed (defined later) play an important role in the dynamics of the pandemic however, it is hard to get reliable numbers for that.

Apart from that, a large number of undocumented cases [ea20l] may have significant influence on the spread of the pandemic.

A brief summary of the work presented here is as follows.

In §2 we give a brief introduction to the compartmental models and introduce the notations and variables used in the work. In particular, we discuss the SIR model in §2.1 and the SEIR and the SIRD models in §2.2 and §2.3 respectively. One of the major parts of the work presented here is to study the time dependence of the contact rate β, we introduce a set of parametric models of β(t) in §2.4. We discuss the time series data used in the study in the §3 by giving an example of Italy which is one of the most affected countries. The main results of our work are given in §4 and in §5. In §4 we discuss the reconstruction (regression) procedure for the set of transmission coefficients as well as for the effective reproduction (defined later), number R(t). Parameter estimation is discussed in §5.

The main conclusions of our work with a summary and some important points are discussed in §6.

McKendrick (see [NC08, KR08, Li18] for an introduction) is still the main framework most commonly used. The main idea of the Kermack and McKendrick's compartmental models is that every individual in a society belongs to one of the M compartments and the total number of individuals belonging to different compartments keep changing with time. The minimum value M can have is two, for the Susceptible-Infected-Susceptible (SIS) model, in which the recovery does not guarantee that one will not get the infection again [KR08] .

During an epidemic phase an individual can go through many stages from being perfectly healthy to the recovered one after an infection, with or without any immunity (short or long term) or may die. If we represent every stage with a compartment and keep the track of the number of individuals in each compartment then we can easily model the dynamics of the epidemic. This approach is very similar to the approach taken in astronomy where we count the number of stars in different stages of their life to understand the stellar evolution.

In principle we can have any number of logical compartments but in practice we should consider only those compartments for which we have the counts data, in particular for model fitting. Taking into account the fact that we have data only for the number of confirmed, recovered and dead population, the only compartmental model that meets the requirement is the SIRD model. If we consider the recovered and dead together we get the SIR model as is discussed in the next section.

One of the important compartments that also is commonly considered is the 'exposed' one and represents the population which have received the infection but cannot pass to others, before a certain period called the incubation period. If we consider exposed population also then we get the SEIR model that also is discussed below. Three compartmental models SIR, SIRD, and SEIR are shown in the (a), (b) and (c) panels of Figure (1) respectively (for more detail one can refer to [KR08, Het00, ea97, Oli20] ).

If we identify the compartments with the nodes of a graph then the transmission between different compartments, as is represented by a set of coefficients, can be considered the edges of the graph. Some of the nodes may have multiple edges and some of the edges could be bi-directional also. The main challenge of the modeling a pandemic like Covid-19 is not the scarcity of mathematical models but it is of the reliable data for the compartments being considered. considered. If we consider these compartments as nodes of a graph then there are transmission coefficients for every connecting edge that determine how effective that edge is in changing the population of the connected compartments. In (a) and (c) representing SIR and SEIR models, the compartments are connected in a linear way, however, for the case (b), representing the SIRD model, there is a branching also. Since the total population must remain a constant so the rates of change along all the connecting edges must add to zero.

Notation Description 

Here β and γ are the transmission coefficients, also called the contact rate and the recovery removal rate respectively, and 1/β and 1/γ represent the mean duration of infectiousness and the average period of infectivity (see [Het00, ea03a] ), respectively.

In general there is some time lag between acquiring an infection and becoming infectious. However, in the SIR model it is ignored and an assumption is made that individuals become infectious immediately upon getting an infection. This is a very strong assumption and the main reasons for 4 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020 . . https://doi.org/10.1101 making this is that we do not know reliably how many people are actually 'exposed', or have the virus but are still not infectious (cannot pass it to others). One of the ways to address this problem could be contact tracing and assuming that anyone who has come into contact with an infected person is an exposed one. However, this assumption is as strong as the assumption made in the SIR model.

If we do not consider the birth, death and movement of people then the following condition must be satisfied.

here S, I and R is the population of the S, I and R compartments respectively.

In equation (1) the transmission coefficient β is one of the most important parameters of the epidemic dynamics and can be written as the product of the contact rates (the average number of contacts per person per time) and the transmission probability (the probability of disease transmission on contact between a susceptible and an infectious person). As has been mentioned that the transmission coefficient γ can be identified with the recovery rate which is nothing but the inverse of the infectious period (during which an infected person can pass the virus to other healthy people).

In general, the equations (1) is solved with the following initial conditions:

The second equation from 1 can be written as:

and for S/N > γ/β we get a positive infection rate.

Here we define one of the most important parameters of an epidemic in terms of the ratio β/γ, called the basic reproduction number R 0 , when considered a constant, and called the effective reproduction number R(t), when considered a function of time. The most common definition [KR08] of it is that it is the average number of secondary cases arising from an average primary case in an entirely susceptible population. Note that in the text we may also use just "reproduction number" and the meaning of it will depend on the context. Some of the studies such as [ea20a] call R 0 and R(t)both as basic reproduction number, however, we follow the convention used in [ea03b, ea03a, Cob20] .

The basis reproduction number R 0 is the main measure which quantifies the transmissibility of the virus and R 0 > 1, sets a chain of transmissions leading an exponential growth of the pandemic. We can keep R 0 < 1, by minimizing the contact rates (social distancing etc.,), lowering the infectiousness of the infected people (by treating them or putting them in a quarantine etc.) and reducing the susceptibility of the healthy people by vaccination etc., (for detail see [ea05] ).

The SIR model is one of the most basic models and can be easily generalized by one or more of the following ways:

1. Adding more compartments: Depending on the type of pandemic and other details we are interested in we can add more compartments to the SIR model. These compartments can fit in between the existing ones (for example as shown in Figure( 1) (c) for SEIR case) or can branch out from the existing once (as shown in Figure(1 6 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. 

If we relax the assumption that the people who get the infection become infectious instantly and consider a latent period to the the onset of infectiousness there is a fraction of population (compartment) which has been exposed to the virus but will become infectious only after some latent period 1/σ, then the model is called Susceptible-Exposed-Infected-Recovered (SEIR) model represented by the following set of equations [KR08] 

Note that if we combine the second and third equation above we get:

From the above equation we can see that population in the E and I compartments together can grow with time only when the fraction of the susceptible population is greater than the inverse of the reproduction number :

There are many forms of SEIR equations which are in common use (see [BC18, ea20p, ea20u, ea20b, P.20, ea20n] ) however, equation (5) is the simplest one and does not include natural deaths.

One of the common practices with the SEIR model has been to consider the incubation period 1/σ a constant, and estimates it from some other observations. The SEIR modal is quite complex as compared to the SIR model and we cannot find the number of exposed people exactly at time t = 0 for evolving the equations and so the approach used to define R no longer works. Thanks to the new generation matrix models [BC18] it is still possible to write R in a close form for this case also.

One of the serious drawbacks of the SIR model is that people who recover and who die are treated in the same way -there are no separate compartments for the dead and recovered people. This drawback can be addressed by separating the compartments for the dead and recovered population as is done in the SIRD model described with the following set of equations (for a detail discussion see [ea20a, Vil20] ).

7 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020. . https://doi.org/10.1101/2020.05.22.20110171 doi: medRxiv preprint

Here a new transmission coefficient δ has been introduced which we can identify with the death rate. One of the advantages of the SIRD model is that it has three transmission coefficients β, γ and δ and we have the data for three time series I(t), R(t) and D(t) available so it is possible to compute the time dependency of all the three coefficients as well as the reproduction number R.

The aim of any mitigation measures may be one or more of the followings:

1. Lower the contact or infection rate β.

2. Lower the mortality rate δ.

3. Increase the recovery rate γ.

The SIRD model provides us a framework to estimate or fit all these parameters. In one of the coming sections we will discuss how we can reconstruct the transmission coefficients β, δ and γ as well as R from the data by a direct reconstruction approach.

The basic reproduction rate for the SIRD model can be written in the following way [ea20a] :

or, 1

where R γ = β/γ and R δ = β/δ. If apart from death and recovery there is some other channel that can lower the population in the I compartment, for example if infected people move out from that region with transmission coefficient η then we can write:

with R η = β/η. A more realistic model will have multiple compartments (nodes), either connected in series or some branching out from others, with data to constrain the transmission coefficients (edges).

Apart from this, realistic models may also require to consider different transmission coefficients of different subgroups (based on age etc.,). Incorporating, all these considerations will lead to very complex models having very less connect with the actual data we have.

8 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020. . https://doi.org/10.1101/2020.05.22.20110171 doi: medRxiv preprint 2.4 Time dependent β models As a pandemic triggers various containment measures [ea20s, DG20, ea20k, TS20, RR20, ea20c] such as lockdown, social distancing, improved hygiene practices etc., are taken and that lead to transmission coefficients such as β becoming time dependent [GG20, ea20m, FP20, ea20e, ea20i] . Apart from this, the drop in the susceptible population also decreases β (see [ea03a, Cob20] ).

Lockdown has been one of the most common mitigation measures followed all over the world and, in its extreme form, we can assume that once it starts the contact rate between susceptible and infectious people drops to zero. In general, the lockdown starts on a fixed day t l and has a duration (time scale) we call τ (we will be using both τ and corresponding decay rate µ = 1/τ in the discussion). We can incorporate these two parameters into the modeling of β(t) in many different ways and a set of three common choices is given below:

This model is discussed here just for an example and we do not expect the variation of β(t) as slow as linear one. This expression shows that β(t) starts with an initial value β 0 and after time t l it starts decreasing linearly with a constant rate of µ = 1/τ and finally becomes β 0 (1 − µ) at t = ∞. All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020. 

This form of suppression of β(t) starts with a constant value β 0 at some t = t l and keeps decaying for period represented by τ and finally settles to a final value β 0 (1 − α) as is shown in Figure ( 3).

This can be written in the following way also:

From equation (13) we can also write :

Equations (14) and equation (15) are important to find the priors for α and µ once we know the priors for β and this will be discussed again in §4 and will be used in parameter estimation in §5.

3. Exponential Suppression [FP20, ea20o, ea20n] :

This model is similar to the tanh model and in this case also β(t) starts from some initial value β 0 and after decreasing for a period and finally approaches to a constant value β 0 α at t = ∞ as 10 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020. . https://doi.org/10.1101/2020.05.22.20110171 doi: medRxiv preprint is shown in Figure (4) . Note that the transmission coefficient β may decay with time without any intervention also as is discussed in [ea97] for plants.

In this case also we can write :

and

These equations also will be used to find the priors for parameter estimation.

In one of the studies [ea20o] it has been argued that even the time of recovery 1/γ may also vary with time due to the improvement in medical understanding of the epidemic and facilities and that also can be modeled as an exponential function. There have been other physically well motivated exponentially decaying forms also such as given in [Vil20] in which β starts from starting value β 0 and decay with rate 1/τ finally becomes β 1 .

The author argues that β 1 depends on the policy decisions leading to behavioral changes. This model is different from the model we are considering only in the respect that it considers the "lockdown" from the beginning i.e., t = 0.

The time dependent β models as are discussed above and shown in Figure (5) share a common property that before a certain time t l , that we can identify with the day on which lockdown starts, β has a constant value β 0 and after that it starts decreasing with a rate that depend on the parameter µ = 1/τ . The effect of the suppression in β is controlled by the parameter µ and for its zeros values all the models become constant β models. From Figure (5) we can conclude that different models can lead to the same amounts of "flattening" of the curve with a different choice of parameters so there is no preferred model for the suppression.

The SIR model with constant transmission coefficients is applicable only in the situation when the pandemic is let to grow without any intervention. In the real world once a pandemic starts interventions of different kinds (social, medical etc.,) are considered to reduce the rate at which the the epidemic spreads. These interventions can be easily taken into account by considering a time dependent (decaying) growth rate (β). As we can see from the above figure that a decaying (exponentially) β helps to contain the disease by lowering the height of the peak as shown in Figure (5).

The primary data for Covid-19 is in terms of three times series for the count of confirmed C(t) , recovered R(t) and dead D(t), persons for every country. By definition all the three times series 11 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020. . There are many factors, known and unknown, which determine the behavior of these time series, 12 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020. The time series I(t) for the population in compartment I can be obtained by subtracting R and D from C.

For a set of 45 countries the time series of I(t) are shown in Figure ( (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. 16 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020. . https://doi.org/10.1101/2020.05.22.20110171 doi: medRxiv preprint δ, therefore it is a good measure which we can fit to a compartmental model, such as SIR or SIRD and can get constraints.

In this and next section we present the main results of the study in the form of demonstrating a reconstruction procedure for the time dependent transmission coefficients β(t), γ(t), δ(t) and the effective reproduction number R(t). We consider an example of Italy and India for this procedure.

Note that this approach is common and can be used to understand the variation of the transmission coefficients with time as a result of interventions. The main advantages of this approach is that there are no parameters to adjust and so the results are easy to reproduce.

The approach we use here is similar to as used in [ea20e, GG20] . In this approach the evolution equations are written in a discretized form as shown in equation (21).

From the third equation we can write:

and using this and second equation from (21) we get,

Note that by definition R t+1 ≥ R t , so γ(t) ≥ 0, however, we may have I t+1 ≤ I t also, β(t) may become negative also once the population in the compartment I starts decreasing.

Here an important assumption is being made and that is the fraction of susceptible population S/N is close to unity which may be true at the beginning of the epidemic. Once we have expressions for the time dependent β and γ we can also written an expression for the time dependent reproduction number in the following way:

Following the similar procedure we can write the SIRD equations in the following discretized form: From these questions we can write :

and can write the expression for the reproduction number:

18 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. where ∆X t = X t+1 − X t with X = I, R and D. This equation is identical to equation (24) if we do not count dead and recovered separately i.e., replace ∆R t + ∆D t with ∆R t . One of the interpretations of R is that it is a ratio of two rates and so in case we are interested finding out two separates measures for γ and δ, we can also write :

and so, 1

The procedure as discussed above can be used to know the variation of the transmission coefficients β, γ, δ and effective reproduction R(t)with time. In order to follow this procedure we need to abandon first few data points which have very high noise. As explained above occasionally we may also have negative values of R(t).

In figure (9) we show the reconstruction for β(t), γ(t) and R(t)for Italy with SIR model and in figure (10) the same is shown for β(t), γ(t) and δ(t) in case of SIRD model. We show R(t)as well as R γ (t) and R δ (t) for Italy in the case of SIRD model in figure (11). Similar figure are also shown for India in figure (12), (13) and (14). In all the figures the vertical red dashed lines mark the date of lockdown.

One of the main attractive feature of the split of R for δ and γ is that it is sensitive to individual values of γ and δ and not just their sum γ + δ. For the case when γ = δ we get the usual reproduction 19 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. 

One of the import uses of the reconstruction procedure we have discussed here is to find the priors (minimum, maximum and best fit) values for the parameters to be fitted. Once we have the estimates for β(t), γ(t) and δ(t) from the above procedure we can easily find X min , X max , X 0 , values (with X = β, γ, δ. Here, X 0 is the approximate point for the parameter that is needed in many optimization procedure which iteratively find the solution. Since in the present work we use parametric form of β(t), so we need priors for the parameters of β(t) i.e., β 1 , α, µ and τ which can find from the reconstructed β(t) (see §2.4 for detail).

We consider a set of six compartmental models, three belonging to the SIR and three to the SIRD class. The models are different from each other in terms of the choice for the epidemiological class (SIR or SIRD) or the model for the contact rate β(t) (see §2.4 for detail). A summary of the models is given in Table (3). 20 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020. . 

Parameters Note that in the model (5) and (6), β(t) starts decaying from the very beginning (in place of starting from a particular day representing the date of the lockdown) with a constant rate µ.

In any fitting procedure the choice of the loss function depends on what we wish to fit. In the common least square fitting we use the sum of the squares of the offsets as the loss function. However, there is a problem here with the data we have for that choice. The time series we wish to fit have small values at the beginning and very large values at the later stage, so the the fitting is biased towards the points which have large values. One of the solutions for this could be to fit the log of the time series but then the fitting becomes biased toward small values, in the beginning (or later stage when 22 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020. We decide to use the loss function of the ordinary least square which fits the data points close to the peak (having higher values) more accurately than other data points. We found this useful for the following two reasons:

1. The peak in the time series is an important feature, in particular its location and height, therefore any loss function biased towards it is justified.

2. For the short term predictions only the data points close to the dates of prediction is important, so using a loss function that fits later points (having higher values) more accurately than the noisy data points in the beginning is favorable.

The loss function which we used for fitting the data to SIR and SIR models is given below.

The variables used in the above equations are defined in table (4):

For SIR and SIRD models we fit multiple time series together so we must weight the sum of the squares of the offsets for different time series since they have very different values -the value of I(t)

is generally few orders of magnitude higher than R(t) and D(t). We use the following weights for this 23 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020. 

whereX l is the average of the time series X(t).

We use the solve ivp and minimize modules from the Scipy [Sci20] for integrating the differential equations and minimize the cost function respectively. The loss function given by equation (30) represents the Root Mean Square Deviation (RMSD) and we use its final value as a measure of the goodness of the fit, and is shown by the points of different colors (for different models) in the Figure (15) . From this figure we can see that it is hard to conclude that which of the models fit the data best (has always lowest RMSD). We can also notice from the figure that the model (6) is less sensitive towards the choice of a country for fitting the model (has the smallest fluctuations).

A list of fitting parameter for the different models is given in Table ( 3). For the SIR class of models, model (1) and (2) we have five fitting parameters named, γ, β 0 , α, µ, t l and for the SIRD models, model (3) and (4) we have six fitting parameters named γ, β 0 , α, µ, t l and δ. As we can notice that for model (1) to model (4) four of the parameters are associated with β(t) and for model (5) and (6) the variation of β(t) is controlled by just two parameter -β 0 , the initial value of β and its decay rate µ = 1/τ .

The best fit values of the fitting parameters with their 90 % CI (standard deviation) as well as the median values are given Table (5) . The tables also give the estimate for the effective reproduction number R(t)which is a derived quantity here. Note that for computing R 0 we have extrapolated the value of R(t)to the last date for which the data is being used here.

A histogram of the effective reproduction number for the different models being considered is shown in Figure (16) and detail values of that for different countries, which include the average values as well as 90% CI (StdDev), are given in Table (6). 24 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020 . . https://doi.org/10.1101 25 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020. . 26 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. 28 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020 . . https://doi.org/10.1101 Covid-19 is a global crisis and understanding its impact on different systems of the modern human life (medical, social, economical etc.,) and the responses presented is an important exercise to carry out.

We understand that despite being a global phenomenon, the impact of Covid-19 in terms of the loss to life and the resourced being exhausted depend on the local conditions as well as on the mitigation measures taken locally. However, we believe that the global picture of the crisis does help to plan and take policy decisions at the local scale also.

Full understanding of any pandemic, in particular like Covid-19 which does not have any other examples in the history (in terms of the scale and impact), may become available only when it is over and the facts and figure presented here may have very short life. However, we still believe that any quick timely insight may help a lot in terms of the planning for the worse. Knowing very well that all the mathematical models are wrong but some are useful, we believe that mathematical models which are presented in this work may help to develop some insight about the crisis. A brief summary of the work presented here is as follows.

In §1 we have given a very brief introduction of the problem being addressed and reviewed some of the key works about Covid-19 which motivated the present work. A brief introduction of the mathematical framework used in the work in §2, in particular we have review a set of compartmental models SIR, SEIR and SIRD in §2. We have also discussed a set of of parametric models for one of the transmission coefficients β(t), in §2.4. We have discussed the data being used in the work in §3.

The main results of the present work are discussed in §4 and §5. In §4 we have reviewed a reconstruction procedure for the transmission coefficients and basic reproduction number R(t). This procedure does not depend on the choice of any parameter and can be easily generalized for other similar models also. We have presented the best-fit values of the parameters with their 90% CI, §5 in the form a set of tables. We have presented the values of the parameters in the following two forms:

1. Model based 2. Country based All the fitting parameters for the models being considered are summarized in Table ( ??). The country based parameters are given in terms of a set of two tables (6) where every row represents a country. From the country based table we can see that the estimate are quite consistent -do not change much from one model to another. In place of presenting all the parameters for countries we present only the basic reproduction number corresponding to different models.

We present Table ( 2) which in the appendix which have some basic information about the countries being considered for the modeling. In order to see how different models fits the data we have given the plots for all the six models for a set of countries in the figures from (??).

29 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020 . . https://doi.org/10.1101 The work we presented here assumes that spreading of a pandemic like Covid-19 happens homogeneously in space and time, however, we know that it is far from true. As the experience [ea03b] shows that "super-spread" events (SSEs) or rare events where, one particular infectious person interacts with a very large number of susceptible people over a short period of time have the maximum impact. In these situation the average measures like R 0 are not very informative. In the present work we data for a set of countries to constrain the parameters of the SIR and SIRD model one similar exercise with SIRD model for India is done in [ea20q] .

All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted May 26, 2020 . . https://doi.org/10.1101 

An introduction to compartmental modeling for the budding infectious disease modeler

Modeling infectious disease dynamics

Early forecasts of the evolution of the covid-19 outbreaks and quantitative assessment of the effectiveness of countering measures

Analysis and fitting of an sir model with host response to infection load for a plant disease

Transmission dynamics and control of severe acute respiratory syndrome

Transmission dynamics of the etiological agent of sars in hong kong: Impact of public health interventions

Strategies for containing an emerging influenza pandemic in southeast asia

Strategies for mitigating an influenza pandemic

Data-based analysis, modelling and forecasting of the covid-19 outbreak

Accounting for symptomatic and asymptomatic in a seir-type model of covid-19

Assessing the efficiency of different control strategies for the coronavirus (covid-19) epidemic. ArXiv e-prints

Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in wuhan, china: a descriptive study

A time-dependent sir model for covid-19 with undetectable infected persons

A strategic approach to covid-19 vaccine r&d

An interactive web-based dashboard to track covid-19 in real time

The species severe acute respiratory syndrome-related coronavirus: classifying 2019-ncov and naming it sars-cov-2

Monitoring the spread of covid-19 by estimating reproduction numbers over time

Inferring change points in the spread of covid-19 reveals the effectiveness of interventions

Early dynamics of transmission and control of covid-19: a mathematical modelling study

Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (sars-cov-2)

Preliminary analysis of covid-19 spread in italy with an adaptive seird model

A modified seir model to predict the covid-19 outbreak in spain and italy: simulating control scenarios and multi-scale epidemics

A data driven analysis and forecast of an seiard epidemic model for covid-19 in mexico

The effect of control strategies to reduce social mixing on outcomes of the covid-19 epidemic in wuhan, china: a modelling study

Studying the progress of covid-19 outbreak in india using sird model. medRxiv

Covid-19: Epidemiology, evolution, and cross-disciplinary perspectives

The global impact of covid-19 and strategies for mitigation and suppression. Imperial College COVID-19 Response Team

A new coronavirus associated with human respiratory disease in china

Modified seir and ai prediction of the epidemics trend of covid-19 in china under public health interventions

A pneumonia outbreak associated with a new coronavirus of probable bat origin

Analysis and forecast of covid-19 spreading in china, italy and france

Mapping 2019-ncov

Novel coronavirus (covid-19) cases

Extracting the effective contact rate of covid-19 pandemic

The mathematics of infectious diseases

Data on covid-19 (coronavirus) confirmed cases, deaths, and tests

Space-time dependence of corona virus (covid-19) outbreak. ArXiv e-prints

Modeling Infectious Diseases in Humans and Animals

An Introduction to Mathematical Modeling of Infectious Diseases

Mathematical models of infectious disease transmission

Refined compartmental models, asymptomatic carriers and covid-19

Novel coronavirus (2019-ncov) situation report -1

Report of the who-china joint mission on coronavirus disease

A time-dependent seir model to analyse the evolution of the sars-covid-2 epidemic outbreak in portugal

A python package for fitting covid-19 data

Data-driven modeling reveals a universal dynamic underlying the covid-19 pandemic under social distancing

Age-structured impact of social distancing on the covid-19 epidemic in india

Scientific computing tools for python

Sk Shahid Nadim. Assessment of 21 days lockdown effect in some states and overall india: A predictive mathematical study on covid-19 outbreak

Estimating and simulating a sird model of covid-19 for many countries, states, and cities

Covid-19 coronavirus pandemic

The author would like to thank Dr. Gaurav Goswami for comments and feedback. At present the author works as an independent researcher and data scientist and the work presented here is not supported by any public or private agency. The author will be thankful to any agency, individual or individuals who come forward to sponsor/support this and other similar works on Covid-19.