key: cord-0658669-454i5g7r authors: Fu, Yuting; Xiang, Haitao; Jin, Hanqing; Wang, Ning title: Optimal Lockdown Policy for Covid-19: A Modelling Study date: 2021-01-31 journal: nan DOI: nan sha: 3f173496920f178c5ef21c1eeeab74374e9a7603 doc_id: 658669 cord_uid: 454i5g7r As the COVID19 spreads across the world, prevention measures are becoming the essential weapons to combat the pandemic in the period of crisis. The lockdown measure is the most controversial one as it imposes an overwhelming impact on our economy and society. Especially when and how to enforce the lockdown measures are the most challenging questions considering both economic and epidemiological costs. In this paper, we extend the classic SIR model to find optimal decision making to balance between economy and people's health during the outbreak of COVID-19. In our model, we intend to solve a two phases optimization problem: policymakers control the lockdown rate to maximize the overall welfare of the society; people in different health statuses take different decisions on their working hours and consumption to maximize their utility. We develop a novel method to estimate parameters for the model through various additional sources of data. We use the Cournot equilibrium to model people's behavior and also consider the cost of death in order to leverage between economic and epidemic costs. The analysis of simulation results provides scientific suggestions for policymakers to make critical decisions on when to start the lockdown and how strong it should be during the whole period of the outbreak. Although the model is originally proposed for the COVID19 pandemic, it can be generalized to address similar problems to control the outbreak of other infectious diseases with lockdown measures. As one of the most devastating pandemics in human history, the current outbreak of COVID-19 has already caused more than one million deaths around the globe. Numerous prevention measures have been studied by [11] and [24] in order to control the spread of the virus by the governments. For example, medical measures, such as research on testing, medicine, and vaccine, are accelerated; relatively easy measures, like face masking and social distancing, are also widely accepted and applied. Essentially the most effective prevention of COVID-19 is the lockdown measure which completely ceases the movement of the human being and thus slows down the spread of disease. However, the lockdown measure is incredibly controversial as it imposes a tremendous impact on our society and economy. Hence it might be the most difficult decision to be made by the governments. Especially when and how to impose the lockdown measure is one of the most challenging questions for both politicians and scientists. To address this question, there is a need to develop a mathematical model combining both epidemiology and economics. Epidemiological models have been widely studied to analyse the dynamics of the pandemic ( [21] [22] [25] [26] ). However, there is less discussion on how the lockdown policies can influence the economic decisions of people and the spread of disease and how can policymakers make optimal policy in the epidemic. [10] and [9] analyse the government intervention using epidemiological models with exogenous parameters and evaluate the effect of the intervention by simulation results. Some recent papers focus on analysis of optimal policy and policy effect in the framework of the SIR model or its variants. They studied the effect of different measures including fiscal policy ( [7] , [6] , [15] ), testing and quarantine ( [23] , [4] , [18] , [2] ), intervention policy on multi-aged groups ( [5] , [1] , [12] ), social distancing ( [19] , [8] , [14] ) and lockdown control ( [3] , [13] , [1] ). In previous works, [3] studied the optimal lockdown policy that minimises the value of fatalities and the output costs of the lockdown policy by locking down part of the susceptible and Infectious population, [1] researched the optimal lockdown policies on people of different age groups, and [13] maximise the economic activity level with the burden of the health-care system. We extend the classic SIR model ( [16] , [20] ) and incorporate an equilibrium framework to study the optimal lockdown policy during the pandemic period. What we innovate from previous works is that they all only took the governments' perspective but did not take people's own reaction to the pandemic and the government policy into consideration, while we adopt the extension to the SIR model from [7] by involving people's economic decision making (consumption and working hours) and embed the SIR model in a simple Cournot equilibrium framework to model people's reaction to each other. Different from [7] that studied the optimal containment policy by controlling the tax rate, we control the level of lockdown, which is more direct and effective for the governments, especially in the early stage of the pandemic. Furthermore, we emphases the cost of death in our model objective of policymakers, which is an important factor in real-world government decision making. Using this method, we can enable the lockdown policy to identify a balance between the impact of the epidemics on the economy and people's health. The motivation for this work is to address the following questions that the policymakers may face in reality. The main findings are shown as the short answer to these questions. • What difference does the optimal control make on the economic and health outcome of the epidemic compare to no control? We find that optimal lockdown measures could significantly reduce the deaths and infections caused by the epidemics. Although there is a short-term recession with lockdown control, it has better long-term economic outcomes than doing no control. • How does the timing of starting and ending affect the optimal lockdown control itself as well as its economic and health consequences? Our results suggest that both the timing of starting and ending the lockdown control policy makes a difference in terms of both the economic and epidemic outcomes. It is best to start the control as early as possible, and it is more important to avoid ending the control too early. • How does the cost of death affect the lockdown control policy and the outcomes? Whether policymakers regard the deaths as a negative influence on society lead to different results. Regarding deaths as negative results in stricter lockdown control policy which leads to a much better epidemic and slightly worse economic outcomes. • What if policymakers have additional information on people's health status? Additional information about the health status of people is beneficial, as the optimal separate control on people in different health status will reach much better economic and epidemic outcomes. This paper continues as follows, in section 2, we describe the SIR-Lockdown Model and analyse its properties. In section 3, we discuss the parameter estimation of the model. We present the numerical results of the SIR-Lockdown model in section 4. Section 5 makes conclusions. In this section, we first describe the extension to the canonical SIR model. Then analyse the behaviour of susceptible, infectious, and recovered people in regard to their decisions on consumption and working hours under lockdown regulator and formulate the optimal control problem. Finally, we add the cost of death in our model objective. As shown in the classic SIR model ( [20] [17]), we classify people into three categories according to [16] : • Infectious (I) are those who are tested positive to the virus; • Recovered (R) are those who have been tested positive to the virus and now recovered; • Susceptible (S) are those who have not been tested positive to the virus. We assume that all susceptible people are subjects to be infected with some possibility in direct contact with infectious people, and infectious people will recover with a constant probability of π r or become dead with another constant probability π d . Our extension is on the infection. All infection happens via direct contact between susceptible people and infected ones into three types of activities: purchasing and/or consumption of goods and services, working with other people, and other daily activities. A Lockdown policy can be applied to control the working contact, hence change the income flow, which indirectly imposes constraints on the purchasing and consumption. We use the following equation (1) (2) (3) (4) (5) to describe our extended SIR model for the transition among Susceptible, Infected, Recover, and the death outcome. In this system of equations, S t , I t , R t and D t represents the number of people in categories of Susceptible, Infectious, Recovery and Death respectively at time t. We use (C s t , N s t ) to model the (average) consumption behaviour and working hours of susceptible people, (C i t , N i t ) to model the (average) consumption behaviour and working hours of infectious people, and (C r t , N r t ) to model the (average) consumption behaviour and working hours of a recovered people. T t in equation (1)is the number of newly infectious people in the time period t to t + 1 and the three terms in the right-hand side of this equation are used to describe the infection by the three different contact between susceptible people and infectious people via consumption, working, and other types of contact. We use several constant parameters to describe the transition rate between different categories. π s1 reflects the transition rate for a susceptible people get infected by infectious people from direct contact via purchasing/consuming. Similarly, π s2 reflects the transition rate from direct contact via working, and π s3 reflects the transition rate from other contacts. Denote ∆Y t = Y t+1 − Y t for Y = S, I, R, then the dynamics of the SIR model is We use vectors and matrices to simplify our presentation. then the system can be described as We study the rational behaviour of all people who maximise their own welfare by choosing proper consumption and working hours like in a normal time, i.e., the virus does not change people's rationality and preference. Also, we use the following utility function to model the utility from consumption and working of an individual, where c is the consumption, and n is the working hours. The first term measures the utility from consumption, and the second term measures the utility from working. Denote by A the average wage per hour of a person, hence the labor income of an individual, with working hour n is A * n, which will be the upper bound of the consumption, i.e. An ≥ c. Denote by n 0 the full working hours in a unit time before the spread of the virus, which is officially guided by the government. It is natural that n 0 is set optimally for the society, and the optimality brings some information of the parameter θ 0 . If a person follows the full working hours n 0 optimally, then her labor income will be An 0 . Since the utility function is strictly increasing in the consumption, all labor income should be consumed up, hence the optimal consumption c 0 should be Then by the optimality of n 0 , we have ∂u(c 0 ,n 0 ) ∂n = 1 n 0 − θn 0 = 0, by which we will choose θ by The total utility of a flow of consumption and working hours {(c τ , n τ )} τ =t,··· ,T is defined by To contain the spreading of the virus, governments need to apply a lockdown policy to reduce direct contacts between people, which will impose stricter constraints on their behaviour. In this paper, we study the lockdown policy by a constraint on the ratio L ∈ [0, 1] of the working hour in the full working capacity, i.e., given the full working hours n 0 , the maximal working hour cannot exceed n 0 * L. We suppose the government cannot easily identify individuals into their categories so that the lockdown constraint on the working hours is the same for all people. We formulate the decision making problem for each category with a given lockdown policy L · , and then study the lockdown policy-making problem for the government. Suppose the lockdown measure L t ∈ [0, 1] is given for any time t. A recovered people aims at maximising his total utility with the constraint c r τ ≤ An r τ and n r τ ≤ n 0 L τ . Theorem 2.1. At time t with state X t and the lockdown policy Proof. Since ∂J r (c r ,n r ;t) .., T . Denote f (c r , n r , λ r n ; t) = J r (c r , n r ; t) + T τ =t λ r nτ (n 0 L τ − n r τ ). Then by KKT condition, ∀τ = t, ..., T , ∂f (c r * , n r , λ r n ; t) ∂n r Thus n r * τ = n 0 L τ , c r * τ = An 0 L τ ∀τ = t, ..., T Notice that the behaviour of recovered people (c r · , n r · ) plays no role in the spread of the virus, hence the behaviour of recovered people will not affect people in other categories. This is why we start to form this easy-to-handle category. Similar to the case of recovered people, infectious people also need to choose their optimal consumption and working hours {(c s t , n s t )} t=0,1,··· ,T to maximise their total utility from consumption and working hour, subject to the constraint that the consumption c s t cannot exceed the labour income for the working hour n t , and n t must be no more than the lockdown policy n 0 * L t . The labor income of an infectious people is different from other categories. Because they are infected, their health condition is usually worse than other people. So we introduce a constant φ to discount their working efficiency, and the labor income from n t working hour will be A * φ * n. Furthermore, since an infectious people will have a constant probability π r to recover and suffer a possibility π d of death, we need to calculate the distribution over all categories at a future time. For an infectious people at time t, he has the probability π r to recover in the next unit time, π d to die, and the rest probability 1 − π r − π d to stay in the infected category. By this evolution, we can get the conditional probabilities for his health state at a future time τ > t. Denote by p i,i (t, τ ) the probability for him being still infected, p i,r (t, τ ) the probability being recovered, and p i,d (t, τ ) the probability of being dead. Then we can deduce that If he recovered, he should behave optimally as a recovered people, while if death has happened unfortunately, we cease the accumulation of any utility. So, for a given flow (c i · , n i · ) of consumption and working hours taken by the infectious people from time t, the accumulated utility he can get will be where (c r * , n r * ) is the optimal behaviour of a recovered people determined in the previous case, and p i,i , p i,r are as defined in equation (12, 13) . Given the lockdown policy L · , the optimal (c r , n r ) is Different from the infectious case, the behaviour of an infectious people (c i , n i ) is involved in our extend SIR model for the spreading of the virus, hence they will make the decision problem for susceptible people much harder. The decision planning for a susceptible people from time t is much more complicated if we consider the possibilities for this people to turn into infectious, recovered, and death at different future time spots. We avoid the complexity by taking advantage of the optimal value function for an infected, and model the objective function of a susceptible people recursively. As for the previous two categories, we start from time t and pick up a susceptible person. Denote the state of the SIR model at the starting time as X t , and the lockdown policy is fully given as L · ). Suppose he will follow a given flow of consumption and working hours (c s τ , n s τ ) τ =t,t+1,··· ,T before being infected, and then follow the optimal behaviour after been infected, i.e., his consumption and working hours after infected will switch to the optimal control for an infected person from the infection time. We denote his objective value as where τ t = π s1 n 0 AφI t L t c s t + π s2 n 0 I t L t n s t + π s3 I t is the probability of a susceptible person to be infected in the next unit time, J i * (t + 1, L · ) is the optimal objective value achievable for an infected person starting from time t + 1, and X t+1 is the SIR state at time t + 1 resulted by people's behaviour (c s t , n s t , c i * t , n i * t , c r * t , n r * t ) and the time t state X t . Now it is natural that we aim at maximising the objective J s (c s · , n s · ; t, X t , L · ) over feasible control flow (c s · , n s · )), ,i.e., the optimal behaviour of a susceptible people will be the solution for the optimisation Theorem 2.3. At time t with state X t and the lockdown policy Proof. We fix the lockdown policy L · and omit it when no confusion will arise. Denote the value function as V (t, X t ) = J s (c s * · , n s * · ; t, X t , L · ). According to the dynamic programming principle, we know V must satisfy Suppose we start the lockdown problem from some time t 0 with the contamination state X t 0 being given by S t 0 = s, I t 0 = i and R t 0 = r, then the optimal lockdown policy should be the optimal control problem where (c ca * t , n ca * t ) are the optimal consumption and working hours for people in category ca (ca can be s, i or r), which are all determined in previous optimisation problems. In previous objective J 0 , we remove all cases of death. In reality, since death of disease causes has a strong negative impact to a household as well as to the society, regulators should not ignore any death case. We include the strong impact of death cases by introduce a penalty term into the objective In this new objective, we measure the the cost of a death by a multiple of the optimal utility for a recovered people, and the multiple λ > 0 can be viewed as the severity of death in the government's view. When λ = 0, J λ reduces to our previous objective J 0 . With this new objective, the problem for a regulator is to solve In Problem (23), or its reduced version (21), the optimal decisions of individuals in all three categories are involved. Fortunately, the optimal decisions of recovered and infectious people are trivial due to our good structure of the model, which leaves us to tackle the optimal decision problem (19) for susceptible people before the Problem (23). We start our solving scheme by tackling the Problem (19) with a given lockdown policy L · . Because of the lockdown constraint, it is almost hopeless for us to get an explicit solution. We solve this optimal control problem numerically in the same was as in [7] . In this approach, the optimal control at each time step is regarded as the static optimisation with two constraints from the consumption budget and the lockdown policy on the working hours, and solutions are obtained by solving the corresponding KKT condition 1 . With the optimal control (c s * · , n s * ) as functions of the lockdown policy L · , we deal with the optimal control problem (23) as an optimisation over the high dimension space [0, 1] T by the gradient-based interior-point method used in the Matlab function fmincon. Although we have no theoretical proof on the convergence of our scheme, our numerical results show the convergence of our scheme. Parts of our code in our scheme are from [7] . In this section, we study how to estimate those parameters in our model from real data, and apply it in an example with COVID-19 data in the UK to get the numerical results for optimal lockdown control. In our model, we have quite a lot of parameters, and some of them are well-estimated and available from different sources. Let us start from easily accessible ones. For the extended SIR model, without loss of generality, we standardise the total population to N = 1, which makes S t , I t , R t and D t be the proportions of the population of each category in the total population. The unit of a time step is not an essential parameter, we can simply count the time by weeks. π r and π d in the extended SIR model can be easily estimated from historical data, which have been done in several data sources like HPCC covid19 data cluster 2 . In our example, we will use the estimation from [7] . π s1 , π s2 and π s3 are complicated to estimate, and we defer the discussion to after all easy ones. For the characterisation of the decision making for individuals, we still need parameters n 0 , θ, β, A, and φ. Most of them are quite flexible, and in our examples, we do not estimate them from real data but specify their values in the same way as in different literature. We will do it in our detailed example. Finally, let us focus on the estimation of π s 1 , π s 2 , π s 3 . At any time t, we have π s1 c s t c i t + π s2 n s t n i t + π s3 = π t , where π t is the transmission rate in classic SIR model. Similar to π r and π d , the quantity π t is also available in different data source like HPCC covid-19 data cluster 2 . To estimate π s1 , π s2 , π s3 , we choose two different time spots t 1 and t 2 . The first time spot t 1 can be any time between the onset of the spreading of the virus and the first lockdown measure, and the second time spot t 2 must be in a period where a lockdown measure was applied. With the observation of π t 1 and π t 2 , we have: where L t is an estimation of actual lockdown rate at time t. These two equations are not enough to give us the values of three parameters, we still need one more equation for the purpose. In the case (as happened in the UK) that no different (non-null) lockdown measures have been applied, the third equation is officially not available. So we assume that This equation is from the assumption that susceptible people spend about 1/3 of their working hours for other activities related to other types of direct contact, and infectious people spend about half the time of susceptible ones in this type of activity due to the poor health condition. The two proportions 1/3 and 1/6 can be adjusted based on personal experience. These three equations can give us a good estimation of π s1 , π s2 and π s3 . We take the COVID-19 in the UK as our example, which started in the year 2019. The only lockdown took place on 23 March 2020 and lifted up in July 2020. For the estimation of π s1 , π s2 and π s3 , we need to specify some other parameters. According to the starting of the epidemic and lockdown, we take t 1 to be a time in Jan 2020 and t 2 to be some time in April 2020. The government released Experimental results of the pilot Office for National Statistics (ONS) online time-use study (collected 28 March to 26 April 2020 across Great Britain) 3 compared with the 2014 to 2015 UK time-use study, which reported the working-not-from-home time. According to the study, the average daily time (in minutes) of working not from home is 97.6 in March/April 2020 and 150.0 in 2014/2015, thus we estimate L t2 = 97.6/150 ≈ 0.65. Also according to ONS, the average actual weekly hours of work for fulltime workers from Dec 2019 to Feb 2020 was 36.9 4 , thereby we set n 0 = 36.9. According to the equation n 2 0 θ = 1, we set θ = 0.00073. We follows the setting of some parameters in literature. The mortality rate is set to be 0.6% from [7] . As in [7] , we assume that each infected case takes 18 days on average to either recover or die. Since our model is weekly, we have π d = 0.006 × 7/18, π r = 7/18 − π d . The reproduction number R 0 at time t 1 in Jan 2020 is around 1.95 without control measures 5 , and between 0.7 to 1.0 in April 2020 after the lockdown 67 , we use the middle point 0.85 of this range of R 0 for the calcuation of π t 2 . Since in classic SIR model, R 0 = β/γ where β and γ the infected and recovery transmission rate π t 1 = 1.95 × 7/18, π t 2 = 0.85 × 7/18. Given a published average annual income 8 30350 for 52 weeks, we set A = 15.8172. With all quantities involved in the three equations for (π s1 , π s2 , π s3 ), we get solution π s1 = 1.244887 × 10 −6 , π s2 = 1.0336 × 10 −4 , π s3 = 0.01759. 3 The R number in the UK:https://www.gov.uk/guidance/the-r-number-in-the-uk 8 statista: average full time annual earnings in the uk: https://www.statista.com/ statistics/1002964/average-full-time-annual-earnings-in-the-uk/ By the value n 0 = 36.9, we take θ = 1/(36.9) 2 . Finally, we copy the value φ = 0.8 from [7] . In this section, we present the result of our numerical experiments under the parameter setting in section 3.2. We do experiments to analyze the impact of the optimal lockdown control policy, the policy when different levels of the cost of death are taken into consideration, early exit and late start of the lockdown policy, and finally the smart containment policy. For every experiment, the initial state is (S, I, R) = (0.9998, 0.0002, 0) and the time horizon is 100 weeks. As Figure 1 (a), (d) (page 21) shows, if there is no lockdown control, i.e. the lockdown rate is constant 1 for all time, then under our parameter setting, around 15% of the population will be infected, 0.3% of the population will die and the peak of infection will be above 0.6% at week 50. Under the optimal lockdown control, the proportion of Infectious people decrease to 5.22 × 10 −5 at week 50, then raises to 2.5 × 10 −4 at week 100. 0.37% of the population will become infected and 0.0068% of the population will die by week 100. The optimal lockdown policy reduces the peak of infection by 95.8% and reduces the number of deaths by 97.7%. The significant life-saving is associated with a recession. Figure 1 (e) (page 21) shows the aggregate consumption under optimal lockdown policy decreases 20% compares to the no control case at the beginning, but then constantly increases. The average aggregated consumption fall by 6.6% with the optimal lockdown measure. In Figure 1 (f) (page 21), the optimal lockdown rate starts from around 80%, then gradually release to above 95%, the speed of the increase of the lockdown rate first decreases until around week 50, then increase until week 100. The increase of the infected proportion is because our model has a finite time horizon, and does not take the consequences after a time horizon of 100 weeks into consideration. In the beginning, the aggregated consumption under optimal lockdown control is 20% less but becomes 8.2% more than that of no control in the end. The reason that the optimal lockdown control policy did not cause a severe recession might be that in the no control case, susceptible people will cut back their working hours, as well as their consumption as the infected population increases, and in the optimal lockdown control restricted the infected population so that susceptible people won't cut back their consumption as much. We proved in section 2 that the recovered and Infectious people will work as much time as possible in order to maximize their own utility, but the behaviour of susceptible people is not certain. In the parameter setting of our experiments, the susceptible people almost work as much as possible just as the infected and recovered people do, but slightly reduce their working hours from the upper bound of lockdown constrain near the end of the time horizon, this behaviour may due to the increase of infected proportion, which raises the risk of getting infected for susceptible people. In general, the optimal lockdown policy saves lives and is more robust in economic recovery, it brings long-term health benefits and economic growth with the cost of a short-term recession. In this subsection, we study how the severity of death regarded by the planners affects the optimal lockdown policy. We set the penalty coefficient of death λ in (17) as 0, 10, 20, 50, which means the death of 1 people is regarded as the loss of 0, 10, 20, 50 recovered people by the planner. When λ = 0, it is the same as the original optimal control model. Our results in Figure 2 (page 22) show that adding a penalty on deaths makes a huge difference, it significantly slows down the increase of the lockdown rate (Figure 2 (f) (page 22)), thus reduces the proportion of deaths in a great extent: 76.7%, 83.0%, 87.2% respectively (Figure 2 (d) (page 22)), and avoid the substantial rise of the infectious population (Figure 2 (a) (page 22)), these are beneficial in terms of the mental impact in the society as low deaths and infection amount release the pressure on both people in the society and the planner. As the penalty coefficient increases, the optimal policy becomes constantly more strict. The relation of the death penalty coefficient and the result optimal control rate is below linear. As Figure 2 (f) (page 22) shows, despite that optimal lockdown policy with different death penalty coefficient starts with quite different lockdown rates:0.62, 0.56, 0.46 for penalty coefficient 10, 20, 50 respectively, they quickly become close. At the end of control, the aggregate consumption, as well as the lockdown rate of optimal policy with the death penalty is extremely close to the one without the death penalty. Compare to the original optimal lockdown policy, there is a slight recession when adding penalty on number of deaths: the average aggregate consumption decreases by 3.3%, 4.5%, 5.5% for penalty coefficient 10, 20, 50 respectively (Figure 2 (e) (page 22) ). Although the lockdown control policy with or without the death penalty becomes close from the middle to the end of the control, in the latter case, the infectious population does not rise as it in the former case. This is because that the lockdown policy with the death penalty suppresses the infectious population to a much lower level than the lockdown policy without the death penalty, thus the infectious population grows slower as the lockdown rate increases. In general, considering the cost of deaths leads to a more conservative lockdown control policy, it saves much more lives at the cost of a short-term recession. Practically, policymakers may under the intense pressure of economic loss that forces them to end the containment policy in the middle of the pandemic. In this subsection, we discuss the consequences of doing so. As we see in section 4.1, the infected population reaches the bottom at the week 50, which may seem to be a good time spot to end the lockdown policy. Our results in Figure 3 (e) (page 23) shows that there is an instant bounce of consumption right after the end of lockdown control, but this would cause the instant rise of infectious population (Figure 3 (a) (page 23)), and at the end, the infectious population 72 times larger than that of week 50. The burst of infection would result in a recession of 10.8% from the peak at the end (Figure 3 (e) (page 23)). So, ending the lockdown policy prematurely may not bring long-term economic benefit and what's worse is, it would result in a substantial additional number of deaths. Therefore we suggest that policymakers avoid terminating the lockdown policy during the pandemic in pursuit of only a short-term economic benefit. Policymakers could also face the situation that there are things that prevent them from taking the lockdown measure in the early stage of the pandemic. Our results Figure 4 (f) (page 24) show the optimal lockdown policy that starts at week 13 (around 3 months later). Compare to the optimal lockdown policy that starts at week 0 that starts with the lockdown rate 0.8, the late started optimal lockdown policy starts with a stricter constrain rate of 0.73. Although the lockdown rate of the late started lockdown policy constantly increases, it is always less than the original optimal lockdown policy. The late start causes a slight stronger recession (Figure 4 (e) (page 24)): the average aggregated consumption reduces 1% and a substantial rise of deaths ( Figure 4 (d) (page 24)): the number of deaths rises 84.8% by week 100. In general, It is the earlier the better to start the lockdown control policy, and despite that the late start of lockdown policy brings additional loss, it is much better than applying no containment policy or abandon it too early. Vaccination is an effective method of preventing infectious diseases. We now involve vaccination in SIR model. Assume that at each time period, fix amount of susceptible people: δ v of the starting population get vaccination that could prevent them from getting COVID-19 and assume governments to afford the cost of vaccination for people. Once susceptible people get vaccination, they are regarded as recovered. Thus the objective value of susceptible people become: With the cost of vaccination, denote as p, the optimal control problem of policymakers become: Figure( e),(f) shows that vaccination reduces the severity of recession and leads to a less strict optimal lockdown control. In the lockdown control policies we studied so far, the government chooses the same lockdown rate for all three kinds of people (susceptible, infectious, and recovered). In this subsection, we consider the smart containment, by which means the policymaker directly chooses working hours for all three kinds of people with the same objective function as previous models. There is no need to apply any lockdown on recovered people because their utility reaches the maximum as their working hour is at the maximum and they do not affect the utility or the transition of susceptible and Infectious people. Our results show that in the smart lockdown control policy, Infectious people almost do not work at the beginning, but then the planner gradually increases their working hours as the infected population decreases rapidly, and susceptible people work almost without fear of becoming infected. Figure 5 (page 25) shows that compare to the previous optimal lockdown control policy, the smart lockdown policy is much better, since it reduces the number of deaths to a great extent, and almost avoids the recession because the proportion of Infectious people is extremely small. The implement of a smart lockdown control policy requires the planners to know the status of all people and have control over their working hours. In reality, the knowledge of people's status needs measures such as medical testing and rely on the accuracy of testing. Our results suggest that these measures and information that are helpful for taking smart lockdown policy are beneficial for social welfare. The reproduction number (R 0 ) is now a basis for some governments to make decisions in reaction to the pandemic. We present the R 0 of lockdown polices in all our experiments in Figure 6 (page 26). The R 0 of smart lockdown policy is much smaller than that of all other lockdown policies. The R 0 of lockdown policies with the same lockdown rate for all three kinds of people behave similarly to their lockdown rate whereas in the no control case, its R 0 decreases constantly and the R 0 of smart lockdown policy behaves similar to the lockdown rate of Infectious people. This is because the behaviour of all three kinds of people is in accordance with the lockdown rate in optimal lockdown control policies, while the behaviour of susceptible people varies if there is no control, and in the smart lockdown case, the susceptible and recovered people almost remain the lockdown rate as constant 1 in the whole control process, thus its R 0 behaves in accordance with the lockdown rate of Infectious people. Notice that although the R 0 of in the no control case decreases below 1, and the R 0 of lockdown control policies with or without the death penalty increase over 1, policies with lockdown control are much better than that without control as analyzed in previous subsections. We, therefore, suggest that whether R 0 is larger or less than 1 can not be the only foundation for planners to make judgements or decisions on the current situation. In this paper, we extend the canonical epidemiological model SIR to find an optimal decision making with the aim to balance between economy and people's health. In our model, people in different health statuses take different decisions on their working hours and consumption to maximise their own utility, while policymakers control the lockdown rate to maximise the overall welfare, which leads to a two phases optimisation problem. Several parameters in our model are not straightforward to specify using the common epidemic data for modelling. We develop a novel method of parameter estimation through various additional sources of data. Our results show that lockdown measures could effectively reduce the deaths and infections caused by the COVID-19. There is an inevitable trade-off between the short-term recession, and health problems caused by the pandemic, and how policymakers deal with this could lead to very different decisions. We quantify the tradeoff by emphasising the cost of death in the model objective, which enables the optimal lockdown policy to discover a balance between the economic and epidemic outcomes. The timing of starting and ending the lockdown control policy makes much difference in terms of both the economic and epidemic outcomes. So the earlier to start the control, the better the results will be. It is crucial to avoid premature ending of the control. In the analysis of the smart containment policy, the results suggest that additional information about the health status of people is beneficial, as the optimal lockdown control policy will reach much better outcomes if it could be implemented on people with different health status separately. Through comparison of lockdown policies, we suggest that R 0 cannot be the only foundation for policy-making. Optimal targeted lockdowns in a multi-group sir model Modelling the impact of testing, contact tracing and household quarantine on second waves of covid-19 A simple planning problem for covid-19 lockdown An seir infectious disease model with testing and conditional quarantine An economic model of the covid-19 epidemic: The importance of testing and agespecific policies Fiscal policy during a pandemic The macroeconomics of epidemics Internal and external effects of social distancing in a pandemic Strategies for containing an emerging influenza pandemic in southeast asia Impact of non-pharmaceutical interventions (npis) to reduce covid-19 mortality and healthcare demand Estimating the effects of non-pharmaceutical interventions on covid-19 in europe Cost-benefit analysis of age-specific deconfinement strategies On the optimal" lockdown" during an epidemic Mitigating the COVID Economic Crisis: Act Fast and Do Whatever 31 Macroeconomic Implications of COVID-19: Can Negative Supply Shocks Cause Demand Shortages? Exact analytical solutions of the susceptible-infected-recovered (sir) epidemic model and of the sir model with equal death and birth rates Three basic epidemiological models, in: Applied mathematical ecology Dynamic modeling to identify mitigation strategies for covid-19 pandemic. medRxiv Optimal mitigation policies in a pandemic: Social distancing and working from home A contribution to the mathematical theory of epidemics Early dynamics of transmission and control of covid-19: a mathematical modelling study. The lancet infectious diseases The reproductive number of covid-19 is higher compared to sars coronavirus Optimal covid-19 quarantine and testing policies An investigation of transmission control measures during the first 50 days of the covid-19 epidemic in china An evaluation of mathematical models for the outbreak of covid-19. Precision Clinical Medicine Nowcasting and forecasting the potential domestic and international spread of the 2019-ncov outbreak originating in wuhan, china: a modelling study