key: cord-0807526-3hwcc3cc authors: Jeff Hong, L.; Liu, Guangwu; Luo, Jun; Xie, Jingui title: Variability scaling and capacity planning in Covid-19 pandemic date: 2022-05-13 journal: nan DOI: 10.1016/j.fmre.2022.04.019 sha: 347f8ba8c4b9370423de01c7c771cf147a763734 doc_id: 807526 cord_uid: 3hwcc3cc Capacity planning is a very important global challenge in the face of Covid-19 pandemic. In order to hedge against the fluctuations in the random demand and to take advantage of risk pooling effect, one needs to have a good understanding of the variabilities in the demand of resources. However, Covid-19 predictive models that are widely used in capacity planning typically often predict the mean values of the demands (often through the predictions of the mean values of the confirmed cases and deaths) in both the temporal and spatial dimensions. They seldom provide trustworthy prediction or estimation of demand variabilities, and therefore, are insufficient for proper capacity planning. Motivated by the literature on variability scaling in the areas of physics and biology, we discovered that in the Covid-19 pandemic, both the confirmed cases and deaths exhibit a common variability scaling law between the average of the demand μ and its standard deviation σ , that is, σ ∝ μ β , where the scaling parameter β is typically in the range of 0.65 to 1, and the scaling law exists in both the temporal and spatial dimensions. Based on the mechanism of contagious diseases, we further build a stylized network model to explain the variability scaling phenomena. We finally provide simple models that may be used for capacity planning in both temporal and spatial dimensions, with only the predicted mean demand values from typical Covid-19 predictive models and the standard deviations of the demands derived from the variability scaling law. The Covid-19 pandemic has brought a global public health crisis. On one hand, some critical medical resources are facing shortage and therefore require a fair and rational planning and allocation of these resources [1] , including the medical personnel of nurses and physicians [2, 3] , the critical equipment of ventilator and ECMO service as well as the ICU management [4] [5] [6] , the personal protective equipment of masks and alcohol [7] , and so on. On the other hand, capacity planning and maintenance of these resources are often challenging and costly in practice. For instance, the New York City has issued an ambitious pandemic preparedness plan of stockpiling thousands of ventilators and millions of masks right after the 2006 SARS outbreak. Due to the budget constraints, the city only acquired 500 additional ventilators at the beginning, but all were auctioned off a few years later because the department could not afford the expensive maintenance cost [8] 1 ○ . a spatio-temporal risk source model that can forecast the distribution of confirmed cases. However, predicting only the mean demand over different time periods (i.e., in the temporal dimension) and over different geographical locations (i.e., in the spatial dimension) is not enough. Understanding the corresponding variability of demand is also very critical for determining medical resource planning and allocation. Some models also provide a 95% confidence interval of their estimates. However, it is not clear what the confidence intervals mean. For example, some critics have pointed out that the actual death rate was outside the 95% confidence interval of IHME's prediction in nearly 70% of the US states [13] , which highlights the need for better characterization and more accurate estimates of the variabilities 3 ○ . Based on classical theories on capacity planning in the areas of production and operations management, to hedge against the fluctuations in the random demand and to take advantage of the risk pooling effect, one needs to have good understanding of the variabilities in the demand of resources. Traditionally, the variability may have already been specified by the stochastic processes as modeling the random demand arrivals. For instance, the Poisson and inhomogeneous Poisson processes are often used to model the patient arrivals to a healthcare system [14] , which implies that the standard deviation (i.e., a common approach to quantifying variability) of the demand is the square root of the mean demand. Then, the famous square-root staffing rule can be obtained [15] . The square-root staffing rule indicates a strong pooling effect in the temporal dimension, which is due to the independent increments property of Poisson models. Similarly, the pooling effect has been obtained in the spatial dimension, e.g., by specifying the demand correlation structures in the inventory problem with multiple locations in Eppen [16] , Corbett and Rajaram [17] and Mak and Shen [18] . The pooling effect, which means fulfilling the demands for multiple time periods or different locations from a centralized system, is essential for planners to make proper inventory or resource allocation decisions to meet certain quality of service (QoS) levels [19] . The magnitude of the pooling effect depends not only on the average demand level, but also on the scale of variability. Then, appropriately modeling the variability becomes very critical. Motivated by the literature on variability scaling in the areas of physics and biology, in this study, we discovered that in the Covid-19 pandemic, both the confirmed cases and deaths exhibit a common variability scaling law between the average of the demand and its standard deviation , that is: where is the scaling parameter that determines the scaling effect, which is typically in the range of [0. 5, 1] , and the scaling law exists in both the temporal and spatial dimensions. Notice that the Poisson and inhomogeneous Poisson process models also obey the variability scaling law, with = 0 . 5 . Indeed, one can prove that commonly used time series models, such as auto-regressive models and moving-average models, also obey the variability scaling law with = 0 . 5 . Notice that the square-root staffing rule may be extended to these time series models and the square root corresponds to = 0 . 5 . The relationship in Eq. 1 captures the scaling phenomena in many areas, in order to understand the dynamics and the structure of a complex system. In ecology, Taylor was among the first who showed empirically that the mean and standard deviation of the number of individuals of a species in a region follows this scaling relation [20] , which is known as Taylor's law today. In physics, this law is known as fluctuation scaling , and it was found in heavy-ion collision experiments [21] and in fluxes of cosmic radiations [22] . This law has also been discovered in some other systems, including stock markets [23] , emails and printing jobs 3 ○ See article for more details https://www.sydney.edu.au/news-opinion/ news/2020/04/14/us-covid-19-deaths-poorly-predicted-by-ihme-model.html . etc. [24, 25] 4 ○ . However, it appears that this law has not been discovered in Covid-19 or other healthcare systems. By analyzing the Covid-19 data, we find that the values of are typically greater than 0.5. In fact, they are between 0.7 and 1 in most situations. Note that < 1 implies that there is a pooling effect and, therefore, there exist economies of scale in the medical resource planning. Meanwhile, > 0 . 7 suggests that the pooling effect is not as strong as what is commonly expected under the Poisson models, which tends to underestimate the resource requirement and lead to a lower QoS if not taking the true variability scaling into consideration. We show that the variability scaling phenomena exist not only in Covid-19, but also in various healthcare demands, in both temporal and spatial dimensions, based on a number of datasets from different healthcare systems in different countries. Based on the mechanism of contagious diseases, we further build a stylized network model to explain the variability scaling phenomena. In the model, the critical reason that the variability scaling phenomenon emerges is that the network is a small-world network, where any two persons in the network can be linked with a small of steps [26, 27] . Therefore, the diseases can be transmitted quickly between any two random persons, which matches what we have seen in the Covid-19 pandemic. Furthermore, to illustrate how the variability scaling affects the capacity planning, we also provide a simple model that may be used for capacity planning and resource allocation in both temporal and spatial dimensions, with only the predicted mean demand values from typical Covid-19 predictive models. In this model, the standard deviations of the demands are derived from the variability scaling law. The contributions of this study can be summarized in the following three folds. We first show that variability scaling exists in the Covid-19 pandemic in US, Italy, Brazil, China and worldwide, and also in many other healthcare systems. Then, we propose a network model to provide a reasonable explanation that can theoretically yield the scaling phenomena. Finally, we establish explicit formulas between variability scaling and capacity planning, and economies of scale. We show that high variability requires extra resource planning, and the high variability reduces the benefit of economies of scale. We close this section by briefly reviewing some existing works in the operation management literature, which is closely related to our study. Eppen [16] is a seminal work on the pooling effect of a multilocation newsvendor problem where the demand correlations are captured by the covariance matrix of a multivariate normal distribution. This work has been generalized by Corbett and Rajaram [17] for nonnormal dependent demand distributions and extended by Mak and Shen [18] to a general supply chain network model with various dependence structures. These studies can be viewed as modeling works on the demand variability in the spatial dimension. In the temporal dimension, the doubly stochastic Poisson process proposed by Whitt [28] is one of the pioneering works that model the demand correlation of the arrival process for call centers. Since then, in a stream of research, e.g., Chen and Henderson [29] , Avramidis et al. [30] , stochastic arrival rates of arrival processes have been studied in various queueing models for call center operations management. Empirical studies of Kim and Whitt [31] , Kim et al. [32] investigate the Poisson and non-Poisson properties for call centers and healthcare systems to address data fitting issues, e.g., over-dispersion of Poisson models. Recently, Whitt and You [33] use the index of dispersion for counts (IDC) introduced in Fendick and Whitt [34] to emphasize the importance of dependence in singleserver queueing systems. However, rather than providing the IDC to indicate the dispersion between the mean and standard deviation, we establish the relation between the mean and standard deviation via the 4 ○ As an interesting note, Oliveira and Barabási [25] showed in their Nature paper that the mail correspondence patterns between Darwin and Einstein follow the variability scaling law. proposed variability scaling law. The variability scaling reveals the longrange dependence over time. The rest of this paper is organized as follows. In Section 2 , we summarize our empirical findings which show that variability scaling exists in the Covid-19 pandemic and all other investigated healthcare systems. We then propose a network model to explain the mechanism of variability scaling in Section 3 . In Section 4 , we show how the variability scaling affects the capacity planning using a stylized demand model. Section 5 concludes the paper. By taking the logarithmic transformations, the law of variability scaling in Eq. 1 becomes: where and are the regression coefficients to be estimated, and the standard deviation and the average will be computed from the data. In other words, the statistical model can be expressed as follows: where log ̂ and log ̂ are the "observable " variables, are the unobserved error terms, and is the total number of "observations " for log ̂ and log ̂ 5 ○ . Both ̂ and ̂ are computed from the corresponding sample average and sample standard deviation: where , = 1 , 2 , … , are random samples of size from observation . The index plays an essential role in our study, which could be used to represent different subsystems in different dimensions. For instance, in the temporal dimension, the subsystems could be daily new cases of Covid-19 in one country in different consecutive daily intervals; in the spatial dimension, the subsystems could represent new cases occurred in different geographical state locations. To better calibrate the power relationship, subsystems must have different sizes, so that the scaling phenomena may be observed over different magnitude scales. Note that the statistical model in Eq. 2 is often called a loglinear regression model in econometric literature, which studies the elasticities rather than the marginal effects between and (see, for example, Verbeek [35] ). However, we interpret the model from a statistical point of view rather than an econometric perspective since the endogeneity between and is intrinsic and difficult to be removed. Nevertheless, the model is simple and has been widely studied in various research areas in the literature and it can provide us some managerial insights in Covid-19 and other healthcare management problems. To establish the variability scaling law in Covid-19 pandemic, we collect the data of Covid-19 daily new cases and daily new deaths worldwide, as well as the data from four different countries on four different continents, i.e., US on North America, Italy on Europe, Brazil on South America and China on Asia. Since the majority of the new cases and deaths occurred in China are from Hubei province, we investigate on 5 ○ In fact, the "observations " for log ̂ and log ̂ are not directly observable but computed based on the sample average and sample standard deviation, which then bring the error terms into the model. We adopt to use the word "observation " simply following the traditional terminology in the statistic/econometric literature. Hubei province separately 6 ○ . We briefly introduce the datasets as follows, and refer interested readers to Appendix B.1 for more details about the original raw datasets. ( We would like to use all data from the beginning of the breakout until the end of September in 2021 because we notice there might be some data corrections in recent months. Then, we select March 8, 2020 as the starting time for US and Italy as well as worldwide because the US CDC decided to relax the requirements for the nucleic acid testing on March 4 and data from all states became available on March 8 [36] . For Brazil, We start from March 22, 2020 rather than March 8, 2020 since the data of all states in Brazil became available on that day. For China, we choose January 23, 2020 as the starting time because the first record of new cases in CSSE was reported on that day. For Hubei in China, we choose January 25, 2020 as the starting time because the first record of new cases in DXY.cn was available on that day, and choose April 26, 2020 as the ending time because the last hospitalized Covid-19 patient in Wuhan city was discharged on that day. On each dataset, we analyze the variability scaling law in both the temporal and spatial dimensions by fitting the loglinear regression model in Eq. 2 . Taking the Covid-19 data in US for example, in the temporal dimension analysis, we use the daily new cases and new deaths data of 51 states from March 8, 2020 to September 30, 2021 (about 19month period with 572 days in total), and calculate the mean and standard deviation of new cases and deaths over the 51 states for each day. Then, we obtain = 572 data points and use them to run the regression model. In the spatial dimension analysis, we calculate the mean and standard deviation of daily new cases and deaths over the whole time period for each state, and run the loglinear regression model using = 51 data points, as shown in Fig. 1 . Similarly, we apply the same procedure to analyze all other datasets. All results are summarized in Table 1 and log in either the temporal or spatial dimensions, as all the values of 2 are above 0.89. The results indicate that there is a common variability scaling law that governs the variability of Covid-19 new cases or new deaths in the temporal and spatial dimensions. Second, unlike the commonly assumed Poisson models in the healthcare management literature, the exponents of the scaling relation, i.e., the values of , are not always equal to 0.5. Notice that we may rewrite the variability scaling law as ∕ ∝ −1 , where ∕ is the coefficient of variation that measures the demand variability per unit of average demand, implying that there is a pooling effect when < 1 , but the pooling effect decreases as becomes larger. In particular, the results of > 0 . 9 in some scenarios demonstrate a large variability scaling effect but only mild pooling effect, which may lead to great challenges in medical resource planning in the Covid-19 pandemic. Third, for new cases, the values of in spatial dimension in US and Italy are significantly larger than those in temporal dimension, while the opposite results can be observed worldwide and in China. One plausible reason is that the traveling restrictions among different countries and strict traveling examinations among different cities in China may effec-tively lower the travel frequency, and thus lower the Covid-19 propagation geographically, compared with the situations in US and Italy. Fourth, the values of in Brazil, for both new cases and deaths, no matter in temporal or in spatial dimensions, are very close to 1 7 ○ , which is very similar to the results obtained in the early time at Hubei province in China. As for Hubei province, which was first attacked by Covid-19 pandemic, there was no enough medical resource or effective actions to combat the Covid-19 at the beginning, could cause high variability scaling phenomena. We conjecture that the lack of sufficient medical resource or effective prevention and control measures could also be the problems in some developing countries, such as Brazil, then leading to a higher variability scaling, especially in new deaths, compared with developed countries, such as US and Italy. Note that the small value of = 0 . 65 for new deaths in the spatial dimension in China also indicates the significant effectiveness of the Covid-19 prevention and treatment policies taken in China. Last but not least, it is also worthwhile pointing out that, except the result in the temporal dimension in Italy and Hubei province, the values of for new deaths tend to be smaller than those for new cases in the same dimension, indicating a smaller variability for new deaths, which may possibly be explained by the vaccine or treatment effect after being confirmed. We also examine the variability scaling in other healthcare demands, from 10 datasets in 4 different countries with respect to different healthcare systems. These datasets include four detailed patient arrival admissions from a large hospital in China, Israel, Singapore and the US, respectively; one dataset on the daily appointments to a group of clinics in Singapore; one dataset on the admissions of all patients with the government insurance plan to a cluster of hospitals from a provincial capital city in China; two datasets on the all confirmed cases of two infectious diseases (i.e., hand-foot-mouth (HFM) disease and dysentery disease, respectively) with their location information from a prefecture-level city in China; one dataset on all emergency calls to an ambulance service center from one of the largest cities in China; and one dataset on the demands of orthopedic surgery materials from a major supplier at the provincial level in China (see Appendix B.2 for more details about the 10 datasets). We test the scaling relation on all 10 datasets in the temporal and spatial dimensions whenever the data is supported. The results are summarized in Table 2 . We can draw similar conclusions as obtained from the Covid-19 data. Moreover, we have two additional interesting findings. First, compared with the HFM and dysentery infectious diseases, we notice that the values of in Covid-19 tend to be larger than those in HFM and dysentery diseases, which implies larger fluctuations in Covid-19 pandemic. Second, the values of for the patient admissions into a hospital in Singapore, Israel and the US in the temporal dimension are smaller than 0.55, which indicates that Poisson arrival processes are suitable to model these demands as widely used in the queueing literature. In the rest of the datasets, we may conclude that many of the currently used models of resource planning with Poisson assumptions tend to underestimate the resource requirement of the Covid-19 pandemic and various healthcare systems and lead to a lower service quality. Different models have been proposed to explain variability scaling [37] . Some of these models are more conceptual. For instance, it was discovered that random walks on a complex network can generate the 7 ○ We note that = 1 . 04 for the new cases in the spatial dimension in Brazil, however, its 95% confidence interval is [0.89,1.19], which is relatively large compared with others but still covers values less than 1. variability scaling phenomenon in the temporal dimension [38] . Some of these models are specifically designed to explain a certain type of variability scaling phenomenon in physics or biology. For instance, in mean-field thermodynamics, models have been proposed based on ferromagnetic Ising models [37] , and in biology, models have been developed based on plant reproduction processes [39] . However, these models may not be directly applicable to the explanation of the variability scaling phenomena that we observe in Covid-19 and other healthcare demands. In this paper, motivated by the mechanism of contagious diseases, we establish a simplified network model, which is essentially a space-filling fractal network model that has been used to explain allometric scaling by West et al. [ 26, 40 ] , in order to capture the correlation among demands. We consider the case where the infections are correlated through some internal mechanism, in which each demand node except those in the bottom level is directly connected to + 1 demand nodes. To simplify the analysis, we assume that the network has no cycles, thus, is a tree structure 8 ○ . Denote the node at the center of the network as the level-0 node (i.e., the root or the central node in Fig. 2 ) . Note that the level-0 node has + 1 descendant demand units while the nodes in all other intermediate levels, i.e., level 1 , 2 , … , − 1 , have descendants and one parent node. The leaf nodes in the bottom level, i.e., level , have no descendant, where is the number of levels of the network. For the purpose of mathematical rigorousness, we assume that ≥ 2 and ≥ 3 . Let denote the number of nodes in level , 0 ≤ ≤ . It can be shown that: Then the total number of nodes in the network is: For ease of presentation, we let the aggregated demand of all levels in the network be: where E( ) = 0 and Var ( ) = 2 0 for all , and assume the correlation of any pair of demands ( , ) is = 0 , where is the distance between the two nodes. After some derivation, we can obtain the results for Var ( ) as follows. 8 ○ In medical literature, shortest path trees of human contact networks are often used to study the spread of infectious disease [27] . is defined in Eq. 4 with all parameters specified above in the network model. We assume that ≥ 2 and ≥ 3 . Then, as → ∞ (i.e., → ∞), if 2 < 1 , and if 2 > 1 , Proof. We let and denote the summations of correlations for all pairs of nodes within the same level and cross different levels, respectively. Then: We first analyze . Let denote the sum of correlations for level , 1 ≤ ≤ . Notice that the distance between any two nodes at level 1 is 2, and there are 1 such pairs of nodes. Then: Similarly, distance between any pair of nodes in level ( = 2 , … , ) may take values in {2 , 4 , 6 , … , 2 } . In particular, the number of pairs that have distance 2 is ( − 1) −1 for 1 ≤ ≤ − 1 , while the number of pairs that have distance 2 is . Therefore, if 2 ≠ 1 , Next we analyze . Let denote the sum of correlation terms between level-and level-nodes. Notice that distance between the node at level 0 and a node at level is . Then, for ≥ 1 , − + 2 . If 2 ≠ 1 , for 1 ≤ ≤ − 1 and > , we have: Then by elementary algebra, if 2 ≠ 1 and ≠ 1 , Recall that Var ( ) = 2 0 + 2 0 ( + ) , and: = ( − 1 ) ∕ ( + 1 ) + 2∕ ( + 1 ) If 2 < 1 and = 1 , it can be verified that, as → ∞, and thus: If 2 > 1 , we have: Notice that log ( 2 )∕ log > 0 , and: Then: Similarly, it can be verified that: where the last equality follows from the fact that 2 + 2 log ∕ log > 1 . Therefore, if 2 > 1 , Based on the result in Proposition 1 , we are ready to state the main results to exhibit the variability scaling in this network model. Notice that, increases with these two parameters and . In addition, when 2 > 1 , Eq. 7 implies that 0 . 5 < ≤ 1 . The results of both > 0 . 5 and = 0 . 5 match what we have observed empirically in Covid-19 and other healthcare systems. (a) The proposed network is a stylized model. One may consider each node as one person, then is the average number of people contacted with others and is the individual infection rate. One may consider each node as one family, the is the average number of contacted families and is the family infection rate. In fact, each node could be used to represent one region or one time-interval. Intuitively, the policies like travel restrictions and hygiene regulations can reduce the parameters and , thus reduce the coefficient . (b) Even though we make strong assumptions of the proposed network model, such as each node (except the leaf nodes) connects to exactly the other + 1 nodes, the model can be easily extended to the case with a random number of connected nodes, which will be demonstrated from the simulation results in the following subsection. We have conducted simulation experiments to test the robustness of the results for the network model. For all simulations, we set 0 = 2 0 = 1 and 0 = 1 and assume that the atomic demand has the mean 0 and variance 2 0 and the correlation between any pair of and , ≠ , is specified according to the network structure of the proposed model. We take 12 values of , i.e., = {0 . 001 , 0 . 01 , 0 . 1 , 0 . 2 , 0 . 3 , … , 1} . For each , we generate a fixed number of random networks with different size . The size of network varies from a few nodes to thousands of nodes. The distance of any two nodes is the length of the shortest path between these two nodes. The expected total demand E( ) is actually , and the variance of demand Var ( ) can be calculated by Eq. 5 . Using linear regression (as discussed in the empirical analysis), we can estimate the coefficient for each . In order to reflect the random nature of actual networks, we set the number of nodes connected any nonleaf node as a Poisson random variable with mean + 1 = 6 . In particular, we consider 5 random networks with 6 replications for each level = {1 , 2 , … , 5} , resulting 30 random networks in total, i.e., 30 observations for the regression model. The simulation results show that variability scaling also emerges in these random networks (as in Fig. 3 ) and the exponents are close to the ones derived for the deterministic networks, which are derived in Eq. 7 when the number of descendants or the number of connected nodes is fixed as + 1 (also see Table 3 ). From Table 3 , we notice that the value of increases as increases, which is consistent with the theoretical results in Eq. 7 . While in the random case may be slightly different from that in the deterministic case, which could be due to the estimation error of the mean and standard deviation. From Fig. 3 , we notice that the linear regression model fits the simulation data quite well for our proposed network model. In this section, we illustrate how variability scaling affects the capacity planning via two simple inventory models. The first is the classical newsvendor problem with the relationship between the mean and standard deviation specified by the variability scaling in the demand function, and the second is a multi-period/multi-location inventory model with demand correlation structure specified by the proposed network model. We demonstrate that the safety stock level will increase while the pooling effect will decrease when the variability scaling effect, i.e., the value of , becomes larger. We first consider the classical newsvendor problem [41] for a certain type of medical resources for Covid-19. Suppose that the demand for this resource can be fully determined by the number of susceptible, new infected or death cases, which are random and the relationship between the mean and standard deviation follow the variability scaling law. For example, if we know there are 100 new confirmed cases, then it requires 100 additional beds to admit these patients. For notational simplicity, we directly model the demand of the medical resource as a random variable , with cumulative distribution function . In practice, is unknown and hard to estimate due to the high fluctuation in Covid-19 pandemic. Fortunately, different methods have been proposed in the literature to provide an accurate estimation of the mean E( ) . For illustrative purpose, we take a simple way by assuming that follows a normal distribution ( , 2 ) , where is known and = with and estimated from the loglinear regression model. Let ℎ and denote the unit holding cost and unit penalty cost of unsatisfied orders, respectively. Let denote the order quantity and the expected total cost is: By minimizing the expected cost function ( ) , we can determine the optimal inventory level * as follows: * = + (8) where ̄ is the ∕( + ℎ ) th quantile of the standard normal distribution. Then, the corresponding optimal cost is: is the right-hand unit normal linear-loss integral that is widely used in inventory literature. From Eqs. 8 and 9 , we can see that the optimal safety stock level and the total cost increase in an exponential rate as the variability scaling coefficient increases. When is close to 0.5, it implies there is a significant risk pooling effect since the safety stock level is in a near-square-root order of the mean level. When is close to 1, it indicates there is almost no pooling effect as the safety stock increases nearly in the same order of the mean demand to satisfy a certain QoS level. In the following, we establish a simple multi-period/multi-location inventory model to illustrate how the variability scaling affects the optimal capacity planning decision for a central planner. We consider that there are nodes in the proposed network model, which could represent time periods or locations, and each has the demand of one particular medical resource , where = 1 , 2 , … , . For simplicity, we assume that ∼ ( 0 , 2 0 ) , and the correlation of any pair of demands ( , ) is = 0 , with all parameters defined in Theorem 1 . Without pooling, we assume that each individual node solves the classic newsvendor problem without considering the correlations. Then, the optimal expected cost for each node is = 0 0 , where 0 = ℎ ̄ + ( ℎ + ) ( ̄ ) with unit holding cost ℎ and unit shortage cost and ̄ is the ∕( + ℎ ) th quantile of the standard normal distribution and ( ⋅) is the right-hand unit normal linear-loss integral function as defined before. The optimal total expected cost for the system with nodes is simply 1 = = 0 0 . With pooling, we assume that the central planner solves the classic newsvendor problem for the entire system with nodes. Then, the optimal total expected cost for the system is 2 = 0 , where: with being the covariance of and . By the variability scaling result in Theorem 1 , we know that ∝ , where the scaling coefficient is typically between 0.5 and 1. Hence, the cost saving of pooling is: We have established a bridge between two important results, i.e., the variability scaling effect and the economies of scale, in which the former one is widely observed in complex systems and the latter one is well known in economics. Then, the cost savings by pooling subsystems into a centralized system can be approximately estimated as: If is close to 0.5, we achieve strong cost savings by pooling demands together. If is close to 1, as observed from the Covid-19 and some other healthcare datasets in Section 2 , then the benefit of pooling becomes negligible. In practice, the system manager or government decision maker must be very careful when making pooling decisions when the demand variability of the system is large, e.g., the scaling effect is significantly greater than 0. 8 55 . 3% cost savings over the 33 provinces if using = 0 . 77 for the new cases in the spatial dimension in China. To some extent, it means there is not so much pooling benefit if requiring to prepare some medical resources for a long time period that is more than one year 9 ○ . However, allowing different regions to share their resources could bring significant pooling benefit in the spatial dimension, which is definitely worthy to design some mechanism to guarantee the sharing system function in the future study. As the Covid-19 pandemic keeps going, it is very critical to design a reasonable and effective capacity planning of medical resources. Accurately calibrating the evolution of Covid-19 is the precondition to achieve this goal. However, most of the current works focus just on predicting mean demands of new cases and deaths and so on, but seldom provide trustworthy prediction or estimation of the demand variabilities, and therefore are insufficient for proper capacity planning. In this paper, we adopt the commonly used approach to quantifying the variability by the standard deviation and establish a variability scaling law between the mean and standard deviation. Our empirical findings show that variability scaling is common in the Covid-19 pandemic and various other healthcare demands, most of which yield the variability scaling coefficient larger than 0.5 and even greater than 0.7 for most Covid-19 datasets. This indicates that some existing demand models with equal to 0.5 may underestimate the variation of the medical demands and overestimate the risk pooling effects. We then build a stylized network model to explain the internal demand/infections correlation, from which we can see how correlations among the network affect the intensity of variability scaling. At last, we use two simple inventory models to illustrate that larger variability scaling effect leads to higher inventory stock level, and lower pooling effect. Our proposed models serve as a first step towards the understanding of the mechanisms behind variability scaling. In the future, as more information about Covid-19 becomes available, e.g., the imposed social distancing mandates, factors that affect the variability scaling effect deserve further investigation. The change of capacity of the nucleic acid testing and other cure resources could also affect the variability of new cases and new deaths for Covid-19, which is also interesting to study. While we assume a linear relationship between the mean and standard deviation in our study, it may be possible to establish a nonlinear relationship as more data are available. It is also worthwhile to model the variability scaling effect in different dimensions by developing advanced stochastic or complex network models. In terms of examining the impact on the capacity planning and resource allocation problems in the Covid-19 pandemic, one may propose more sophisticated problem formulations rather than the stylized newsvendor model. The authors declare that they have no conflicts of interest in this work. We follow the order of the datasets reported in Table 2 in the paper. JID: FMRE [m5GeSdc JID: FMRE [m5GeSdc FMRE [m5GeSdc Fair allocation of scarce medical resources in the time of Covid-19 On being a neurologist in Italy at the time of the COVID-19 outbreak Clinical ethics recommendations for the allocation of intensive care treatments in exceptional, resource-limited circumstances: the Italian perspective during the COVID-19 epidemic COVID-19: ICU delirium management during SARS-CoV-2 pandemic A model of supply-chain decisions for resource sharing with an application to ventilator allocation to combat COVID-19 A framework for rationing ventilators and critical care beds during the COVID-19 pandemic Sourcing personal protective equipment during the COVID-19 pandemic How New York City's emergency ventilator stockpile ended up on the auction block COVID-19 and Italy: what next? Data analysis and modeling of the evolution of COVID-19 in Brazil Analysis and forecast of COVID-19 spreading in China Population flow drives spatio-temporal distribution of COVID-19 in China Learning as we go: an examination of the statistical accuracy of COVID19 daily death count predictions Capacity planning and management in hospitals Erlang-R: a time-varying queue with reentrant customers, in support of healthcare staffing Note -Effects of centralization on expected costs in a multi-location newsboy problem A generalization of the inventory pooling effect to nonnormal dependent demand Pooling and dependence of demand and yield in multiple-location inventory systems Inventory pooling under heavy-tailed demand Aggregation, variance and the mean Universal fluctuations in heavy-ion collisions in the Fermi energy domain The flux-dependent amplitude of broadband noise variability in X-ray binaries and active galaxies Scaling theory of temporal correlations and size-dependent fluctuations in the traded value of stocks The origin of bursts and heavy tails in human dynamics Darwin and Einstein correspondence patterns A general model for the structure and allometry of plant vascular systems A high-resolution human contact network for infectious disease transmission Dynamic staffing in a telephone call center aiming to immediately answer all calls Two issues in setting call centre staffing levels Modeling daily arrivals to a telephone call center Are call center and hospital arrivals well modeled by nonhomogeneous Poisson processes? Poisson and non-Poisson properties in appointment-generated arrival processes: the case of an endocrinology clinic The advantage of indices of dispersion in queueing approximations Measurements and approximations to describe the offered traffic and predict the average workload in a single-server queue A Guide to Modern Econometrics Geographic differences in COVID-19 cases, deaths, and incidence -United States Fluctuation scaling in complex systems: Taylor's law and beyond Fluctuations in network dynamics Patterns of annual seed production by northern hemisphere trees: a global perspective A general model for the origin of allometric scaling laws in biology Foundations of Inventory Management Monitoring the number of COVID-19 cases and deaths in Brazil at municipal and federative units level The authors would like to thank the Area Editor, Professor Yijun Li, the editorial member and two anonymous referees for their insight-9 ○ Recall the ventilator and mask stockpiling example in New York City mentioned in Section 1 , it could be even worse if taking the maintenance and other costs into consideration in practice. ful and detailed comments that have significantly improved this paper. This research was supported in part by the National Natural Science Foundation of China (Grants No. 72042015 , 72091211 , 72031006 and 71722006 ). (a) Worldwide: The dataset is provided by the WHO. It includes daily new cases and daily deaths of 237 countries or regions in the world, which correspond to the columns named "New_cases " and "New_deaths " in the original file. (b) US: The dataset is provided by the US CDC. It contains daily new cases and daily deaths (including both confirmed and probable cases) in 60 regions of US (including 50 states, New York City, Washington D.C. and 8 overseas territories). We exclude the data of all territories and combine the data of New York City with New York state, and use only the daily new confirmed cases (i.e., column "new_case ") and daily deaths (i.e., column "new_death ") from the 50 states and Washington D.C. Note that the CDC may correct the historical data, so the dataset may be slightly different if downloading in different time. The date we downloaded the dataset is December 8, 2021. (c) Italy: The dataset is provided by the Civil Protection Department of Italy. The raw dataset records the cases, deaths and the usage of medical resources in 21 regions everyday. Note that Trentino-Alto Adige region consists of Bolzano and Trento provinces, which have been recorded separately. In our analysis, we combine Bolzano and Trento together as one region, and use the daily new cases (i.e., column "nuovi_positivi ") and calculate daily new deaths from the accumulative deaths (i.e., column "deceduti ") by subtracting the accumulative deaths at previous date from the accumulative deaths at current date. (d) Brazil: The dataset is obtained from the Federal University of Viçosa [42] , which is originally provided by the Ministry of Health of Brazil. The Federal University of Viçosa research team has collected the data of new cases, deaths, tests and vaccinations in 27 states. We use the accumulative cases and deaths data (i.e., columns "totolCasesMS " and "deathsMS ") to calculate the daily new cases or daily deaths by a similar subtraction as done for Italy. Note that we select March 22, 2020 as the starting time since the first record is on March 21, 2020 in this dataset. (e) China: The CSSE at Johns Hopkins University provides the dataset in two separate files as accumulative cases and accumulative deaths. The two files record the accumulative cases and deaths in 195 countries or regions. We include Taiwan in our analysis and deleted the records with "unknown " in column "Province/State " in raw datasets. Then, we use the accumulative cases and deaths data in China at province level, and calculate the daily new cases and daily deaths by subtraction. (f) Hubei in China: We use the file called "DXYArea.csv " in the dataset, which is provided by a medical technology company DXY.cn in China. The dataset includes new cases, deaths and cure data in the city level in China. We exclude the data of "Area not defined " and those left blank in column "cityEnglishName ", and use the accumulative cases and deaths (i.e., columns "city_confirmedCount " and "city_deadCount ") of 17 cities in Hubei province to calculate the daily new cases and daily deaths by subtraction. Note that there may be more than one record in some dates, we just use the last record in these dates.