key: cord-0956118-ilhs93ws authors: Chowell, G.; Rothenberg, R.; Roosa, K.; Tariq, A.; Hyman, J. M.; Luo, R. title: Sub-epidemic model forecasts for COVID-19 pandemic spread in the USA and European hotspots, February-May 2020 date: 2020-07-04 journal: nan DOI: 10.1101/2020.07.03.20146159 sha: 90cfe501dbea2ce87eb69d2f8cb59a42e7635864 doc_id: 956118 cord_uid: ilhs93ws Mathematical models have been widely used to understand the dynamics of the ongoing coronavirus disease 2019 (COVID-19) pandemic as well as to predict future trends and assess intervention strategies. The asynchronicity of infection patterns during this pandemic illustrates the need for models that can capture dynamics beyond a single-peak trajectory to forecast the worldwide spread and for the spread within nations and within other sub-regions at various geographic scales. Here, we demonstrate a five-parameter sub-epidemic wave modeling framework that provides a simple characterization of unfolding trajectories of COVID-19 epidemics that are progressing across the world at different spatial scales. We calibrate the model to daily reported COVID-19 incidence data to generate six sequential weekly forecasts for five European countries and five hotspot states within the United States. The sub-epidemic approach captures the rise to an initial peak followed by a wide range of post-peak behavior, ranging from a typical decline to a steady incidence level to repeated small waves for sub-epidemic outbreaks. We show that the sub-epidemic model outperforms a three-parameter Richards model, in terms of calibration and forecasting performance, and yields excellent short- and intermediate-term forecasts that are not attainable with other single-peak transmission models of similar complexity. Overall, this approach predicts that a relaxation of social distancing measures would result in continuing sub-epidemics and ongoing endemic transmission. We illustrate how this view of the epidemic could help data scientists and policymakers better understand and predict the underlying transmission dynamics of COVID-19, as early detection of potential sub-epidemics can inform model-based decisions for tighter distancing controls. Mathematical models have been widely used to understand the dynamics of the ongoing coronavirus disease 2019 (COVID-19) pandemic as well as to predict future trends and assess intervention strategies. The asynchronicity of infection patterns during this pandemic illustrates the need for models that can capture dynamics beyond a single-peak trajectory to forecast the 5 worldwide spread and for the spread within nations and within other sub-regions at various geographic scales. Here, we demonstrate a five-parameter sub-epidemic wave modeling framework that provides a simple characterization of unfolding trajectories of epidemics that are progressing across the world at different spatial scales. We calibrate the model to daily reported COVID-19 incidence data to generate six sequential weekly forecasts for five 10 European countries and five hotspot states within the United States. The sub-epidemic approach captures the rise to an initial peak followed by a wide range of post-peak behavior, ranging from a typical decline to a steady incidence level to repeated small waves for sub-epidemic outbreaks. We show that the sub-epidemic model outperforms a three-parameter Richards model, in terms of calibration and forecasting performance, and yields excellent short-and intermediate-term 15 forecasts that are not attainable with other single-peak transmission models of similar complexity. Overall, this approach predicts that a relaxation of social distancing measures would result in continuing sub-epidemics and ongoing endemic transmission. We illustrate how this view of the epidemic could help data scientists and policymakers better understand and predict the underlying transmission dynamics of COVID-19, as early detection of potential sub-20 epidemics can inform model-based decisions for tighter distancing controls. The asynchronicity of the infection patterns of the current coronavirus disease 2019 (COVID-19) pandemic illustrates the need for models that can capture complex dynamics beyond a single-peak trajectory to forecast the worldwide spread. This is also true for the spread within nations and within other sub-regions at various geographic scales. The infections in these 5 asynchronous transmission networks underlie the reported infection data and need to be accounted for in forecasting models. We analyze the COVID-19 pandemic assuming that the total number of new infections is the sum of all the infections created in multiple asynchronous outbreaks at differing spatial scales. We assume there are weak ties across sub-populations, so we represent the overall 10 epidemic as an aggregation of sub-epidemics, rather than a single, universally connected outbreak. The sub-epidemics can start at different time points and affect different segments of the population in different geographic areas. Thus, we model sub-epidemics associated with transmission chains that are asynchronously triggered and that progress somewhat independently from the other sub-epidemics. based simulations. Our sub-epidemic approach addresses this need in both the emerging and endemic stages of an epidemic. This approach is analogous to the model used by Blower et al. (2) to demonstrate how the rise and endemic leveling of tuberculosis outbreaks could be explained by dynamical changes in the transmission parameters. A related multi-stage approach was used by Garnett (3) to explain 5 the pattern of spread for sexually transmitted diseases and changes in the reproductive number during the course of an epidemic. Rothenberg et al. (4) demonstrated that the national curve of Penicillinase-Producing Neisseria gonorrhoeae occurrence resulted from multiple asynchronous outbreaks. As with HIV/AIDS, which has now entered a phase of intractable endemic transmission 10 in some areas (5), COVID-19 is likely to become endemic. New vaccines and pharmacotherapy might mitigate the transmission, but the disease will not be eradicated in the foreseeable future. Some earlier predictions based on mathematical models predicted that COVID-19 would soon disappear or approach a very low-level endemic equilibrium determined by herd immunity. To avoid unrealistic medium-range projections, some investigators artificially truncate the model 15 projections before the model reaches these unrealistic forecasts. Here, we demonstrate a five-parameter sub-epidemic wave modeling framework that provides a simple characterization of unfolding trajectories of COVID-19 epidemics that are progressing across the world at different spatial scales (6) . We systematically assess calibration and forecasting performance for the ongoing COVID-19 pandemic in hotspots located in the 20 USA and Europe using the sub-epidemic wave model, and we compare results with those obtained using the Richards model, a well-known three-parameter single-peak growth model (7). The sub-epidemic approach captures the rise to an initial peak followed by a wide range of post-. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint peak behavior, ranging from a typical decline to a steady incidence level to repeated small waves for sub-epidemic outbreaks. This framework yields excellent short-and intermediate-term forecasts that are not attainable with other single-peak transmission models of similar complexity, whether mechanistic or phenomenological. We illustrate how this view of the epidemic could help data scientists and policymakers better understand and predict the 5 underlying transmission dynamics of COVID-19. We retrieved daily reported cumulative case data of the COVID-19 pandemic for France, 10 the United Kingdom (UK), and the United States of America (USA) from the World Health Organization (WHO) website (8) and for Spain and Italy from the corresponding governmental websites (9, 10) from early February to May 24, 2020. We calculated the daily incidence from the cumulative trajectory and analyzed the incidence trajectory for the 5 countries. 15 We also retrieved daily cumulative case count data from The COVID Tracking Project (11) from February 27, 2020 to May 24, 2020 for five representative COVID-19 hotspot states in the USA, namely New York, Louisiana, Georgia, Arizona and Washington. 20 The concept of weak ties was originally proposed by Granovetter in 1973 (12) to form a connection between microevents and macro events. We use this idea to link the person-to-person . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted July 4, 2020. In the absence of native immunity, specific viricidal treatment, or a working vaccine, our non-pharmacological preventive tools-testing, contact tracing, social separation, isolation, 10 lockdown-are the key influences on sub-epidemic spread. The continued importation of new cases will result in low-level endemic transmission. A model based on sub-epidemic events can forecast the level of endemic spread at a steady state. This can then be used to guide intervention efforts accounting for the continued seeding of new infections. 15 We use a five-parameter epidemic wave model that aggregates linked overlapping subepidemics (6) . The strength (e.g., weak vs. strong) of the overlap determines when the next subepidemic is triggered and is controlled by the onset threshold parameter, . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted July 4, 2020. Intermediate values of (0 0, then the total number of sub-epidemics n tot is finite and depends on C thr , q, and, K 0 . The sub-epidemic is only triggered if C thr ≤ K i , resulting in a finite number of subepidemics, 10 The brackets denote the largest integer that is smaller than or equal to *. The total size of the epidemic wave composed of n tot overlapping sub-epidemics has a closed-form solution: The consecutive sub-epidemics decline according to the inverse function given by: When q > 0 , then the total number of sub-epidemics n tot is finite and is given by: . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint The total size of an epidemic wave is the sum of n overlapping sub-epidemics, In the absence of control interventions or behavior change ( q = 0 ), the total epidemic size depends on a given 5 number n of sub-epidemics, The initial number of cases is given by C 1 (0) = I 0 where I 0 is the initial number of cases in the observed case data. The cumulative cases, , is the sum of all cumulative infections over the n overlapping subepidemics waves: 10 Fitting the model to the time series of case incidence requires estimating up to five model parameters Θ = (C thr ,q,r, p, K ) . If a single sub-epidemic is sufficient to fit the data, then the model is simplified to the three-parameter generalized-logistic growth model. The model parameters were estimated by a nonlinear least 15 square fit of the model solution to the observed incidence data (2). This is achieved by searching for the set of parameters Θ =θ 1 ,θ 2 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint where t i are the time points at which the time-series data are observed, and N is the number of data points available for inference. Hence, the model solution f (t i ,Θ) yields the best fit to the time series data y t i , where Θ is the vector of parameter estimates. We solve the nonlinear least squares problem using the trust-region reflective algorithm. We used parametric bootstrap, assuming an error structure described in the next section, to quantify the uncertainty in the 5 parameters obtained by a non-linear least squares fit of the data, as described in refs. (3, 4) . Our best-fit model solution is given by f (t,Θ) where Θ is the vector of parameter estimates. Our MATLAB (The MathWorks, Inc) code for model fitting along with outbreak datasets is publicly available (5). The confidence interval for each estimated parameter and 95% prediction intervals of the model fits were We model a negative binomial distribution for the error structure and assume a constant variance/mean ratio over time (i.e., the overdispersion parameter). To estimate this constant ratio, we group every four daily 20 observations into a bin across time, calculate the mean and variance for each bin, and then estimate a constant variance/mean ratio by calculating the average of the variance/mean ratios over these bins. Exploratory analyses indicate that this ratio is frequently stable across bins, except for 1-2 extremely large values, which could result from a sudden increase or decrease in the number of reported cases. These sudden changes could result from changes in case definition or a weekend effect whereby the number of reported cases decreases systematically during 25 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint weekends. Hence, these extreme large values of variance/mean ratio are excluded when estimating the constant variance/mean ratio. To assess both the quality of the model fit and the short-term forecasts, we used four performance metrics: 5 the mean absolute error (MAE), the mean squared error (MSE), the coverage of the 95% prediction intervals, and the mean interval score (MIS) (6) . The mean absolute error (MAE) is given by: Here y t i is the time series of incident cases describing the epidemic wave where are the time points of the time series data (7). Similarly, the mean squared error (MSE) is given by: In addition, we assessed the coverage of the 95% prediction interval, e.g., the proportion of the 10 observations that fell within the 95% prediction interval as well as a metric that addresses the width of the 95% prediction interval as well as coverage via the mean interval score (MIS) (6, 8) which is given by: where L t and U t are the lower and upper bounds of the 95% prediction interval and Ι{} is an indicator function. Thus, this metric rewards for narrow 95% prediction intervals and penalizes at the points where the observations are outside the bounds specified by the 95% prediction interval where the width of the prediction interval adds up to the penalty (if any) (6). The mean interval score (MIS) and the coverage of the 95% prediction intervals take into account the uncertainty of the predictions whereas the mean absolute error (MAE) and mean squared error (MSE) only assess 20 t i . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint the closeness of the mean trajectory of the epidemic to the observations (9). These performance metrics have also been adopted in international forecasting competitions (8). For comparison purposes, we compare the performance of the sub-epidemic wave model with that obtained from the 3-parameter Richards model (10), a well-known single-peak growth model given by: where θ determines the deviation from symmetry, and again r is the growth rate, and K is the final epidemic size. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S1 . The calibration performance metrics across five countries are uniformly better for the overlapping subepidemic models (for MAE, MSE, and MIS, smaller is better; for % covered, larger is better). . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S2 . The calibration performance metrics across five hotspots in the USA are uniformly better for the overlapping sub-epidemic models (for MAE, MSE, and MIS, smaller is better; for % covered, larger is better). . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S3 . The best fit of the sub-epidemic model to the COVID-19 epidemic in Italy. The sub-epidemic wave model successfully captures the multimodal pattern of the COVID-19 epidemic. Further, parameter estimates are well identified, as indicated by their relatively narrow confidence intervals. The top panels display the empirical distribution of r, p, K,and q. Bottom panels show the model fit (left), the sub-epidemic profile (center), and the residuals (right). Black circles correspond to the data points. The best model fit (solid red line) and 95% prediction 5 interval (dashed red lines) are also shown. Cyan curves are the associated uncertainty from individual bootstrapped curves. Three hundred realizations of the sub-epidemic waves are plotted using different colors. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S4 . The best fit of the sub-epidemic model to the COVID-19 epidemic in France. The sub-epidemic wave model successfully captures the multimodal pattern of the COVID-19 epidemic. Further, parameter estimates are well identified, as indicated by their relatively narrow confidence intervals. The top panels display the empirical distribution of r, p, K,and q. Bottom panels show the model fit (left), the sub-epidemic profile (center), and the residuals (right). Black circles correspond to the data points. The best model fit (solid red line) and 95% prediction 5 interval (dashed red lines) are also shown. Cyan curves are the associated uncertainty from individual bootstrapped curves. Three hundred realizations of the sub-epidemic waves are plotted using different colors. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S5 . The best fit of the sub-epidemic model to the COVID-19 epidemic in the United Kingdom. The subepidemic wave model successfully captures the multimodal pattern of the COVID-19 epidemic. Further, parameter estimates are well identified, as indicated by their relatively narrow confidence intervals. The top panels display the empirical distribution of r, p, K,and q. Bottom panels show the model fit (left), the sub-epidemic profile (center), and the residuals (right). Black circles correspond to the data points. The best model fit (solid red line) and 95% 5 prediction interval (dashed red lines) are also shown. Cyan curves are the associated uncertainty from individual bootstrapped curves. Three hundred realizations of the sub-epidemic waves are plotted using different colors. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S6 . The best fit of the sub-epidemic model to the COVID-19 epidemic in Louisiana, USA. The subepidemic wave model successfully captures the multimodal pattern of the COVID-19 epidemic. Further, parameter estimates are well identified, as indicated by their relatively narrow confidence intervals. The top panels display the empirical distribution of r, p, K,and q. Bottom panels show the model fit (left), the sub-epidemic profile (center), and the residuals (right). Black circles correspond to the data points. The best model fit (solid red line) and 95% 5 prediction interval (dashed red lines) are also shown. Cyan curves are the associated uncertainty from individual bootstrapped curves. Three hundred realizations of the sub-epidemic waves are plotted using different colors. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S7 . The best fit of the sub-epidemic model to the COVID-19 epidemic in Georgia, USA. The sub-epidemic wave model successfully captures the multimodal pattern of the COVID-19 epidemic. Further, parameter estimates are well identified, as indicated by their relatively narrow confidence intervals. The top panels display the empirical distribution of r, p, K,and q. Bottom panels show the model fit (left), the sub-epidemic profile (center), and the residuals (right). Black circles correspond to the data points. The best model fit (solid red line) and 95% prediction 5 interval (dashed red lines) are also shown. Cyan curves are the associated uncertainty from individual bootstrapped curves. Three hundred realizations of the sub-epidemic waves are plotted using different colors. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S8 . The best fit of the sub-epidemic model to the COVID-19 epidemic in Arizona, USA. The sub-epidemic wave model successfully captures the multimodal pattern of the COVID-19 epidemic. Further, parameter estimates are well identified, as indicated by their relatively narrow confidence intervals. The top panels display the empirical distribution of r, p, K,and q. Bottom panels show the model fit (left), the sub-epidemic profile (center), and the residuals (right). Black circles correspond to the data points. The best model fit (solid red line) and 95% prediction 5 interval (dashed red lines) are also shown. Cyan curves are the associated uncertainty from individual bootstrapped curves. Three hundred realizations of the sub-epidemic waves are plotted using different colors. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S9 . The best fit of the sub-epidemic model to the COVID-19 epidemic in Washington. The sub-epidemic wave model successfully captures the multimodal pattern of the COVID-19 epidemic. Further, parameter estimates are well identified, as indicated by their relatively narrow confidence intervals. The top panels display the empirical distribution of r, p, K,and q. Bottom panels show the model fit (left), the sub-epidemic profile (center), and the residuals (right). Black circles correspond to the data points. The best model fit (solid red line) and 95% prediction 5 interval (dashed red lines) are also shown. Cyan curves are the associated uncertainty from individual bootstrapped curves. Three hundred realizations of the sub-epidemic waves are plotted using different colors. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S10 . The sub-epidemic decline function across countries and USA states based on results presented in Table 1 . 5 10 le . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S11 . Mean performance of the sub-epidemic wave and the Richards models in 2-20 day ahead forecasts conducted during the epidemic in Italy. The sub-epidemic model outperformed the Richards model across all metrics and forecasting horizons except for 2-day ahead forecasts based on the MAE and the MSE. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S12 . Mean performance of the sub-epidemic wave and the Richards models in 2-20 day ahead forecasts conducted during the epidemic in Spain. The sub-epidemic model outperformed the Richards model across all metrics and forecasting horizons, but the MSE and MAE reached similar values at longer forecasting horizons. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S13 . Mean performance of the sub-epidemic wave and the Richards models in 2-20 day ahead forecasts conducted during the epidemic in France. The sub-epidemic model outperformed the Richards model across all metrics and forecasting horizons. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S14 . Mean performance of the sub-epidemic wave and the Richards models in 2-20 day ahead forecasts conducted during the epidemic in the UK. The sub-epidemic model outperformed the Richards model across all metrics and forecasting horizons except for 2-day ahead forecasts for which the Richards model reached somewhat better performance. 5 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S15 . Mean performance of the sub-epidemic wave and the Richards models in 2-20 day ahead forecasts conducted during the epidemic in New York. The sub-epidemic model outperformed the Richards model across all forecasting horizons based on the PI Coverage and the MIS except for 2-day ahead forecasts. However, the Richards model more frequently outperformed the sub-epidemic wave model based on the MAE and MSE. 5 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S16 . Mean performance of the sub-epidemic wave and the Richards models in 2-20 day ahead forecasts conducted during the epidemic in Louisiana. The sub-epidemic model outperformed the Richards model across all metrics and forecasting horizons. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S17 . Mean performance of the sub-epidemic wave and the Richards models in 2-20 day ahead forecasts conducted during the epidemic in Georgia. The sub-epidemic model outperformed the Richards model across all metrics and forecasting horizons. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S18 . Mean performance of the sub-epidemic wave and the Richards models in 2-20 day ahead forecasts conducted during the epidemic in Arizona. The sub-epidemic model outperformed the Richards model across all metrics and forecasting horizons. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S19 . Mean performance of the sub-epidemic wave and the Richards models in 2-20 day ahead forecasts conducted during the epidemic in Washington. The sub-epidemic model outperformed the Richards model across all metrics and forecasting horizons. 5 10 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S20 . Sequential 20-day ahead forecasts of the sub-epidemic wave model for the COVID-19 epidemic in the USA. Black circles correspond to the data points. The model fit (solid red line) and 95% prediction interval (dashed red lines) are also shown. The vertical line separates the calibration period (left) from the forecasting period (right). The sequential forecasts were conducted on March 30, April 6, April 13, April 20, April 27, and May 4, 2020. 5 10 Figure S21 . Sequential 20-day ahead forecasts of the sub-epidemic wave model for the COVID-19 epidemic in Italy. Black circles correspond to the data points. The model fit (solid red line) and 95% prediction interval (dashed red lines) are also shown. The vertical line separates the calibration period (left) from the forecasting period (right). 15 The sequential forecasts were conducted on March 30, April 6, April 13, April 20, April 27, and May 4, 2020. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint 5 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S22 . Sequential 20-day ahead forecasts of the sub-epidemic wave model for the COVID-19 epidemic in France. Black circles correspond to the data points. The model fit (solid red line) and 95% prediction interval (dashed red lines) are also shown. The vertical line separates the calibration period (left) from the forecasting period (right). The sequential forecasts were conducted on March 30, April 6, April 13, April 20, April 27, and May 4, 2020. 5 10 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S23 . Sequential 20-day ahead forecasts of the sub-epidemic wave model for the COVID-19 epidemic in Spain. Black circles correspond to the data points. The model fit (solid red line) and 95% prediction interval (dashed red lines) are also shown. The vertical line separates the calibration period (left) from the forecasting period (right). The sequential forecasts were conducted on March 30, April 6, April 13, April 20, April 27, and May 4, 2020. 5 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S24 . Sequential 20-day ahead forecasts of the sub-epidemic wave model for the COVID-19 epidemic in the UK. Black circles correspond to the data points. The model fit (solid red line) and 95% prediction interval (dashed red lines) are also shown. The vertical line separates the calibration period (left) from the forecasting period (right). The sequential forecasts were conducted on March 30, April 6, April 13, April 20, April 27, and May 4, 2020. 5 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S25 . Sequential 20-day ahead forecasts of the sub-epidemic wave model for the COVID-19 epidemic in New York State. Black circles correspond to the data points. The model fit (solid red line) and 95% prediction interval (dashed red lines) are also shown. The vertical line separates the calibration period (left) from the forecasting period (right). The sequential forecasts were conducted on March 30, April 6, April 13, April 20, April 27, and May 4, 2020. 5 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S26 . Sub-epidemic profiles of the sequential 20-day ahead forecasts for the COVID-19 epidemic in New York. Different colors represent different sub-epidemics of the epidemic wave profile. The aggregated trajectories are shown in gray and black circles correspond to the data points. The vertical line separates the calibration period (left) from the forecasting period (right). The sequential forecasts were conducted on March 30, April 6, April 13, April 20, April 27, and May 4, 2020. 5 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S27 . Sequential 20-day ahead forecasts of the sub-epidemic wave model for the COVID-19 epidemic in Louisiana. Black circles correspond to the data points. The model fit (solid red line) and 95% prediction interval (dashed red lines) are also shown. The vertical line separates the calibration period (left) from the forecasting period (right). The sequential forecasts were conducted on March 30, April 6, April 13, April 20, April 27, and May 4, 2020. 5 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S28 . Sequential 20-day ahead forecasts of the sub-epidemic wave model for the COVID-19 epidemic in Georgia. Black circles correspond to the data points. The model fit (solid red line) and 95% prediction interval (dashed red lines) are also shown. The vertical line separates the calibration period (left) from the forecasting period (right). The sequential forecasts were conducted on March 30, April 6, April 13, April 20, April 27, and May 4, 2020. 5 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S29 . Sequential 20-day ahead forecasts of the sub-epidemic wave model for the COVID-19 epidemic in Arizona. Black circles correspond to the data points. The model fit (solid red line) and 95% prediction interval (dashed red lines) are also shown. The vertical line separates the calibration period (left) from the forecasting period (right). The sequential forecasts were conducted on March 30, April 6, April 13, April 20, April 27, and May 4, 2020. 5 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Figure S30 . Sequential 20-day ahead forecasts of the sub-epidemic wave model for the COVID-19 epidemic in Washington. Black circles correspond to the data points. The model fit (solid red line) and 95% prediction interval (dashed red lines) are also shown. The vertical line separates the calibration period (left) from the forecasting period (right). The sequential forecasts were conducted on March 30, April 6, April 13, April 20, April 27, and May 4, 2020. 5 10 15 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted July 4, 2020. . https://doi.org/10.1101/2020.07.03.20146159 doi: medRxiv preprint Caution Warranted: Using the Institute for Health Metrics and Evaluation Model for Predicting the Course of the COVID-19 Pandemic The intrinsic transmission dynamics of 20 tuberculosis epidemics The geographical and temporal evolution of sexually transmitted disease epidemics Epidemiologic Aspects of Control of Penicillinase-Producing Neisseria gonorrhoeae The HIV endemic: maintaining disease transmission in at-risk urban areas A novel sub-epidemic modeling framework for shortterm forecasting epidemic waves The strength of weak ties Real-time forecast of multiphase outbreak Modeling and inverse problems in the presence of uncertainty Transmission dynamics of the great influenza pandemic of Fitting dynamic models to epidemic outbreaks with quantified uncertainty: A Primer for parameter uncertainty, identifiability, and forecasts A novel sub-epidemic modeling framework for shortterm forecasting epidemic waves: Datasets and fitting code. figshare Strictly proper scoring rules, prediction, and estimation 21. M4Competition. Competitor's Guide: Prizes and Rules A novel sub-epidemic modeling framework for short-term forecasting epidemic waves Modeling and inverse problems in the presence of uncertainty Transmission dynamics of the great influenza pandemic of Fitting dynamic models to epidemic outbreaks with quantified uncertainty: A Primer for parameter uncertainty, identifiability, and forecasts Author contributions: GC conceived the study. KR and AT contributed to data analysis. All