key: cord-0706638-y691hark authors: Desjardins, M.R.; Hohl, A.; Delmelle, E.M. title: Rapid surveillance of COVID-19 in the United States using a prospective space-time scan statistic: Detecting and evaluating emerging clusters date: 2020-04-08 journal: Appl Geogr DOI: 10.1016/j.apgeog.2020.102202 sha: ccb223e997151f795690b6ce962510dea02394f9 doc_id: 706638 cord_uid: y691hark Coronavirus disease 2019 (COVID-19) was first identified in Wuhan, China in December 2019, and is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). COVID-19 is a pandemic with an estimated death rate between 1% and 5%; and an estimated [Formula: see text] between 2.2 and 6.7 according to various sources. As of March 28th, 2020, there were over 649,000 confirmed cases and 30,249 total deaths, globally. In the United States, there were over 115,500 cases and 1891 deaths and this number is likely to increase rapidly. It is critical to detect clusters of COVID-19 to better allocate resources and improve decision-making as the outbreaks continue to grow. Using daily case data at the county level provided by Johns Hopkins University, we conducted a prospective spatial-temporal analysis with SaTScan. We detect statistically significant space-time clusters of COVID-19 at the county level in the U.S. between January 22nd-March 9th, 2020, and January 22nd-March 27th, 2020. The space-time prospective scan statistic detected “active” and emerging clusters that are present at the end of our study periods – notably, 18 more clusters were detected when adding the updated case data. These timely results can inform public health officials and decision makers about where to improve the allocation of resources, testing sites; also, where to implement stricter quarantines and travel bans. As more data becomes available, the statistic can be rerun to support timely surveillance of COVID-19, demonstrated here. Our research is the first geographic study that utilizes space-time statistics to monitor COVID-19 in the U.S. Coronavirus disease 2019 was first identified in Wuhan city, Hubei province, China in December of 2019 (Huang et al., 2020; Li et al., 2020) and is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). COVID-19 is a pandemic (cases confirmed in more than 140 territories) with an estimated death rate between 1% and 5% (Roser & Ritchie, 2020) ; and an estimated R 0 between 2.2 and 6.7 (Liu, Gayle, Wilder-Smith, & Rockl€ ov, 2020; Sanche et al., 2020) . As of March 28th, 2020, there were over 649,000 confirmed cases and 115, 500 total deaths, globally. In the United States (U.S.), there were over 115,000 cases and 1891 deaths (Dong, Du, & Gardner, 2020) . Approximately 80% of confirmed cases are mild, with symptoms including fever, cough, and shortness of breath (Ruan, Yang, Wang, Jiang, & Song, 2020) . Severe cases may experience pneumonia, multi-organ failure, and death (Mahase, 2020) . The vast majority of deaths from COVID-19 are those with preexisting conditions (e.g. hypertension and heart disease), are immunocompromised, or above 60 years old (Wu & McGoogan, 2020) . During an emerging infectious disease like COVID-19, it is critical to implement space-time surveillance that can prioritize locations for targeted interventions, rapid testing, and resource allocation. One such method is the space-time scan statistic (Kulldorff, 1997) , which is widely used to identify significant clusters of disease. Space-time scan statistics supplement and can study basic rate maps of disease by relying on a variety of data models to determine whether the observed space-time patterns of a disease are due to chance or randomly distributed. In other words, scan statistics detect clusters that are outliers (e.g. unexpected clustering given baseline conditions). The statistic utilizes circles or ellipses (scanning window) that are centered on grid points and move (scan) systematically across a study area to identify clusters of cases (each window counts number of aggregated cases per geographic unit). In its space-extension, the location, size, and duration of statistically significant clusters of disease cases are subsequently reported (Desjardins et al. 2018; Owusu, Desjardins, Baker, & Delmelle, 2019; Whiteman, Desjardins, Eskildsen, & Loaiza, 2019; Desjardins, Hohl, Delmelle, & Casas, 2020) . To routinely monitor outbreaks, the prospective space-time scan statistic (Kulldorff, 2001) is one method to detect "active" or emerging clusters of disease, which can be used for surveillance during an ongoing epidemic. The statistic will detect clusters that are "active" at the end of the study period; but as more data (e.g. confirmed cases) becomes available, the statistic can be rerun to confirm the presence and track the clusters in space and time, update relative risks for each location affected by a disease, and detect new emerging clusters. The main purpose of using a prospective statistic rather than retrospective is to only focus on significant clustering that is "active" or present at the time of the analysis; which disregards clusters that may have existed previously, and are no longer a public health threat (Kulldorff, 2001) . For example, the prospective space-time scan statistic has been utilized to detect emerging clusters of shigellosis (Jones, Liberatore, Fernandez, & Gerber, 2006) , measles (Yin, Li, Ma, & Feng, 2007) , thyroid cancer (Kulldorff, 2001) , and syndromic surveillance (Yih et al., 2010) . Since COVID-19 data are updated daily, our approach can contribute to timely monitoring of the pandemic, focusing on the United States in this study. This study contributes to ongoing COVID-19 surveillance efforts by detecting significant space-time clusters of reported cases at the county level in the U.S. The space-time prospective statistic is especially useful since it detects active and emerging clusters of COVID-19, which can inform public health officials and decision-makers where and when to improve targeted interventions, testing sites, and necessary isolation measures to mitigate further transmission. Our prospective analysis can be rerun each day as new data become available to detect new emerging clusters and identify areas where transmission is decreasing; suggesting where COVID-19 is potentially no longer a public health threat. To demonstrate the notion of detecting new emerging clusters when adding updated case data using the prospective space-time scan statistic, we report results for two time periods: January 22nd-March 9th, 2020 and January 22nd-March 27th, 2020. Since COVID-19 is a highly infectious disease that can affect all segments of the population, we decided not to adjust for age. However, since the highest proportion of deaths occur among the elderly and those with preexisting conditions, an age-adjusted Bernoulli model accounting for cases and deaths could be conducted but is beyond the scope of this research. We collected COVID-19 case and location data from Johns Hopkins University's Center for Systems Science and Engineering GIS dashboard (Dong et al., 2020) . These data are freely available on their GitHub page (https://github.com/CSSEGISandData/COVID-19). Temporally, these data are currently updated daily, and we use available data between January 22nd and March 27th, 2020. Spatially, the daily confirmed cases if COVID-19 are aggregated at the county level. Using the spatial location information in the COVID-19 dataset, we assigned the case counts to the appropriate counties in a geographic information systems compatible file we obtained from the U.S. Census. We focused our analysis on the contiguous 48 states and Washington D. C., excluded cases recorded at the state-level (no county-level information available) and cases diagnosed on the "Grand Princess" and "Diamond Princess" cruise ships. The infected passengers on the cruise ships were sent to various quarantine locations throughout the U.S. and their exact locations are not provided in the dataset. The COVID-19 dataset reports cumulative case counts (Fig. 1) . Therefore, for each day in the study period, we subtracted the previous day's count (n t-1 ) from the current day's count (n t ) to obtain the number of new cases. To identify space-time clusters that are still occurring or "active", we utilize the prospective version of the Poisson space-time scan statistic (Kulldorff, 2001; Kulldorff, Athas, Feurer, Miller, & Key, 1998) and implemented in SaTScan™ (Kulldorff, 2018) . As such, we can identify COVID-19 clusters that are still active (excess risk still present) during the last day in our dataset. In other words, we detect space-time clusters of COVID-19 that are emerging and "disregard" clusters in the study period that do not have a statistically significant excess relative risk (i.e. more observed than expected COVID-19 cases). In other words, the prospective statistic evaluates potential clusters that are still occurring at the end of the study period. The space-time scan statistic (STSS) employs moving cylinders that scan the U.S. for potential space-time clusters of COVID-19 cases. The base of the cylinder is the spatial scanning window and the height reflects the temporal scanning window. The center of the cylinder is defined as the centroid of each U.S. county. Next, each cylinder is expanded until a maximum spatial and temporal upper bound is reached, while each cylinder is a potential cluster. We set the upper bounds to have a maximum spatial and temporal scanning window size of 10% of the population at-risk to avoid extremely large clusters; and 50% of the study period, respectively. Each cluster's duration was set to a minimum of 2 days and a cluster must contain at least 5 confirmed cases of COVID-19. In other words, an unknown large number of cylinders of different spatial and temporal sizes are generated around each centroid until the maximum spatial and temporal thresholds are reached; the observed and expected case counts are computed within each cylinder, which are derived from the total number of centroids captured in each cylinder. We selected the discrete Poisson data model, where we assume that the COVID-19 cases follow a Poisson distribution according to the population of the geographic region. The null hypothesis H 0 states that the model reflects a constant risk with an intensity μ, which is proportional to the at-risk population. The alternative hypothesis H A states that the number of observed COVID-19 cases exceeds the number of expected cases derived from the null model (elevated risk within a cylinder). The expected number of COVID-19 cases (μ) under the null hypothesis H 0 is derived as follows in Equation (1): with p the population in i; C the total COVID-19 cases in the U.S.; and P the total estimated population in the U.S. Note that the model assumes that the population is static for each location at each time period. A maximum likelihood ratio test is used to identify scanning windows with an elevated risk for COVID-19, which is defined in Equation (2): with L(Z) the likelihood function for cylinder Z, and L 0 the likelihood function for H 0; n Z the number of COVID-19 cases in a cylinder; μðZÞ the number of expected cases in cylinder Z; N the total number of observed cases for the entire U.S. across all time periods; and μðTÞ the total number of expected cases in the study area across all time periods. The cylinder has an elevated risk when the likelihood ratio is greater than 1, that is nZ μðZÞ > NÀ nZ NÀ μ ðZÞ . Furthermore, the space-time scan statistic uses different cylinder sizes, and the cylinder with the highest likelihood ratio (maximum) is the most likely cluster. Monte Carlo testing is utilized (999 simulations) to assess the statistical significance of space-time clusters. Each simulation is conditioned to the same number of cases, and the likelihood is computed, so we obtain 999 likelihood ratios for each candidate cluster representing the distribution of the likelihood ratio under H 0. Secondary clusters are also reported if they are statistically significant at the p < 0.05 level. To circumvent the assumption that the relative risk of COVID-19 is homogenous throughout a significant space-time cluster, we also report and visualize the relative risk for each U.S. county that belongs to a cluster. The relative risk (RR) for each location belonging to a cluster is derived from Equation (3): where c is the total number of COVID-19 cases in a county, e is the total number of expected cases in a county, and C is the total number of observed cases in the U.S. RR is the estimated risk within a location divided by the risk outside of the location (i.e. everywhere else). For example, if a county has a RR of 2.5, then the population within that county are 2.5 times more likely to be exposed to COVID-19. The reported clusters also have a relative risk, which is derived the same way as Equation (3); but the clusters RR is estimated risk (observed/expected) divided by the risk outside of the cluster. The incubation period of COVID-19 can be up to 2 weeks, so we detected active clusters that spanned �42 days, which is approximately three incubation periods of onset of the most current COVID-19 case in the dataset. The results identify statistically significant emerging clusters of COVID-19 in the U.S. at the county level between January 22nd-March 9th, 2020 in section 3.1 and between January 22nd-March 27th, 2020 in section 3.2. As the pandemic continues, new data can be added the prospective space-time scan statistic to monitor active clusters and identify areas that no longer are experiencing excess incidence based on available confirmed cases (i.e. areas that no longer have an excess public health risk). Table 1 provides the characteristics of the statistically significant emerging space-time clusters of COVID-19 at the county level from January 22nd and March 9th, 2020. Cluster 1 is found in the northwestern U.S. and includes 23 counties with a RR > 1 (i.e. more observed than expected cases). King County in Washington has a RR of 135.4 with 82 observed cases at the time of this study, and Santa Clara County in California contained 36 observed cases and a RR of 62. Cluster 2 only contains one county (Westchester) in New York with a RR of 639 and 97 observed cases. Cluster 3 contains counties in the mid-Atlantic region of the U.S. with nine counties exhibiting a RR > 1. Nassau County in Long Island, New York contained the highest RR of 80.4 with 17 observed cases. Cluster 4 is in eastern Texas and contains two counties with a RR > 1 (Fort Bend -RR ¼ 47.9; Harris -RR ¼ 8). Cluster 5 is located in northern Georgia with 4 counties with an elevated relative risk: Polk (RR ¼ 104.7), Fulton (RR ¼ 21.3), Cobb (RR ¼ 17.7), and Cherokee (RR ¼ 17.5). Cluster 6 is located in the Midwest, where Summit County, Colorado (RR ¼ 250.9) and Johnson County, Iowa (RR ¼ 154.9) exhibits the highest relative risk. Cluster 7 contains two counties in southern California: Los Angeles (RR ¼ 6.8) and Orange (RR ¼ 4.9). Finally, Cluster 8 is located in southern Florida and contains 4 counties with an elevated risk: Charlotte (RR ¼ 56), Manatee (RR ¼ 52.6), Lee (RR ¼ 27.5), and Broward (RR ¼ 16). Fig. 2 illustrates the extent of eight emerging space-time clusters of COVID-19 at the county-level from January 22nd to March 9th, 2020. We highlight both King (Washington) and Westchester (New York) counties, which are known as the first major hotspots of the outbreaks in the US. King County is known to have the first U.S. case of COVID-19, which was introduced by recent travelers in China; leading to deadly outbreaks in nursing homes and the surrounding area (Bryson-Cahn et al., 2020) . Westchester County includes the city of New Rochelle, which was the location of New York's initial outbreak and was subject to a containment zone spanning a one-mile radius (Wallis, 2020) . The Bay area in California (especially San Francisco) has also been as a major hotspot of COVID-19, which was one of the first areas in the U.S. to implement a "shelter-in-place" order (Fracassa, 2020) . Counties with a relative risk of 0 are more transparent to focus solely on the counties with an elevated risk that "contribute" to the emerging clusters. Fig. 1 indicates that many densely populated counties were within an emerging cluster across the U.S., while we continue to monitor the outbreaks and detect new clusters in the following section using eighteen more days of case data. Table 2 summarizes the characteristics of the twenty-six statistically significant emerging space-time clusters of COVID-19 at the county level between January 22nd and March 27th, 2020. Cluster 1 (the most likely cluster) contains 14 counties in New York (NY), Connecticut, and New Jersey, and Manhattan, NY exhibits the highest RR of 96.8; which was also the highest RR in the U.S. at the time of the analysis. Cluster 2 Table 1 Emerging space-time clusters of COVID-19 from January 22nd-March 9th, 2020 at the county-level (RR ¼ relative risk). Duration ( contains 3 counties in Michigan, and Wayne County exhibiting the highest RR of 4.9. Cluster 3 contains two parishes in the southeastern part of Louisiana and included the New Orleans consolidated city-parish exhibiting a RR of 9.0. Clusters 4, 9, 10, 12-16, 23, and 26 contains one county each: Cook, Illinois (RR ¼ 3.1), Blaine, Idaho (RR ¼ 19.1) which includes the town of Sun Valley and is considered the Idaho COVID-19 hotspot at the time of this publication, Marion, Indiana (RR ¼ 3.7), Summit, Utah (RR ¼ 8.2), Cleburne, Arkansas (RR ¼ 12.8), Caddo, Louisiana (RR ¼ 4.5), Bartow, Georgia (RR ¼ 3.7), Kershaw, South Carolina (RR ¼ 4.5), Clark, Arkansas (RR ¼ 5.3) and Wasatch, Utah (RR ¼ 4.2), respectively. Cluster 5 contains 4 counties in northern Washington State, with Snohomish County exhibiting the highest RR of 2.6. Cluster 6 contains 5 counties in Georgia, and Dougherty County exhibits the highest RR of 8.6. Cluster 7 contains 3 counties in Colorado with a RR > 1, and Gunnison County exhibiting the highest RR of 9.8. Cluster 8 is the largest cluster that Cluster 11 contains 3 counties in Florida, and Broward County exhibits the highest RR of 2.2. Cluster 17 contains 11 counties in Georgia with a RR > 1, and Carroll County exhibits the highest RR of 3.9. Cluster 18 contains 4 counties in Indiana with a RR > 1, and Decatur County exhibits the highest RR of 11.5. Cluster 19 contains 2 counties in Missouri, and St. Louis exhibits the highest RR of 1.9. Cluster 20 contains two counties in California, and San Francisco exhibits the highest RR of 1.9. Cluster 21 contains 3 counties in Tennessee with a RR > 1, and Davidson County exhibits the highest RR of 1.7. Cluster 22 contains 3 counties in Colorado, and Denver exhibits the highest RR of 1.7. Cluster 24 contains 3 counties in Alabama, and Walker County exhibits the highest RR of 3.0. Finally, Cluster 25 contains 5 counties in Mississippi with a RR > 1, and Quitman County exhibits the highest RR of 6.9. Cluster 11 contains 3 counties in Florida, and Broward County exhibits the highest RR of 2.2. Cluster 17 contains 11 counties in Georgia with a RR > 1, and Carroll County exhibits the highest RR of 3.9. Cluster 18 contains 4 counties in Indiana with a RR > 1, and Decatur County exhibits the highest RR of 11.5. Cluster 19 contains 2 counties in Missouri, and St. Louis exhibits the highest RR of 1.9. Cluster 20 contains two counties in California, and San Francisco exhibits the highest RR of 1.9. Cluster 21 contains 3 counties in Tennessee with a RR > 1, and Davidson County exhibits the highest RR of 1.7. Cluster 22 contains 3 counties in Colorado, and Denver exhibits the highest RR of 1.7. Cluster 24 contains 3 counties in Alabama, and Walker County exhibits the highest RR of 3.0. Finally, Cluster 25 contains 5 counties in Mississippi with a RR > 1, and Quitman County exhibits the highest RR of 6.9. Fig. 3 shows the locations and spatial patterns of the twenty-six emerging space-time clusters of COVID-19 at the county level in the U.S. between January 22nd and March 27th, 2020. Adding updated COVID-19 case data produced eighteen more emerging clusters than our analysis in section 3.1. The resulting space-time clusters are smaller in size and more "intense" when running the prospective statistic between January 22nd and March 27th. Notably, the relative risk decreased in Washington State's counties, especially King County where the COVID-19 outbreak was first introduced in the U.S. It is important to highlight that the relative risk throughout the U.S. increased using case data until March 27th; compared to the first analysis in section 3.1 that ended on March 9th. Furthermore, the northeastern U.S. is clearly the epicenter of COVID-19 in the country as shown in Fig. 2. Fig. 2 also shows that some clusters in Fig. 1 have "disappeared" (e.g. southern California and Texas), likely due to increases in testing and vast increases of confirmed cases in many locations after March 9th. Overall, the reported spacetime clusters in Table 2 and Fig. 2 tell a story of the rapid COVID-19 dispersal and transmission across the U.S. In this paper, we utilized a prospective space-time scan statistic to detect emerging clusters of COVID-19 in the United States at the county level, providing results at two distinct time periods. To our knowledge, this study is the first one that utilizes space-time scan statistics to detect emerging clusters of COVID-19 in the United States. The prospective scanning statistic is a valuable surveillance tool to monitor disease outbreaks as they unfold (Kulldorff & Kleinman, 2015) . We suggest that prospective scanning statistics should be utilized in the suite of tools available to public health departments and researchers. It is important to conduct rapid statistical analysis to supplement basic case and disease rate maps available to better understand the highest risk areas of COVID-19; and how risk will progress throughout the duration of this pandemic. Since March 18th, 2020, each of the 50 U.S. states and Washington D.C. reported a confirmed case of COVID-19 (Dong et al., 2020) . The prospective approach utilized in this study can be useful for state and local health departments to monitor the outbreaks in a timely fashion. The main strength of the prospective approach is the ability to add updated COVID-19 counts and rerun the statistic to identify new Fig. 3 . Spatial distribution of emerging space-time clusters of COVID-19 at the county level from January 22nd-March 27th, 2020. emerging clusters; while also tracking the previously detected clusters to determine if they are growing or shrinking in magnitude. Doing so can help determine if current mitigation and isolation techniques are effective at curbing the spread of COVID-19. We demonstrate the notion of the prospective approach by presenting results between January 22nd -March 9th, 2020, and January 22nd -March 27th, 2020. The updated results in section 3.2 showcase the evolution of the COVID-19 outbreaks in the U.S., while 18 more clusters were detected using the updated daily case data. Notably, Manhattan became the epicenter of COVID-19 in the U.S., with a staggering 25% of the confirmed cases across the country. Furthermore, New Orleans and the Fort Lauderdale/Miami areas became hotspots in the southern U.S. Wayne County, Michigan contains Detroit, which also was detected as one of the major hotspots in the U.S when adding the updated daily cases to the prospective scan statistic. One way to further evaluate the evolution of the detected clusters is to relax the statistical significance required (i.e. p < 0.05) and rerun the analysis at numerous spatial and temporal scales. As a result, we can identify locations that may become significant in a few days or a week's time but is beyond the scope of this exploratory paper. Furthermore, the incidence rates are not uniform across the U.S. Population density, age groups, and state and local mitigation measures will influence COVID-19 transmission and the magnitude of current and newly detected spacetime clusters. Healthcare facilities and resources will continue to be tested as more cases are suspected and confirmed with increases in testing (Heymann & Shindo, 2020; Yee et al., 2020) . Isolation measures and intensive contact tracing can successfully control COVID-19 outbreaks and reduce the burden facing hospitals and healthcare providers (Hellewell et al., 2020) . Enhanced hygiene and stricter social distancing measures are required to reduce SARS-CoV-2 circulation, especially when community transmission is detected (Dalton, Corbett, & Katelaris, 2020) . Availability of public datasets are also critical to increase surveillance efforts across the globe and corresponding areas facing substantial increases in transmission (Sun, Chen, & Viboud, 2020) . Confirmed case counts are not enough to understand the true magnitude of the COVID-19 pandemic. Compiling datasets that include suspected, probable, and negative test counts can substantially improve surveillance efforts and our understanding of COVID-19 transmission dynamics (Lipsitch, Swerdlow, & Finelli, 2020) . Despite the strengths of our study, there are limitations worth mentioning. First, there are many counties that were included in the clusters that did not contain any reported cases of COVID-19; however, this is due to the scanning process (an artifact of the statistic) and is circumvented by reporting the relative risk for the locations that belong to each cluster. Second, the case data only include confirmed cases and it is important to highlight that suspected and probable cases are not considered due to unavailability and uncertainty. Therefore, the true magnitude of the COVID-19 pandemic and will not be known for some time. Third, more local-level surveillance and studies are required to understand the transmission dynamics of the current and future emerging clusters; as SaTScan is an exploratory statistic. Fourth, COVID-19 is more severe for the elderly and those with preexisting medical conditions. Future studies can implement case/control cluster techniques with death and case counts (e.g. space-time Bernoulli models), while simultaneously adjusting for age and other relevant covariates. It is also possible to adjust for younger age groups to examine if mitigation guidelines have been successful in any way. Finally, this study utilized COVID-19 case data up until March 27th, 2020. Therefore, the magnitude and number of emerging clusters in our county-level analysis is likely much higher as cases continue to increase across the U.S. We utilized publicly available case data from Johns Hopkins University's Center for Systems Science and Engineering to detect emerging space-time clusters of COVID-19 at the county level in the United States for two separate time periods. We suggest that the counties belonging to emerging clusters should be prioritized when allocating resources and implementing various quarantine and isolation measures to slow viral transmission. COVID-19 and general infectious disease surveillance can benefit from our prospective approach by monitoring outbreaks as they happen as new data becomes available. We emphasize the importance of focusing surveillance on emerging and active clusters during epidemics, essentially dismissing previous clusters that do not threaten public health that would appear in a retrospective analysis. Furthermore, data sharing and availability is crucial and allows a variety of researchers to contribute to our knowledge of COVID-19 and epidemiology, in general. Geographers can play a vital role in mitigating disease transmission, and this study is one example of the plethora of methods that can be implemented in a limited timeframe to effectively inform public health officials and decision-makers about spatial and space-time transmission dynamics. This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors. A novel approach for a novel pathogen: Using a home assessment team to evaluate patients for 2019 novel coronavirus (SARS-CoV-2) Pre-emptive low cost social distancing and enhanced hygiene implemented before local COVID-19 transmission could decrease the number and severity of cases Identifying and visualizing space-time clusters of vector-borne diseases Space-time clusters and co-occurrence of chikungunya and dengue fever in Colombia from An interactive web-based dashboard to track COVID-19 in real time. The Lancet Infectious Diseases Bay Area coronavirus shutdown: How life will change with shelterin-place order Feasibility of controlling COVID-19 outbreaks by isolation of cases and contacts. The Lancet Global Health COVID-19: What is next for public health? The Lancet Clinical features of patients infected with 2019 novel coronavirus in Wuhan Use of a prospective space-time scan statistic to prioritize shigellosis case investigations in an urban jurisdiction A spatial scan statistic Prospective time periodic geographical disease surveillance using a scan statistic SaTScan TM user guide for version 9 Evaluating cluster alarms: A space-time scan statistic and brain cancer in Los alamos Comments on 'A critical look at prospective surveillance using a scan statistic Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia Defining the epidemiology of Covid-19-studies needed The reproductive number of COVID-19 is higher compared to SARS coronavirus Coronavirus: covid-19 has killed more people than SARS and MERS combined, despite lower case fatality rate Residential mobility impacts relative risk estimates of space-time clusters of chlamydia in Kalamazoo County Coronavirus disease (COVID-19). Our World in data Clinical predictors of mortality due to COVID-19 based on an analysis of data of 150 patients from Wuhan The novel coronavirus, 2019-nCoV, is highly contagious and more infectious than initially estimated Early epidemiological analysis of the coronavirus disease 2019 outbreak based on crowdsourced data: A population-level observational study. The Lancet Digital Health What it's like in New Rochelle, N.Y., the site of the state's biggest coronavirus outbreak Integrating vector surveillance data to improve space-time risk estimation of dengue fever in Panama Characteristics of and important lessons from the coronavirus disease 2019 (COVID-19) outbreak in China: Summary of a report of 72 314 cases from the Chinese center for disease control and prevention Novel coronavirus 2019 (COVID-19): Emergence and implications for emergency care Evaluating real-time syndromic surveillance signals from ambulatory care data in four states The early warning system based on the prospective space-time permutation statistic. Wei sheng yan jiu¼ The authors would like to acknowledge Johns Hopkins University Center for Systems Science and Engineering for providing open access to the COVID-19 data. Doing so supports timely research to facilitate surveillance and decision-making in this ongoing pandemic. Next, we thank Dr. Frank Curriero from Johns Hopkins Bloomberg School of Public Health, for his insightful and constructive feedback that improved the quality of this paper. Finally, we would especially like to thank the timely and constructive feedback of the editor and the anonymous reviewers that ultimately improved the quality of this manuscript. Supplementary data to this article can be found online at https://doi. org/10.1016/j.apgeog.2020.102202.