key: cord-0789334-i4syz2t1 authors: He, M.; Fang, K.; Feifei, Z.; Ou, T.; Chen, D. title: A delayed modulation of solar radiation on the COVID-19 transmission reflects an incubation period date: 2020-10-14 journal: nan DOI: 10.1101/2020.10.13.20183111 sha: 85ecec3615330f852388b789c021203f81aa185a doc_id: 789334 cord_uid: i4syz2t1 Laboratory experiments have revealed the meteorological sensitivity of the virus of the coronavirus disease 2019 (COVID-19). However, no consensus has been reached about how the meteorological conditions modulate the virus transmission as it is constrained more often by non-meteorological factors. Here, we find that the non-meteorological factors constrain statistically-least the growth rate of cumulative confirmed cases in a country when the cases arrive around 2500-3000. The least-constrained growth rate correlates with the near-surface ultraviolet flux and temperature significantly (correlation coefficients r=-0.55{+/-}0.08 and -0.45{+/-}0.08 at p 10-5, respectively). In response to increases of 1W/m2 ultraviolet and 1{degrees}C temperature, the growth rate decreases by 0.33{+/-}.11% and 0.18{+/-}.08% per day, respectively. The response to the ultraviolet flux exhibits a delay by about 7 days, providing an independent measure of the incubation period. Our quantifications imply a seasonality of COVID-19 and a high risk of a pandemic resurgence in the upcoming boreal winter, suggesting a need for seasonal adaption in public policies. It has attracted extensive attention whether the coronavirus disease 2019 (COVID- 19) will spread more in the coming winter. Winters are often accompanied by peaks of virus transmissions, such as some Human coronaviruses 1 and influenza 2 . These seasonalities are typically explained in terms of modulations of meteorological conditions on the virus survival, e.g., 3 . This knowledge has raised fears that COVID-19 may surge when the seasons change. A way to predict the COVID-19 transmissions in the winter is to investigate the responses of the virus survival and transmission to the winter weather conditions. Laboratory experiments revealed that the ultraviolet radiation and air temperature modulates the airborne survival of the virus strongly [4] [5] [6] . Several studies have attempted to extract epidemiological evidence, e.g., [7] [8] [9] [10] , through studying the correlations between the number of infections or mortality and the corresponding meteorological conditions using data from priorly selected cities. Their conclusions, however, are controversial. Some attempts did not find the correlation, e.g., 6, 7 , whereas the others reported weak or moderate correlations, e.g., [8] [9] [10] . The results are not conclusive, mainly because the meteorological modulations are constrained by non-meteorological effects. In the current work, we identify the least-constrained growth rate of the infection and quantify its response to meteorological conditions statistically. An uncontrolled outbreak of COVID-19, e.g., 11 , starts with an exponential growth, the rate of which is presumably modulated by external factors such as weather [4] [5] [6] . We estimate the growth rate through a sliding window regression for each country and investigate its response to meteorological conditions across all countries, in contrast to the above-mentioned works 7-10 , which analyzed the number of confirmed cases or mortality from prior selected cities directly. Therefore, our results allow capturing straightforward the modulation of the weather and are not subject to potential statistical bias. The least-constrained growth rate Ideally, outbreaks of infectious diseases e.g., [11] [12] [13] , are typically characterized by two phases with the uncontrolled first phase showing stable exponential growth followed by a second phase with a decreasing growth rate usually after effective artificial controllers 14, 15 (as sketched by the green lines in Figures S1b and S2a). The realistic evolution of the regressed growth rate of confirmed cases decreases largely with time, which can be divided into three primary stages ( Figure S1 as an example and Figure S2 for all countries). In the beginning, infections cannot be confirmed timely and can accumulate until sufficient tests, which is different from the uncontrolled ideal model. Therefore, in Phase I, the decreasing growth rate reflects more the improvement of the capability than the infection growth. In Phase III, the decreasing growth rate could be explained as responses to artificial interventions or controllers, e.g., travel restrictions and changes in human behaviors, e.g., 16, 17 . Also, at the late stage, cured cases are not infectious anymore and therefore lower the growth rate. Phase III corresponds to the decline stage of the ideal evolution model. Therefore, it is most likely in Phase II, in-between the Phases I and III, where the uncontrolled transmission of COVID-19 is recorded timely and sufficiently. Phase II is characterized by a relatively stable growth rate (Figures S1b and S2a), reflecting presumably the most modulation from the weather. In practical, instead of delimiting definitely the stage II for each country, we determine the most stage-II-representative growth rate, referred hereafter as least-constrained growth rate, for all countries through a single-parameter optimization (see "Methods"). The optimization maximizes the correlation between the least-constrained growth rate and meteorological variables. We obtain the solution of the optimization when the accumulative confirmed cases arrive statistically around 2500-3000 (in Phase II in Figure S2a ). In Figure 1 , Africa, central Asia and Central America are generally associated with low least-constrained growth rates, whereas high growth rates are observed in middle and high latitudes of North America and Europe. The obvious spatial clusters of the growth rate suggest its potential links to meteorological conditions. The growth rate exhibits significant correlation with the ultraviolet (UV) flux and the air temperature (r=-0.55±0.08 and -0.45±0.08 at p <<0.01, in Figures 2a and 2b , respectively), but not with the other meteorological conditions, namely, wind speed, relative humidity, diurnal temperature range, and precipitation (p >=0.05, Figures 2c-2f) . The regressions (in red in Figures 2a and 2b ) quantify the responses. An increase in UV flux by 1 W/m 2 is associated with a decrease in the growth rate by 0.33±0.11% per day, and an increase of the temperature by 1 • C is associated with a decrease in the growth rate by 0.18±0.08% per day. A measure of the incubation period Note that the above correlation and regression analyses involve a time displacement of 7 days between the meteorological variables and the growth rate. The correlation between the growth rate and the UV flux (but not the temperature) weakens suddenly as the time displacement decreases when the displacement is shorter than 7 days (Figures 3). The displacement 2/9 . CC-BY-NC 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted October 14, 2020. . indicates the presence of an incubation period as revealed previously 18, 19 . The clinical study 19 suggests that the incubation does not follow a normal distribution but is characterized by a positive skew (more cases occurred below the mean) with a median of 5.1 days and a mean of 6.4 days, whereas a cross-sectional and forward follow-up analysis 18 reported a median incubation of 7.76 days. The discrepancy might be due to sampling bias in the forward follow-up approach which is less capable of detecting incubations shorter than four days. Similarly to the forward follow-up approach, our sliding window in the cross-correlation approach ( Figure 3 ) captures the cases with incubation periods over the most frequency value, and therefore our diagnosis is consistent more with the forward follow-up analysis. Our results provide independent evidence of the incubation period by correlations from all countries, using approaches completely different from the existing estimations. In "Results", we illustrate that the least-constrained growth rate exhibits obvious spatial clusters and significant correlation with the meteorological conditions, namely, UV flux and temperature. The UV correlation exhibits a delay of about seven days, at the temporal scale of the incubation period. While the spatial clusters and the correlation might be attributed to the spatial difference of socioeconomic factors 20, 21 , the delay cannot. The variations of socioeconomic factors are overall at temporal scales much longer than that of the delay, which can neither modulate the COVID-19 nor respond to the UV flux at the time scale of the incubation period. To explain the correlation, in the current section, we explore potential causalities between the UV flux and the growth rate.There are at least three factors through which meteorological conditions can modulate the transmission. The first is human behaviors. When the temperature is low, humans typically spend more time indoors, with reduced social distancing and less ventilation than outdoors. As an example, schools are places of enhanced influenza transmission 22 for intense indoor activities. The second factor is the immune system of susceptible hosts. Solar radiation drives changes in the human immune system by modulating melatonin 23 and/or vitamin D [24] [25] [26] . The last but might be the most important factor is the survival of the virus, namely the virucidal effect of UV. Evidence has revealed that the aerosols as a medium of transmission of COVID-19, as the virus remains active on the surfaces for several hours to days 14 . Intense solar radiation may inactivate the virus on the surface through the physical properties (i.e., shape, size) and the genetic material of the virus 5, 27, 28 . Simulation results suggested that 90% of the virus can be inactive under summer daytime for 6 minutes, whereas the virus becomes inactive for 125 minutes under night condition 4 . In addition, high temperature shortens the virus survival 6, 29, 30 . On the opposite, low temperature is in favor of prolonging survival on infected surfaces and aerosols, which promotes the diffusion of the infection. The modulation of relative humidity, on the other hand, is negligible, as supported by laboratory experiments 4 , which is different from the sensitive modulation on the influenza virus survival 3 and transmission 31 . The 7-day-delayed response to the UV flux ( Figure 3 ) reflects the incubation period, whereas the response to the temperature does not exhibit a delay. A potential scenario is that temperature variation is characterized by a temporal scale longer than the incubation period, and therefore cannot resolve the incubation period. Another potential scenario is that the temperature might not be an independent driver of the transmission but a response to solar radiation. The temperature correlates significantly with the UV flux (r=0.78±0.05 at p=5.8×10 −22 ). We carried out a canonical-correlation analysis, e.g., 32 , between the growth rate and the UV flux and temperature, resulting in a canonical correlation coefficient c UV,T =-0.56±0.08. The canonical correlation coefficient is close to the correlation of UV r UV =-0.55±0.08 (Figure 2a) , which implicates that using both the UV and temperature as predictors can not explain more variance of the growth rate than using the UV alone. The dominant impact of the UV flux can drive a seasonality of COVID-19 transmission and explain the following geographic dependence of COVID-19. (1) The mortality exhibits a latitudinal dependence 26 . (2) The late outbreak in Africa and arid central Asia is attributable to intense UV flux due to the low cloud fraction prior. (3) The onset of the Asian summer monsoon, increases clouds in early May 33 and yields low UV flux, which may account for the late outbreak in India and many southeastern Asia countries until early May. (4) The decrease in UV and temperature during the coming austral winter can contribute to the sharp increase in South America. For example, both the confirmed and dead cases in Brazil ranked second in the world since 13 June. The current study provides epidemiological support for the hypothesis that the ultraviolet radiation and air temperature drives the COVID-19 transmission 26 . Our results also implicate a seasonality of COVID-19 and provide an independent measure of the incubation period. The virus transmits more readily during winter and during the season of global monsoon, which impacts about 70% of the global population 34 . Accordingly, we predict a high possibility of a resurgence in the next boreal winter and suggest to adapt the public policy according to the seasonal variability. The current section extracts a daily infection growth rate for each country from the data of confirmed cases, through a sliding window regression, optimization, cross-correlation, and unit conversion. The early stage of an uncontrolled outbreak is characterized by an exponential growth with time, e.g. 11, 14 . As an example, Figure S1a displays the cumulative confirmed case number y(t) as a function of time t, which follows the exponential law largely. Therefore, we fit the y(t) to an exponential model y = ae b(t−τ) in a 28day-wide sliding window. (Note that the conclusions of the current work are qualitatively not subject to the window size here, learn from the same analyses but with different window sizes from 16 to 60 days, which are not displayed here.) Here, τ denotes the center of the sliding window, a measures the confirmed cases at τ, and the exponent factor b measures the growth rate. Measuring the goodness of the regression is r 2 , which is equal to the square of the correlation coefficient between y(t) and its regression value. A low r 2 represents the growth does not follow the exponential law well. We repeat the regression at τ= 2, 4, ..., 170 days yielding a and b as functions of τ. Displayed in Figure S1b is b(τ) for Afghanistan as an example. Implementing the sliding window regression for all countries results in a(τ), b(τ), and r 2 (τ) for all countries. Scattered in Figures S2a and S2b , are a(τ) and r 2 (τ) against b(τ) for all countries, respectively. We exclude the b values associated with r 2 <0.9. At r 2 <0.9, b exhibits a dependence on r 2 , which could be explained in terms of the three Phases (in "Results"). In Phase III and as a response to the artificial controllers, the growth stagnates and does not follow the exponential law anymore. In principle, all the regressed growth rates b values from all countries could be used for correlation analyses. However, for a given country, the regressed b values are not completely independent of each other due to the overlapped sampling associated with our sliding window. Therefore, in the current subsection, we select only one regressed growth rate, denoted as b m , for each country. We select b m as the maximum b at a> θ a where θ a is a threshold value. θ a is optimized by maximizing the absolute correlation coefficient |r(θ a )| between b m (θ a ) and the corresponding meteorological parameters. The coefficients for the UV flux and temperature |r UV (θ a )| and |r T (θ a )| are displayed in Figure S3 as functions of θ a . |r UV (θ a )|, greater than |r T (θ a )|, maximizes at θ a =2500. The red crosses in Figure S2a denote b m at θ a = 2500, namely, the maximum b at a >2500. Most of these crosses are between 2500< a <3000, suggesting the growth is modulated statistically strongest by the meteorological conditions when there are about 2500-3000 confirmed cases. As an example, the red symbols in Figure S1 illustrate the determined b m and the associated time window. Note that the correlation analyses above are implemented with a time displacement between the sampling window of the growth rate and that of the meteorological variables δt := t UV,T − τ, to avoid the contamination from the COVID-19 incubation period, e.g., 18 . We diagnosis the incubation period through cross-correlation analyses. We first calculate the absolute correlation coefficient |r UV | between 28d-averaged UV flux and b m as a function of the displacement δt. The resultant |r UV (δt)| is displayed as the solid blue line in Figure 3 . The slope of the blue line d|r UV |/dδt is denoted as the red crosses. At -7d< δt <0, the slope d|r UV |/dδt is beyond its standard deviation (outside the shadow), reflecting the correlation decrease sharply as the δt increases. We attribute this sharp decrease to the overlapping of the incubation period with the 28d-wide UV simpling window. Note that the identification of the incubation period is not subjective to the threshold θ a , which is learned from the same analyses but with different θ a =10 3.1 , 10 3.2 , ..., 10 3.8 (not displayed here). The dotted line and bars display the correlation between b m and temperature, and its slope, which does not exhibit a similar sudden drop. According to our regression model, b m is an exponent and e b m measures the ratio of the regressed number of confirmed cases of one day over that of the previous day. Therefore, g := (e b m − 1) * 100% is the daily growth rate by percentage. When b m ≈0, b m is already a first-order approximation of g due to g = b m + O(b 2 m ) ≈ b m , since e b can be expanded into Taylor polynomial Here, O(b 2 ) denotes a variable with absolute value at most some constant times |b 2 | when b is close enough to 0. In "Results", the analyses are based on g. . CC-BY-NC 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted October 14, 2020. . https://doi.org/10.1101/2020.10.13.20183111 doi: medRxiv preprint Figure 1 . Global distribution of the daily growth rate g of COVID-19 cases. Each point represents one country/region. Both the color and the size of the symbols represent the growth rate. The growth rate is estimated through a sliding window regression and optimization detailed in "Methods". . CC-BY-NC 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted October 14, 2020. Precipitation (mm/day) (f) Figure 2 . Correlation between the daily growth rate g and six meteorological variables, (a) the ultraviolet (UV) the flux in the range 250-440nm, (b) the air temperature at 2m above the surface, (c) the diurnal temperature range, (d) the relative humidity, (e) the wind speed at the height of 10m, and (f) the precipitation. In each panel, one cross represents one country, corresponding to one red cross displayed in Figure S2a ; the solid red line presents a robust regression to a linear model g = β 1 * x + β 0 through the least absolute deviations method, and the dashed and dotted lines display the significance level α = 0.05 and 0.01, respectively. Here, x denotes one of the above six variables, and β 1 and β 0 denote the parameters to be determined. The regression results are displayed in red on the bottom of each panel, while the Pearson correlation coefficient r is printed on the upright conner, in the format of r ± ∆r[r l , r u ]. Here, r and ∆r are the mean coefficient and its standard deviation estimated through a bootstrapping method, and r l and r u are the lower and upper bounds for a 95% confidence interval. Also displayed on the top is the p-value for testing the hypothesis of no correlation. . CC-BY-NC 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted October 14, 2020. Here, t UV,T is the center of the sampling window of UV or T , whereas τ is the center of the sliding window in which the growth rate is regressed, as instanced in Figure S1 . The shadow illustrates one standard deviation of the slopes below and above the average slop. At δt<-7 as displayed by the black line, the slope of r UV is below the shadow, which is attributable to the incubation period of COVID-19. . CC-BY-NC 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted October 14, 2020. . https://doi.org/10.1101/2020.10.13.20183111 doi: medRxiv preprint Human coronavirus circulation in the united states 2014-2017 Absolute humidity and the seasonal onset of influenza in the continental united states Absolute humidity modulates influenza survival, transmission, and seasonality Airborne SARS-CoV-2 Is Rapidly Inactivated by Simulated Sunlight Simulated Sunlight Rapidly Inactivates SARS-CoV-2 on Surfaces Effect of temperature on the infectivity of COVID-19 No association of COVID-19 transmission with temperature or UV radiation in Chinese cities Covid-19 transmission in mainland china is associated with temperature and humidity: A time-series analysis Temperature, Humidity, and Latitude Analysis to Estimate Potential Spread and Seasonality of Coronavirus Disease 2019 (COVID-19) The effects of regional climatic condition on the spread of COVID-19 at global scale Effective containment explains subexponential growth in recent confirmed COVID-19 cases in China Spreading patterns of the influenza a (h1n1) pandemic Exponential growth in ebola outbreak since may 14 The reproductive number of COVID-19 is higher compared to SARS coronavirus Infectious diseases of humans: dynamics and control The incubation period of coronavirus disease 2019 (CoVID-19) from publicly reported confirmed cases: Estimation and application Estimation of incubation period distribution of covid-19 using disease onset forward time: A novel crosssectional and forward follow-up study Importance of collecting data on socioeconomic determinants from the early stage of the COVID-19 outbreak onwards Community and Socioeconomic Factors Associated with COVID-19 in the United States: Zip code level cross sectional analysis Estimating the impact of school closure on influenza transmission from Sentinel data Seasonal variation in host susceptibility and cycles of certain infectious diseases The role of uv radiation and vitamin d in the seasonality and outcomes of infectious disease Vitamin d supplementation to prevent acute respiratory tract infections: systematic review and meta-analysis of individual participant data COVID-19 Fatalities, Latitude, Sunlight, and Vitamin D Inactivation of influenza virus by solar radiation Inactivation of the infectivity of two highly pathogenic avian influenza viruses and a virulent Newcastle disease virus by ultraviolet radiation Epidemiology, genome, and clinical features of the pandemic SARS-CoV-2: a recent view On the global trends and spread of the COVID-19 outbreak: preliminary assessment of the potential relation between location-specific temperature and UV index Low ambient humidity impairs barrier function and innate resistance against influenza infection Multivariate observations Rainy season of the Asian-Pacific summer monsson The global monsoon across time scales: Mechanisms and outstanding issues This study was funded by the National Science Foundation of China (41888101, 41822101 and 41971022), Deutsche Forschungsgemeinschaft (DFG HE6915/1-1), Strategic Priority Research Program of the Chinese Academy of Sciences (XDB26020000), the State Administration of Foreign Experts Affairs of China (GS20190157002), fellowship for the National Youth Talent Support Program of China (Ten Thousand People Plan). Support from the Swedish Formas (Future Research Leaders) project is also acknowledged. We used the COVID-19 data of cumulative confirmed cases until 20 July of 2020 at a country level from COVID-19 Data Repository by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University. The daily meteorological variables are extracted from the ERA5 reanalysis dataset from the European Centre for Medium-Range Weather Forecasts (ECMWF) (C3S, 2017) . The meteorological variables analyzed herein include the air temperature at 2m above the surface (land, sea or inland waters), precipitation, relative humidity, wind speed at the height of 10m, downward UV radiation flux at the surface (UV, in the range 250-440 nm), and diurnal temperature range. The daily mean meteorological data were averaged for each country to compare with the country-level COVID-19 data.