key: cord-0724496-btiq6pu4
authors: Chu, Amanda M. Y.; Chan, Thomas W. C.; So, Mike K. P.; Wong, Wing-Keung
title: Dynamic Network Analysis of COVID-19 with a Latent Pandemic Space Model
date: 2021-03-19
journal: Int J Environ Res Public Health
DOI: 10.3390/ijerph18063195
sha: 280387e99defb4b8ad8efbcca78ab5a72f9e3134
doc_id: 724496
cord_uid: btiq6pu4

In this paper, we propose a latent pandemic space modeling approach for analyzing coronavirus disease 2019 (COVID-19) pandemic data. We developed a pandemic space concept that locates different regions so that their connections can be quantified according to the distances between them. A main feature of the pandemic space is to allow visualization of the pandemic status over time through the connectedness between regions. We applied the latent pandemic space model to dynamic pandemic networks constructed using data of confirmed cases of COVID-19 in 164 countries. We observed the ways in which pandemic risk evolves by tracing changes in the locations of countries within the pandemic space. Empirical results gained through this pandemic space analysis can be used to quantify the effectiveness of lockdowns, travel restrictions, and other measures in regard to reducing transmission risk across countries.

Since early 2020, many countries have been affected by the novel coronavirus disease 2019 (COVID- 19) pandemic. Large numbers of confirmed cases of COVID-19 have been reported in the time since the World Health Organization's (WHO) declaration of COVID-19 as a global pandemic on 11 March 2020 [1]. By 22 July 2020, there had been 14,780,939 confirmed cases and 608,839 deaths [2] . There is no doubt that the outbreak of COVID-19 has resulted in a serious threat to public health. To stop the spread of the disease and to reduce the risk from the pandemic, different countries have adopted various levels of control measures, including, but not limited to, quarantining, social distancing regulations, travel restrictions, and the locking down of entire cities.

There has been growing interest in research into various aspects of the COVID-19 pandemic, including the ways in which the disease is spreading [3] [4] [5] . There is also a lot of ongoing research into the various psychological [6, 7] , environmental [8, 9] , economic [10, 11] , and financial [12, 13] impacts of COVID-19 and the policies that have been adopted in response to these. There have been several studies carried out to evaluate the effectiveness and impacts of social distancing measures [14] . The findings of these studies have shown that lockdown measures, in particular, appear to be one of the most effective approaches to limiting the spread of infections [15] . Studies have also shown that the imposition of travel restrictions has been effective in reducing correlations in the numbers of infected people across different countries [16, 17] . In this paper, we develop a dynamic pandemic network model to evaluate the performances of several major policies in terms of controlling the spread of COVID-19. The policies include travel restrictions, lockdowns, and reopenings, and the adoption of these policies in different countries is investigated using a pandemic space concept. The pandemic space considered here is not an absolute space, which defines a coordinate system with objects inside the space linked under the governance of Euclidean geometry [18] . Rather, the pandemic space refers to the relationship between objects under the context of interest [19] . In our case, to have a numerical representation of similarities in the prevalence of the pandemic between countries, we insert a coordinate system and project our pandemic space onto the Euclidean space. By defining the pandemic space, any similarity in the level of prevalence can be quantified by the distance between countries within the space. In fact, the concept behind pandemic space is similar to the idea of social space that is used in social network analysis, and has various applications in different fields [20] [21] [22] . In social network analysis, the tightness of the social relationship between two individuals is assessed by measuring the distance between them in a social space. Typically, the locations of the individuals in a social space are latent variables and have to be estimated using data. Similarly, in our pandemic network analysis, we study the effect of COVID-19 related policies by tracking the distance between countries in a pandemic space.

With reference to the classical SIR model [23] , we may describe the manner in which a pandemic evolves as moving through four stages. In stage zero, there are no confirmed cases, or occasionally there might be a few cases in a country. At this stage, the contribution to the pandemic risk is low. Then, during the first stage, the daily number of new confirmed cases increases and accelerates, indicating the beginning of rapid transmission and an increase in the pandemic risk. The second stage begins some time after the initial outbreak, and is characterized by the imposition of control measures against the outbreak and a deceleration in the daily number of new confirmed cases, which peaks during this stage. It is at this stage that the contribution to the pandemic risk is at its highest. Finally, in the third stage, the daily number of new confirmed cases drops. During this stage, the contribution to the pandemic risk is decreasing. As shown in Appendix E, one characteristic between stages is that the correlation of the daily number of new confirmed cases between countries is low when their contributions to the pandemic risk are in different stages. Therefore, when a country has adopted a better policy to control the spread of the disease, that country will enter the next stage faster than other countries. Even if a country has reached a later stage, a loophole in the preventive measure can lead to another wave of rapid transmission.

To express stochastically the effect on the pandemic risk of the stage at which a country finds itself within the pandemic space, we constructed dynamic pandemic networks that link pairs of countries based on whether those two countries are highly correlated. We did this because we knew that those countries that are in the same stage, except when both are in stage zero, are highly correlated. Given a snapshot of the dynamic network at any particular time point, a situation of high connectedness implies that there is similar prevalence, either an upward, a flattened, or a downward trend in the pandemic risk, in most countries. We expect that in a situation like this, there may occur major events related to COVID-19 and that these events may happen either across a group of several countries or right across the whole world. These major events might include, but are not limited to, rapid transmission of the disease across countries at a time when people remain unaware of its infectiousness and severity, simultaneous lockdown restrictions and border closures across a group of countries, and large-scale vaccination. Otherwise, we should see multiple clusters or groups, which would be indicative of a situation in which there are different levels of prevalence or different stages of pandemic evolution in different countries.

By taking this pandemic space perspective, it is possible to detect similarities in the prevalence of each country in contributing to the pandemic risk by measuring the distance between two countries in the pandemic space. When two countries get closer in the pandemic space, we expect that they will exhibit similar patterns of infections (or a higher probability of being linked together in the pandemic network) and that there will be a similar level of prevalence between these two countries. While by using dynamic pandemic networks, it is possible to detect occasional fluctuations in the linkages between two countries at consecutive time points, we can use the pandemic space to detect fluctuations more accurately and robustly based on the distance between two countries over time. Our model can also estimate which countries exhibit similarities in cases where they may not be linked through the pandemic networks.

There are several ways to measure the distances between countries in the pandemic space. One possible approach is to make use of the network statistics in pandemic network data [24] . There have been recent research papers published that propose the use of COVID-19 pandemic network data to predict and estimate the pandemic risk across countries [25] [26] [27] [28] . The authors of those papers based their conclusions on the study of pandemic risk scores and network connectedness, using network density, the clustering coefficient, and the assortativity coefficient. These statistics are useful for tracking pandemic risk and can also offer an early indication of an acceleration in the number of confirmed cases. Our main contribution in this paper is to construct the pandemic space using a latent pandemic space modeling approach, which provides a view of the pandemic network data that is different from that provided by an analysis that relies on using network statistics. We first performed latent pandemic space modeling [29] using the pandemic network data to determine a time-dependent location for each country, and then measured the distance between every pair of countries via their coordinates in the pandemic space. This practice of applying latent space modeling to analyze network data can be dated back to the use of multidimensional scaling on sociometric data [30] . Since then, there have been further advancements in the analysis of social network data of both static [31] and dynamic [21] networks. Our proposed pandemic space concept, together with the latent space modeling, enables us to visualize clusters of countries representing different levels or stages of pandemic risk contribution at any time point and keep tracking the risk over time. Furthermore, this latent pandemic space model can estimate the size of an effect on the risk of pandemic based on distance in the pandemic space, and the country-specific effect of the distance on the probability of being linked.

The remainder of the paper proceeds as follows. Section 2 describes the statistical methods that we have used to construct the pandemic space. Section 3 gives the results of our analysis, while Section 4 provides a discussion of results. Finally, Section 5 offers some conclusions.

As mentioned in the introduction, we built the pandemic space from dynamic pandemic networks using latent pandemic space modeling. The dynamic pandemic networks consisted of 164 countries, visualized as nodes in the networks. We list the details of the 164 countries, including the total infected numbers in those countries as of 22 July 2020, in Appendix D. The edges among the countries in the pandemic networks change every day in our study period. With reference to the number of confirmed cases in the WHO's situation reports [32] , we linked a pair of countries by an edge in day t if, for the last 14 days (i.e., from day t − 13 to day t), the correlation between the daily change in square root of the cumulative number of cases was larger than 0.5 [25] [26] [27] . The study period was from 21 January 2020 to 22 July 2020. Following this [16, 25] , we made use of a moving-window scheme to calculate the 14-day historical correlation. We constructed the network starting from the fourteenth day onward. In the statistical estimation, we regarded the fourteenth day in the data (i.e., 4 February 2020) as day 1 in our network. The data were sufficient for us to create time-varying pandemic networks of T = 170 days.

Denote C it as the number of confirmed cases of COVID-19 in country i on day t, where i = 1, . . . , n, and t = −12, . . . , T. We performed square root transformation to the number of cases in country i from day t − 1 to day t before calculating the daily increment of cases, i.e.,

to make the counts more stable statistically [26, 27, 33] . Then, to determine the growth pattern between country i and j, we computed the 14-day historical correlation between D it and D jt for i = j, i, j = 1, . . . , n, and t = 1, . . . , T, i.e.,

where D it = ∑ 14 k=1 D i(t−k+1) is the 14-day moving average. Finally, for the pandemic network at day t, we assigned the (i, j) entry of the adjacency matrix Y t to be 1 if the historical correlation ρ ijt was greater than 0.5 [27] , and 0 otherwise. The collection of all adjacency matrices formed the dynamic network of pandemic Y. In case when the correlation was undefined, we followed the approach in Appendix A to determine the corresponding value in Y t .

A novelty of the latent space modeling approach is its ability to construct a pandemic space from which we can study the tightness between the pandemic situations in different countries over time graphically. Assume that each country has a latent position on the s-dimensional pandemic space, where s = 2 in our case for the sake of visualization. Each country moves in the pandemic space every day to reflect the change in pandemic risk. If two countries are close to each other, the probability that they will have co-movement in their number of confirmed cases (i.e., be linked in the network) is high. In the pandemic perspective, this co-movement in the number of confirmed cases can be partly attributed to the possibility of cross-border transmission in the early stages of the pandemic, and similarities in the effect of pandemic preparedness between the two countries-infection measures and hygiene awareness for stopping local and community transmission, and vaccination in the future.

Let Z t be an n × s matrix so that its i-th row Z it is the position of country i at day t on the pandemic space. We assume the initial coordinates Z 1 follow a multivariate normal distribution, with the joint density

Furthermore, we assume that the transition of a latent position in the pandemic space from t − 1 to t is also normally distributed, with the joint density

for t = 2, 3, . . . , T, where I s is the s × s identity matrix and N(z|µ, Σ) is the (multivariate) normal density evaluated at z with mean µ and covariance matrix Σ. The parameter τ 2 is involved in the calculation of the average initial distance, E||Z i1 − Z j1 ||, in the pandemic space between countries on day 1. Since the distance ||Z i1 − Z j1 || between country i and j is √ 2τ χ 2 2 distributed and has a mean of τ √ π, we can use the mean distance to estimate the pandemic risk across countries at t = 1. The parameter σ 2 is involved in specifying the distribution of the daily transition in the latent position, ||Z it − Z i(t−1) ||. It can be shown that ||Z it − Z i(t−1) || is distributed as σ χ 2 2 and has a mean of σ √ π/2. This mean helps us to understand how fast a country contribution of pandemic risk can build up or fall off due to the changes in latent positions. If this mean is large, we expect that two countries separated by long distance on the pandemic space might get closer over a short period of time.

To connect the latent position to the pandemic network, we first re-write the joint density of the adjacency matrix Y t at day t to be

. Then, we model η ijt , the logit of the conditional probability, with

where the parameters carry the following meaning:

• d ijt = Z it − Z jt , the distance between two countries in the pandemic space; • β > 0, the overall effect of distance on the link probability P(y ijt ) and the associated pandemic risk; • r i > 0 (with constraint ∑ i r i = 1), can be interpreted as the country-specific effect of the distance on the link probability.

The parameters r i and r j are two factors based on country i and j respectively to adjust the effect of the distance d ijt on the logit of the link probability η ijt . Comparing two pairs of countries, i and j, and i and j, if the ratio of the country-specific risk factors of country i to country i is the same as the ratio between country j to country j , i.e., r i /r i = r j /r j = k for some k > 1, these two pairs have the same probability of being linked when d ijt = kd i j t . In other words, even though the distance between country i and j is k times of the distance between country i and j , their high country-specific risk factors counterbalance the distancing effect, leading to the same link probability, which implies a similar level of prevalence risk between country i and j and between country i and j . An application of the country-specific risk is that when the distance between country i and j is equal to the harmonic mean of their corresponding country-specific risk, i.e., d ijt = 2/(r −1 i + r −1 j ), these two countries have a probability 0.5 of being linked together. Based on r i and r j , it is useful to determine a cut-off distance to classify countries i and j into a cluster, or a group, based on the possibility of their being in the same stage in contributing to the pandemic risk. With the latent space location of country i as the center and r i as the radius, we can draw a circle around country i. Graphically, we can determine whether a pair of countries, say country i and j, have a probability of being linked together exceeding 0.5 by considering whether their pandemic space locations lie inside the circles for country i and j simultaneously.

Parameter β works like a regression coefficient to measure the effect of the distance d ijt on the logit link η ijt . It also determines the maximum probability for two countries being linked together. The maximum is attained when two countries coincide in the pandemic space, that is, when d ijt = 0. In this case, the conditional probability is exp(2β)/(1 + exp(2β)), which is strictly increasing and reaches 1 when β → ∞. Therefore, in the pandemic space, even if two countries are separated by a distance of zero, this does not imply that there must be a link between them, as β is finite.

One remark on the position of countries on the pandemic space is that any distancepreserving transformation in the pandemic space gives an identical value of η ijt . We followed the approach in Appendix B to handle the identifiability issue.

We adopted Bayesian methods for estimating the unknown parameters τ 2 , σ 2 , r i , and β in the latent space model. First of all, we assigned a normal prior for β, an inverse gamma prior for τ 2 and σ 2 , and a Dirichlet prior for r. All the priors were set to be uninformative to allow variability. However, the full posterior is highly complex and intractable. Therefore, we performed Markov chain Monte Carlo (MCMC) sampling with a total of 200,000 iterates to obtain the posterior estimate. This approach has been widely adopted in many previous Bayesian analyses [29, [34] [35] [36] [37] [38] [39] [40] . All full conditional densities required for MCMC are listed in Appendix C.

Denote ζ (l) as the parameter ζ, which is one of the parameters to be estimated in the l-th iteration, andζ (l) as the proposed value in the l-th iteration, which was randomly drawn from the proposal distribution with density f ζ (·; θ ζ ), where θ ζ governs the step size of the proposal. At the beginning of Markov chain Monte Carlo (MCMC), we set an initial value ζ (0) for each parameter. We also set this value to be the prior mean. In each iteration, we sequentially sampled all unknown parameters from proposal distribution. In particular, during the l + 1-th iteration, we 1.

Draw β from N(β (l) , θ β ) with truncation on the non-positive values.

Draw log τ 2 from N(log τ 2 (l) , θ log τ 2 ).

Draw log σ 2 from N(log σ 2 (l) , θ log σ 2 ).

Draw r from Dirichlet distribution with concentration parameter θ r r (l) .

. . , n, and t = 1, . . . , T. In each of the above steps, we calculate the acceptance probability

where π(ζ) is the posterior density of ζ, to decide whether to accept the proposed valuê ζ (l+1) as the next current value, i.e., setting ζ (l+1) =ζ (l+1) . If not, we reject the proposed value and keep the current value, i.e., setting ζ (l+1) = ζ (l) . We also implemented the following MCMC adaptations to improve the convergence of results [41, 42] . In the first stage, which consists of the first 5% of iterates, we allowed the Markov chain to move in order to find the best initial position. In the second stage, we controlled the acceptance rate by adjusting the step size (i.e. θ ζ ) until we completed the first 30% of iterates. After the MCMC adaptations, we let the chain burn-in by neither changing the step size nor including them into our sample for the next 20% of iterates. Finally, after the burn-in period, we took the last 50% of iterates to estimate the posterior. In order to reduce the autocorrelation and speed up the convergence, we recorded one observation for every 100 iterations. Moreover, starting from the 10% of total iterates, every time after we recorded an observation, we calculated the effective sample size, which was measured by the reciprocal of autocorrelation in the latest 100 iterates, and used it to determine the probability for each parameter to be sampled in the next iteration. This probability was fixed after we completed 30% of total iterates. In Figure 1 , we provide a plot of the posterior to show the convergence. The plot on the left-hand side shows the posterior in each MCMC iteration. The plot on the right-hand side focuses on the post-burn-in period, with which our posterior estimates and standard deviation were calculated. 

In this section, we present the statistical results from the latent pandemic space model. As mentioned in previous sections, we linked two countries together and believed that they had a similar level of prevalence if the growth pattern of their confirmed COVID-19 cases showed a certain extent of similarity. Using a 14-day moving-window scheme allowed us to gather enough observations to calculate moving correlations, and also reduce fluctuation due to a single-day spike in the data. The WHO's situation report records the daily number of confirmed cases since 21 January 2020. As we needed 14 days data to calculate the correlation, the study period of our pandemic network was from 4 February 2020 to 22 July 2020.

Before analyzing the modeling results, we followed [25] to present Figure 2 , which shows the pandemic network on the eighteenth day of each month. We can see that in February, there are only a few edges. One month later, we see that there are many more edges in the pandemic network, indicating that more countries are experiencing similar levels of prevalence. If we also take into account the daily number of confirmed cases at that time, it begins to appear that there were major events that accelerated the pandemic risk during that period. After that, although the number of edges in April to June was fewer than in March, indicating a diverging scenario in the daily number of confirmed cases, most countries had at least one link to another country, showing that the pandemic risk (between countries, or community transmission) may be lower but had not disappeared. For July, we again found an increment in the number of edges. We believe that this should be interpreted as signaling an upcoming change in the pandemic risk. This signal can also be detected in the preparedness risk score [25] .

In our estimated pandemic space, we can visualize the above changes in network connectedness in terms of the distances between countries. A shorter distance implies a higher probability of connection and thus a synchronized change in the prevalence of cross-border or community transmission in two countries. Figure 3 shows the pandemic space on the eighteenth day in each month with the top five countries having the highest number of confirmed cases of COVID-19. As of 22 July 2020, the five countries which contributed around 60% of the total number of infections were the United States of America, Brazil, India, the Russian Federation, and South Africa [2]. In Figure 3 , the radii of those circles surrounding the country names represent the country-specific effects. Depending on whether the distance between two countries is smaller than or larger than both radii of their corresponding circles, we can determine whether the probability of a link between them is more than or less than 50%. In particular, if two countries have the same country-specific effect, and if the center of one circle lies on the boundary of the second circle, and vise versa, the probability of the two countries being linked is exactly 50%. Since all circles have similar radii in our case, we can simply consider whether their centers are included in both circles in order to classify whether or not there is a similar level of prevalence between two countries. Therefore, excluding those marginal cases, among the top five countries in the figure, USA-India, USA-South Africa, USA-Russia, Brazil-Russia, Brazil-South Africa, and Russia-South Africa had potentially similar levels of prevalence on 18 March 2020; and USA-India, India-South Africa, and Brazil-South Africa had potentially similar levels of prevalence on 18 July 2020. Figure 4 studies the pandemic risk between countries. We again selected those countries which were in the top five list in terms of the cumulative number of confirmed COVID-19 cases and produced boxplots of the distance between each pair of countries over time. To construct the boxplots, we calculated the distance according to the latent coordinates in the pandemic space, a total of 10 distances from the top five countries in each day. In Figure 4 , we categorize the countries by continent according to the WHO's classification, which has also been implemented in the literature on pandemic network research [25] . We created daily boxplots for the five regions-Africa, the Americas, Asia, the Eastern Mediterranean, and Europe-and we aggregated the top five countries in each of the five regions and labeled them as "Top 5". In general, it is unlikely that all countries would be separated by small distances, as this would imply a high degree of similarity in prevalence between all countries. However, if this is the case, we will see that the boxplots are short and located at a low position. Based on the variability and maximum distances, we have a conservative measure of classifying the periods of high pandemic risk based on the network connectedness. From the "Top 5" graph for the whole world, the pandemic emerges in February 2020, when the median distances are large. The situation seems to get worse in late February 2020, when the median distance drops more than 50% in a week. In fact, the highest levels of connectedness are mid-March and late June to mid-July 2020, at the times of the two main waves of the pandemic. In addition to the two periods previously identified, there are also two other periods of high connectedness or potentially high pandemic risk in the five regions: one in Europe during mid-May 2020, and the other in the Americas during mid-June 2020.

Besides intra-continental pandemic risk, we also studied inter-continental risk. In Figure 5 , we calculated for each day the median distances between clusters of the top five countries from each continent. For example, if country A belongs to Africa, and country B belongs to the Americas, we include the distance between A and B when we calculate the median for the Africa-Americas distance (i.e., the second plot in the first row of the matrix plot in Figure 5 ). We observe that the distances in all plots decrease from around 0.125 in late February to 0.025 in mid-March, showing that the "inter-continental risk" builds up quickly in this period. The distances stay in the range of 0.025 to 0.05 for most of the time after March. For the plots among Africa, the Americas, and Asia, the minimum distance over the whole period of study is attained in mid-March. The distance between Europe and the Americas is relatively small in early June and early July. From Figure 5 , we cannot observe any sign that the pandemic is coming to an end as the distances are mostly below 0.05 till mid-July 2020. Table 1 summarizes the results with the estimate (posterior mean) and the standard deviation (SD) of each parameter in our pandemic space model. The parameter β determines the highest probability of linking two countries when they have the same location, or d ijt = 0. Based on the estimate of β, the highest probability was 72.40%. The parameter τ and σ are the standard deviations of latent coordinates on the first day, and of the daily transition in coordinates, respectively. Therefore, in this pandemic space, we can estimate the mean distance between countries on the first day by τ √ π = 0.1504, and the average distance each country travels in one day by σ √ π/2 = 0.0100. Consistent with the results shown in Figure 4 , the initial pandemic risk between countries is small. Potentially high pandemic risk (i.e., small distance) can be seen in two weeks from mid-February, with reference to Figure 5 , such that the median of distances is not greater than 0.05 for most of the time. From the latent space modeling, we can deduce that the COVID-19 pandemic evolves quickly and that the outbreak situation can become worse within a matter of just two weeks.

The variable invr is the mean of the inverse of country-specific risk factor, r −1 i . Based on our model assumptions, the sum of all country-specific risk factors is fixed to be 1 for the sake of identifiability. If invr attains its minimum value, which is the number of countries, every country has the same probability of linking to another country provided that their distances are the same. On the other hand, countries having larger r i will have a relatively higher probability of linking to another country. In this study, the estimate of invr is close to the minimum, implying that there is no country dominating the contribution to the pandemic risk. From another perspective, if we compare the country-specific risk factor r i , Serbia and New Zealand have the minimum and maximum country-specific risk at 0.0057 and 0.0066, respectively. The minimum and maximum are not far from the mean value of country-specific risk of 0.0061. We list the country-specific risk factor for each of the 164 countries in Appendix D. Table 1 . Estimate of the parameters in the latent space model. The first column contains the posterior mean. The second column contains the posterior standard deviation. β is the regression coefficient, τ is the standard deviation of latent coordinates on the first day, σ is the standard deviation of daily transition in coordinates, and invr is the mean of inverse of country-specific factor. 

Although the exact date depends on each country, most countries set up their travel restriction policies by 31 March 2020 [43] . Therefore, we set 31 March 2020 as a cutoff date for further discussion. Prior to travel restrictions coming into effect, a traveler who was an asymptomatic COVID-19 carrier could transmit the disease to other people in different countries. After the cutoff date, most incoming air traffic was prohibited and many lockdown policies were put in place. The main focus switched to the question of when a suitable time to reopen would be, and another focus was which countries had relatively smaller contributions to the pandemic risk.

In Figure 3 , we notice that for 18 March 2020, the first period of increasing pandemic risk observed in Figure 4 , there were six pairs of countries out of a total of 10 that had similar levels of prevalence. This result coheres with the rapid increase in the number of edges in the pandemic network. In fact, referring to the literature [27, 44] , the number of confirmed cases and the network connectedness accelerated during this period. We believe that the importation and exportation risk of COVID-19 cases via air travel poses a risk in the transmission of COVID-19 [45] . Within one month after the declaration of the COVID-19 pandemic, by which time most countries had imposed travel restriction, we see from Figures 3-5 the effectiveness of such measures in increasing the distance between countries in the pandemic space compared to the situation in March 2020.

Although none of the 10 country pairs had similar levels of prevalence until mid-July 2020, the distances in the 10 country pairs were not as large as they appear to have been in February 2020. The pandemic risk still existed and had the potential to lead to another wave of rapid transmission if travel restrictions were lifted [46] . We also see that on 18 July 2020, there was one edge between USA and India, and one edge between India and South Africa. The close distances between them put those countries into the cluster of similar prevalence, together with the Brazil-South Africa pair, which had no link on that day. Instead of importation and exportation, we believe that the reason behind this closing distance was the partial re-opening of the two countries' economies [47] and the permitting of inter-state air travel [48] . In fact, these measures may also be considered the reason behind the period of increased risk in Europe during mid-May 2020 and in the Americas during mid-June 2020. Figure 5 provides another view from the inter-continental perspective, showing that while most of the restrictions on international air travel remained valid, the inter-continental distances in July 2020 stayed in the low-value range (below 0.05), implying that the similarity in the levels of prevalence across continents was still quite high in mid-July 2020.

To investigate the effectiveness of lockdown policy, we refer to the "Top 5 in Europe" plot in Figure 4 , as most of the European countries experienced a lockdown in late March to late April 2020. Between mid-March and late March 2020, the median distance stayed small and the boxplots are short, indicating that the impact from the first wave of transmission lasted for at least half a month before the lockdown. After two weeks of lockdown (i.e., by around mid-April 2020) the median distance increased to a relatively high level, and for the same time period in the graph, the boxplots become taller. Similarity in the levels of prevalence between countries is weakened as the implementation of social distancing measures restricts opportunities for transmission of the disease, though it should be noted that the effectiveness of measures against the transmission of COVID-19 varies across countries. We also discovered that the median distances of the boxplot dropped again after early May 2020. However, we found that the daily number of new cases was decreasing at that time, indicating that they had a decreasing contribution to pandemic risk. Around that period of time, most European countries began to re-open in a stepwise manner [49, 50] . We believe that the imposition of a lockdown is effective in reducing the pandemic risk, as the distances among countries during periods of lockdown were larger than those before the lockdown in the pandemic space. However, by beginning the process of reopening at a time when the daily number of new cases has not yet reached a stable low value, the distances among countries may decrease again, together with further waves of mass transmission.

In this paper, we developed the pandemic space approach, which was built upon the concept of dynamic latent space modeling, to explore the pandemic risk across countries around the world. Using the pandemic space, we investigated and provided hints on whether the various control measures adopted in different countries, including travel restrictions and lockdowns, were effective at reducing the pandemic risk or not. We first constructed the pandemic network using 14-day data in a moving window scheme. We linked two countries together if their correlation in terms of numbers of infections was high. We followed this rule to build a daily network from 4 February 2020 to 22 July 2020. Then, based on the pandemic network, we performed latent space modeling to produce our pandemic space.

The pandemic space allowed us to identify country pairs that had potentially high transmission risks in some periods, or a synchronized increase in the severity of community transmission. We also explained the different pandemic periods using the maximum distances and the height from time series boxplots of distances. We also investigated the inter-continental risk with the time series of the median distance between each pair of continents. Moreover, using the parameters obtained from our estimation, we examined the highest probability of two countries being linked, the country-specific effect, the initial average distance, and the speed to build up the pandemic risk. As in other papers [28, [51] [52] [53] , using this pandemic space analysis, we concluded that both lockdown and travel restrictions are effective at reducing the pandemic risk across countries. Nevertheless, these two measures, and probably also other control measures, are not sufficient to wipe out the risk of pandemic across countries.

Future works might include considerations of exogenous variables, such as statistics related to travel between countries, the number of people who have received at least one vaccine dose, and lockdown activities, so as to make it possible to conduct Bayesian predictions about similarities in levels of prevalence between countries and the pandemic risk. It would also be interesting to consider pandemic spaces with higher dimensions and interpretations of the latent dimensions in the pandemic space in future research. Funding: This work was partially supported by the Hong Kong RGC General Research Fund (grant number 16307217). The funding body had no role in study design, data collection, and analysis; the preparation of the manuscript; or the decision to publish.

The dataset used in this study can be found on https://covid19.who. int/table.

The authors declare no conflict of interest.

The following abbreviations are used in this paper: Eastern Med. Eastern Mediterranean

To allow every country to have a location in the pandemic space on each day, we impute those missing links between countries as empty; i.e., y ijt = 0. In the pandemic network, a pair of countries has a missing link when the correlation at time t is undefinedi.e., there have been no confirmed COVID-19 cases in the previous 14 days for at least one country in that pair. It is reasonable to claim that the pandemic risk between these countries is negligible when a country has had no new cases in 14 consecutive days.

Since the η ijt in Equation (3) depends on the distance between countries in the pandemic space instead of their coordinates, if we hold other variables unchanged, any distance preserving transformation in the pandemic space, like translation, reflection and rotation, gives an identical value of η ijt . To handle the identifiability issue on latent positions, we use the Procrustes transformation [54] to translate and rotate the whole space at time t such that the sum of squared distance traveled by all countries from t − 1 to t, and from t to t + 1 is minimized; i.e.,

(A1)

We denote φ as all parameters, including β, τ 2 , σ 2 , r i , and Z it , which are not specified as arguments but appearing in the following posterior density function. We first write the likelihood of Y, as a product of conditional distribution:

Then, we derive the following posterior densities:

•

The joint posterior density of Z it is

The posterior densities of τ 2 and σ 2 are

and π(σ 2 |Y, φ) = IG ν σ 2 + 1 2 sn(T − 1), ξ σ 2 + 1 2

respectively, where ν τ 2 and ν σ 2 are the shape parameters in the inverse gamma prior, and ξ τ 2 and ξ σ 2 are the scale parameters of the inverse gamma prior.

• The posterior density of β is

where ν β and ξ β are the mean and variance respectively of the normal prior. •

The joint posterior density of r = (r 1 , . . . , r n ) is

where α i , for i = 1, . . . , n, are the concentration parameters of the Dirichlet prior. 

In this appendix, we demonstrate with heatmaps the 14-day historical correlations of the daily number of new confirmed cases between pairs of countries under the susceptibleinfected-recovered (SIR) model. The SIR model assumes that among the population N who get infected with a certain kind of disease, people can be divided into three classes. The first class "susceptible", S(t), contains individuals who are at risk at time t. The second class "infected", I(t), contains those individuals who have been infected and are able to spread the diseases. The third class "recovered", R(t), contains those who have recovered from the infection and have become immune to the disease. We can use the following mathematical equations to formulate the relationship among these three classes: 

where β is the infection parameter and γ is the recovery parameter. In our demonstration, we assume that the recovery parameter is equal to one-tenth of the infection parameter for simplicity. Moreover, we allow countries 1 and 2 to have infection parameters β 1 and β 2 , respectively. We repeat the calculation after changing these two parameters to be combinations of 0.14, 0.17, 0.2, 0.25, 0.33, 0.5, and 1. Following Equation (1), we first calculate the square root transformed daily number of new cases, D t = S(t − 1) − S(t). Then, we calculate the correlation by Equation (2) from t = 1 to t = 170. Notice that in the first 13 days, there are insufficient data to calculate the correlation. Therefore, the corresponding entries are colored in grey. In Figure A1 , the x-axis and y-axis in each plot refer to the number of days since the first confirmed case in the two countries. The region in green represents the scenarios when the correlation between the two countries is greater than 0.5. We can roughly see three stages: one at the bottom left, a bridge in the middle, and one at the top right. Figure A1 . The x-axis and y-axis in each plot refer to the number of days since the first confirmed case in each of the two countries, respectively. The region in green represents the scenarios when the correlation between the two countries is greater than 0.5.

COVID-19) Situation Report-51; World Health Organization: Geneve, Switzerland, 2020. 2. WHO. Coronavirus Disease 2019 (COVID-19) Situation Report-184; World Health Organization: Geneve

The Spread of the Covid-19 Pandemic in Time and Space

It Is All About Location: Smartphones and Tracking the Spread of COVID-19

Can Socioeconomic, Health, and Safety Data Explain the Spread of COVID-19 Outbreak on Brazilian Federative Units?

A New Social Network Scale for Detecting Depressive Symptoms in Older Japanese Adults

Coronavirus Lockdown as a Major Life Stressor: Does It Affect TMD Symptoms?

The positive impact of lockdown in Wuhan on containing the COVID-19 outbreak in China

Air quality changes in New York City during the COVID-19 pandemic

The socio-economic implications of the coronavirus pandemic (COVID-19): A review

Economic Effects of Coronavirus Outbreak (COVID-19) on the World Economy. SSRN Electron

Impacts of the COVID-19 Pandemic on Financial Market Connectedness

Co-movement of COVID-19 and Bitcoin: Evidence from wavelet coherence analysis

Enacting national social distancing policies corresponds with dramatic reduction in COVID19 infection rates

Is the lockdown important to prevent the COVID-19 pandemic? Effects on psychology, environment and economyperspective

Analysis of travel restrictions for COVID-19 control in Latin America through network connectedness

Outbreak dynamics of COVID-19 in Europe and the effect of travel restrictions

Is Spatial Really that Special? A Tale of Spaces

Dynamic structures of geographic space

Collaboration analysis in recommender systems using social networks

Dynamic social network analysis using latent space models

Social network analysis and mining for business applications

Contributions to the mathematical theory of epidemics-I

The dynamic nature of contact networks in infectious disease epidemiology

On Topological Properties of COVID-19: Predicting and Assessing Pandemic Risk with Network Statistics

Visualizing COVID-19 pandemic risk through network connectedness

Detecting early signals of COVID-19 global pandemic from network density

Pandemic Risk of COVID-19 Outbreak in the United States: An Analysis of Network Connectedness with Air Travel Data

Latent Space Models for Dynamic Networks

Longitudinal approach to subgroup formation: Re-analysis of Newcomb's fraternity data

Latent space approaches to social network analysis

Coronavirus Disease (COVID-19) Situation Reports

The Square Root Transformation in Analysis of Variance

Bayesian spatial-temporal modeling of air pollution data with dynamic variance and leptokurtosis

Bayesian randomized response technique with multiple sensitive attributes: The case of information systems resource misuse

Bayesian analysis of tail asymmetry based on a threshold extreme value model

Bayesian analysis of nonlinear and non-Gaussian state space models via multiple-try sampling methods

Vine-copula GARCH model with dynamic conditional dependence

Efficient estimation of high-dimensional dynamic covariance by risk factor mapping: Applications for financial risk management

A Latent Space Modeling Approach to Interfirm Relationship Analysis

On a threshold heteroscedastic model

Bayesian hierarchical model for spatial extremes with multiple durations

World Tourism Organization. 100% of Global Destinations Now Have COVID-19 Travel Restrictions; World Tourism Organization

Spread and dynamics of the COVID-19 epidemic in Italy: Effects of emergency containment measures

Airport risk of importation and exportation of the COVID-19 pandemic

Is it safe to lift COVID-19 travel bans? The Newfoundland story

How India is dealing with COVID-19 pandemic

Potential for inter-state spread of Covid-19 from Arizona, USA: Analysis of mobile device location and commercial flight data

How Turkey took control of Covid-19 emergency-BBC News

Coronavirus: How lockdown is being lifted across Europe-BBC News. BBC News

Infectious Risks of Air Travel

Will travel restrictions control the international spread of pandemic influenza?

Delaying the international spread of pandemic influenza

Bilinear mixed-effects models for dyadic data