key: cord-0922203-a9jns7hp authors: Goel, Rahul; Bonnetain, Loïc; Sharma, Rajesh; Furno, Angelo title: Mobility-based SIR model for complex networks: with case study Of COVID-19 date: 2021-10-22 journal: Soc Netw Anal Min DOI: 10.1007/s13278-021-00814-3 sha: b13cf5300a74846348d8bc7d0aeed29906309c2f doc_id: 922203 cord_uid: a9jns7hp In the last decade, humanity has faced many different pandemics such as SARS, H1N1, and presently novel coronavirus (COVID-19). On one side, scientists have developed vaccinations, and on the other side, there is a need to propose models that can help in understanding the spread of these pandemics as it can help governmental and other concerned agencies to be well prepared, especially for pandemics, which spreads faster like COVID-19. The main reason for some epidemic turning into pandemics is the connectivity among different regions of the world, which makes it easier to affect a wider geographical area, often worldwide. Also, the population distribution and social coherence in the different regions of the world are non-uniform. Thus, once the epidemic enters a region, then the local population distribution plays an important role. Inspired by these ideas, we propose two versions of our mobility-based SIR model, (i) fully mixed and (ii) for complex networks, which especially takes into account real-life interactions. To the best of our knowledge, this model is the first of its kind, which takes into account the population distribution, connectivity of different geographic locations across the globe, and individuals’ network connectivity information. In addition to presenting the mathematical proof of our models, we have performed extensive simulations using synthetic data to demonstrate the generalization capability of our models. Finally, to demonstrate the wider scope of our model, we applied our model to forecast the COVID-19 cases at county level (Estonia) and regional level (Rhône-Alpes region in France). In this modern age, pandemics are not a rare phenomenon. As in the last decade, we have seen several pandemics such as H1N1, SARS, and EBOLA, and presently humanity is facing its biggest crisis due to COVID-19. The severity of these pandemics can be understood by the death toll claimed by them. According to WHO, the pandemic H1N1/09 virus resulted in 18,036 deaths (Organization WH et al 2009) . On the other hand, the Centers for Disease Control and Prevention (CDC) estimate between 151,700 and 575,400 deaths due to the pandemic H1N1/09 virus (Centers for Disease Control and Prevention 2012). Currently, the coronavirus (COVID-19) pandemic, which started in December 2019 from Wuhan, China, has infected 134,641,198 individuals and claimed 2,917,995 (as of April 9 ,2021) deaths worldwide (CSSE 2020) COVID. Pandemics are different from epidemics in terms of their geographic spread. An epidemic affects many people at the same time. It spreads from person to person and remains local to a specific region. In comparison, when an epidemic engulfs an entire country, continent, or the whole world, it is termed as pandemic. In the past, various models have been proposed for understanding the epidemics spread. These models can be broadly classified into two categories, that is agent-based modeling (Bonabeau 2002; Schelling 1971; Sun 2006 ) and compartmental models (Kermack and McKendrick 1927; Hethcote 2000; Goel et al. 2019) . The agent-based modeling is used for simulating the actions and interactions of autonomous agents as a whole (Epstein 2009 ). These agents can be both individual or collective entities such as organizations or groups. In contrast, differential equations are used in compartmental models, where the population is divided into different compartments such as suspected (S), infected (I), and recovered (R) (Kermack and McKendrick 1927) . Several other variants of these models have also been proposed such as SI (Hurley et al. 2006) , SIS (Nåsell 1996) , SIR (Kermack and McKendrick 1927) , and SIRS (Jin et al. 2007) . Agent-based model researchers often criticize compartmental models as these models struggle to capture the connectivity between different regions of the globe, and different real-world population characteristics, such as worldwide population distribution (Chinazzi et al. 2020; Eubank et al. 2004) . In this study, we propose a mobility-based SIR model, an extension to the classical SIR-based epidemic model, which considers the real-world population distribution across different regions of the world. Most importantly, the model also takes into account the connectivity factor among various regions over the world, which is the key cause in accelerating the process of transforming epidemics into pandemics. We present two versions of our model, (i) fully mixed and (ii) complex networks model, which especially takes into account real-life interactions. We model the regions in a 2-dimensional lattice, where each cell represents the mobility parameter (or direct connectivity) from one region to another. Along with presenting the mathematical proof of our model, we have performed extensive simulations on synthetic data. Moreover, we leverage our model to forecast the COVID-19 cases in Estonia 1 and in the Rhône-Alpes region 2 (one of the largest regions in France) by inferring the population mobility within the studied areas using mobile phone data. These various use cases demonstrate the model's ability to generalize on different types of data. In our previous work Goel and Sharma 2020, we proposed fully mixed model that is composed of the (local) transmission rate of the infection , and to cover the mobility aspect, we introduce parameters: (1) ' ' which is a social connectivity parameter that signifies how well individuals are socially linked with each other, and (2) ' c (i,j) ' that represents individuals mobility from a region j to another region i. Thus, the infection can transfer within the region with the transmission rate and can also be introduced from other regions through global transmission rate which depends upon , c (i,j) , I j (fraction of infected at region j) and . In this study, we extend our work to complex networks model as well. With the help of Fig. 1 , we illustrate our proposed models (fully mixed and for all cells. a shows the fully mixed model and infection in this model can transfer as follows: For region 6, its social connectivity is . The mobility of individuals from region 4 to region 6 and fraction of infected individuals at region 4 is represented as c(6, 4) and I(4), respectively. Therefore, infection can transfer from region 4 to 6 via global transmission rate c(6, 4)I(4) . Similarly, c(6, 13)I(13) and c(6, 16)I(16) signifies the global transmission rate from region 13 and 16, respectively, to region 6. On the other hand, b shows the complex network model: similar to fully mixed model, infection can transfer from region 4 to 6 via global transmission rate c(6, 4) (4) , where takes care of the degree of the individual (see Sect. 3 for detail) complex networks) for better understanding. We applied our model on a synthetic networks as well as on a new region, Rhône-Alpes in France considering the population density and the connectivity among different sectors, which is created using mobile phone data. Finally, we utilize our model to investigate the following questions: • How social connectivity parameter ' ' affects the fraction of individuals in different compartments (susceptible, infected and recovered) The limitation of traditional compartmental epidemiological models is that they do not account for the role of reducing social connectivity (or isolation) and mobility restrictions during the outbreak of a pandemic such as COVID-19. The proposed model overcomes this constraint. We discovered that the reproduction number R 0 for a pandemic is influenced by social connectivity and mobility. We also discovered that restricting movement during a pandemic decreases the fraction of individuals in an infected compartment, and restricting social interaction (or isolation) slows the peak and reduces the number of infected individuals from the pandemic. We believe that these models can help to adopt a balanced strategy to address a pandemic crisis. The contribution of our work is twofold: 1. Previous mobility-based SIR model is proposed for fully mixed model (Goel and Sharma 2020) . The main limitation of fully mixed model is that it assumes that every person at every location is linked to everyone else at that location. However, in reality, people interact with a limited number of people to form a complex network with non-trivial topological features that do not occur in simple networks such as lattices or random graphs but often occur in networks representing real systems (Albert and Barabási 2002) . Therefore, this paper propose mobility-based SIR model for complex networks that is more realistic than fully mixed models. 2. Next, we show that our model perform well on real scenarios by projecting the expected COVID-19 cases at county scale (Estonia) and regional scale (Rhône-Alpes region in France). The rest of the paper is organized as follows. Next, we discuss related works with respect to epidemic modeling. We then describe the model preliminaries and derivations in Sect. 3. Sections 4 and 5 present the evaluation results of our fully mixed and complex networks model, respectively. We conclude with a discussion of future directions in Sect. 6. In this section, we discuss relevant literature with respect to epidemic modeling which involves two different lines of work. The first involves agent-based modeling, while the second is related to compartmental based modeling. In the agent-based modeling, authors model epidemics by simulating the actions and interactions of autonomous agents (both individual or collective entities such as organizations or groups) with a view of assessing their effects on the system as a whole (Epstein 2009 ) by using transportation systems such as road networks (Eubank et al. 2004) , airways (Chinazzi et al. 2020) etc. These models have been used for understanding various epidemics such as smallpox (Burke et al. 2006 ), influenza (Khalil et al. 2012 , cholera (Crooks and Hailegiorgis 2014), and very recently COVID-19 (Chinazzi et al. 2020; Tuomisto et al. 2020) . In contrast to agent-based modeling, differential equation-based compartmental models have also been used for understanding epidemics, which is the basis of this work. This line of literature is mainly based on the classical SIR model proposed by Kermack and McKendrick (Kermack and McKendrick 1927) followed by Anderson and May (1979) ; Anderson et al. (1992) . In Anderson and May (1979) , the authors considered the host population as a dynamic variable rather than constant, as conventionally assumed, which provides a broader understanding of the population behavior during infectious disease. In their work in Anderson et al. (1992) , authors discuss the idea of the basic reproductive rate, threshold about host densities, and modes of transmission. Different variations of the SIR model have also been proposed to capture various real-world scenarios. For example, introducing a delay in the model to capture the incubation period during the spreading (Zhang et al. 2010; Xia et al. 2012; Liu 2015; Arquam et al. 2018) or the introduction of interventions such as antiviral drugs (Towers et al. 2011) . In a different work to represent the nonlinear nature of epidemic spread, a SIR rumor spreading model was proposed in which tie strengths were dependent on nodes' degree (Singh and Singh 2012) . Apart from SIRbased models, there exist several flavors of compartmental models, which represent different scenarios such as SIS (Nåsell 1996) , where individuals do not recover and can become susceptible again. This model has also been studied using varying types of underlying topologies (Shi et al. 2008) . A set of works have also focused on exhibiting the epidemic spreading by using varying types of underlying network structures. For example, authors in Moreno et al. (2002) ; Barthélemy et al. (2005) , and Vespignani (2012) used a scale-free network and in Li and Wang (2006) a small-world evolving networks for evaluating their epidemiological framework. In their work in Kiskowski (2014), researchers combine a discrete, stochastic SEIR (E stands for exposed) model with a three-scale community network model to demonstrate that the different regional trends may be explained by different community mixing rates. A detailed study concerning various epidemic models on varying topologies has been done in Pastor-Satorras et al. (2015) . In another line of work, the authors proposed models to understand epidemics based on the speed of growth. For example, in Viboud et al. (2016) , authors applied their generalized-growth model to characterize the ascending phase of an outbreak on 20 different epidemics. Their findings revealed that sub-exponential growth is a common phenomenon, especially for pathogens that are not airborne. In another work Huang et al. 2016 , researchers explained the rapid spread of H1N1 in 2009 around the world by using a flexible Bayesian, space-time, susceptible-infected-recovered (SIR) modeling approach. Gojovic et al. (2009) developed a simulation model of a pandemic (H1N1) 2009 outbreak in a structured population using demographic data from a medium-sized city in Ontario and epidemiologic influenza pandemic data. A set of work have also focused on mobility aspect during COVID-19 Goel and Sharma 2020; Arenas et al. 2020 . In comparison to previous works, the proposed model introduces mobility and social connectivity parameters in both fully mixed and complex networks models, the key characteristics for turning epidemics into pandemics. In this section, we first explain the classical SIR model and then discuss its limitations with respect to the absence of mobility and social connectivity parameters. Next, we describe our proposed model to understand the spreading of an infection during a pandemic. In 1926, Kermack and McKendrick (1927) proposed the classical SIR model as follows: where s(t), i(t), r(t) are, respectively, the fraction of susceptible, infected and recovered population at time t. However, the classical SIR epidemic model proposed by Kermack and McKendrick (1927) does not consider the heterogeneity and topology of the real-world network. To overcome this limitation, we introduce the mobility and social connectivity parameters in our proposed model. Let 'l' represent the total number of locations, and 'c' denote the connection (or individuals' mobility) between locations. The propagation of infection at each location is explained as follows: each healthy individual can get the infection either from an infected individual located in the same location (local transmission) or from an individual visiting from other connected locations (global transmission). The local transmission rate of infection is represented by and the recovery rate as , with and ∈ [0,1]. Next, we discuss the local transmission of infection (Sect. 3.1.1), the global transmission (Sect. 3.1.2), and then the dynamical behavior of the nonlinear system of infection for fully mixed model (Sect. 3.1.3). (1) Page 5 of 18 105 Let N i be the population at location i, where i ∈ l , and the total population is divided into three compartments. The compartments for location i at time t are as follows: 1. S i (t) : the number of individuals susceptible or not yet infected. This compartment is referred as susceptible compartment. 2. I i (t) : the number of infected individuals which can further spread the disease to the individuals present in the susceptible compartment. This compartment is referred to as infected compartment. 3. R i (t) : the number of individuals who have been recovered from the infected compartment. This compartment is referred as recovered compartment. Our assumptions regarding the transmission of an individual from one compartment to another compartment are as follows: 1. A healthy individual after becoming infected moves from susceptible to the infected compartment. 2. An individual can recover spontaneously at any time with recovery rate . The recovery of an individual is independent of healthy and infected compartments' individuals. 3. Once the individual gets recovered, it will become immune to the disease and, thus, will not transmit the infection to individuals in the susceptible compartment. 4. In addition, this model ignores the demography that is birth or death of individuals. In other words, the population remains constant. Let j (j ⊂ l) represent a set of locations, which are connected to location i. Therefore, ∑ j N j is the maximum possible number of individuals connected to location i, from all the locations j. The parameter c i,j reflects the mobility of individuals from locations j to location i. Global transmission depends upon this mobility parameter of individuals from one location to another. Similar to local transmission, I j is the number of individuals in the infected compartment in location j. Hence, total mobility of infected individuals from all the other connected locations to location i is Considering the above description, the chances of transmission of infection from all the connected locations to loca- . This transmission further depends upon the social connectivity ( ) of all the individuals at location i. Therefore, the proportion of healthy individuals at location i which can get infected from infected individuals from loca- . Thus, the mean-field equations for the dynamics of the pandemic, based on the above discussed interactions are the following: where Eq. 4 describes the rate of change of susceptible individuals at location i, Eq. 5 refers to rate of change of infected individuals, and Eq. 6 explains the rate of change of recovered individuals at location i. The reader can refer to Table 1 for notations and their meaning. Equations (4-6) represent a nonlinear dynamical system of pandemic spreading, where, at any time t, In order to solve mean-field Eqs. (4-6), following assumptions are made (please note that these assumptions are not considered during our experiments): Individuals mobility from location j to i 1. Initially, the population at all locations is equal to N(t) at time t. 2. Individuals in infected compartments are equal to I(t) at all locations at time t and ∑ j I j = �j� ⋅ I j = kI j , where k is the number of locations connected to location i, that is, k = |j|. 3. The mobility of individuals from one location to another location is a fraction of total population N. Let n be the sum of fraction of population mobility from |k| locations. Then, the total individuals mobility from set of locations j to i is n * N . Therefore, ∑ j c i,j = nN. By considering the above assumptions, Eqs. 4 and 6 can be written as From Eqs. 8 and 9 For simplicity, Eq. 12 can be written as: Equation 13 can be rewritten as Solving Eq. 15, we get As pandemic arrives at steady state when t ⟶ ∞ hence dR dt = 0 and R ∞ = C, where C is a constant: Let the initial conditions be: R(0) = 0 , I(0) = I and S(0) = N − I ≈ N . Therefore, Eq. 17 can be written as: Normalizing Eq. 18 by dividing by total population N gives: Therefore, the reproduction number R 0 is In case there is no social connectivity to other locations ( = 0 or k = 0 or n = 0 ), then the mobility SIR model will become the standard SIR model and the reproduction number is R 0 = . Therefore, the reproduction number is directly proportional to social connectivity parameter , number of connected locations k, and depends upon individuals' mobility during a pandemic. The fully mixed model that we presented in this section has one key limitation, i.e., it assumes that every person at every location is linked to everyone else at that location. However, in reality, people interact with a limited number of people to form a complex network with non-trivial topological features that do not occur in simple networks such as lattices or random graphs but often occur in networks representing real systems Albert and Barabási 2002. Therefore, in the next section, we propose mobility-based SIR model for complex networks. In this section, we discuss the local transmission of infection (Sect. 3.2.1), the global transmission (Sect. 3.2.2), and then the dynamical behavior of the nonlinear system of infection by considering complex networks interactions at each location (Sect. 3.2.3). Let N i be the population at location i, where i ∈ l, and k is the degree of each individual, where k ∈ (whole numbers). The total population is divided into three compartments. The compartments for location i at time t are as follows: Our assumptions regarding the transmission of an individual from one compartment to another compartment are same as discussed in Sect. 3.1.1. Let j (j ⊂ l) represent a set of locations, which are connected to location i. Therefore, ∑ j N j (k) is the maximum possible number of individuals of degree k connected to location i, from all the locations j. The parameter c i,j,k reflects the mobility of individuals of degree k from locations j to location i. Global transmission depends upon this mobility parameter of individuals from one location to another. Similar to local transmission, I j is the number of individuals in the infected compartment in all the locations j. Hence, total mobility of infected individuals of degree k from all the other connected locations to location i is ∑ j c i,j,k . Considering the above description, the chances of transmission of infection from all the connected locations to location i are ∑ j c i,j,k . This transmission further depends upon the social connectivity ( ) of all the individuals at location i. Therefore, the proportion of healthy individuals at location i which can get infected from infected individuals from location j is . Thus, the mean-field equations for the nonlin-ear dynamics of the pandemic, based on the above discussed interactions are the following: where where Eq. 21 describes the rate of change of susceptible individuals of degree k at location i, and Eq. 22 refers to rate of change of infected individuals of degree k, and Eq. 23 explains the rate of change of recovered individuals of degree k at location i. Please refer Table 2 for notations and their meaning. Mobility of individuals of degree k from location j to i Ψ(k � ) Infectivity of a node with degree k ′ Equations ( For simplicity, Eq. 33 can be written as: can be rewritten as Solving Eq. 37, we get As pandemic arrives at steady state, when t ⟶ ∞ , hence dR dt = 0 and R ∞ = C, where C is a constant. Let initial conditions are R(0) = 0 , I(0) = I and S(0) = N − I ≈ N . Therefore, Eq. 39 can be written as Normalizing Eq. 40 by dividing by total population N gives: Therefore, the reproduction number R 0 is The R 0 is called basic reproduction number which determines the spread of infection. When R 0 > 1 , the propagation occurs at a fast rate. When R 0 = 1 , the propagation happens at a slow rate. When R 0 < 1 , the propagation finishes. In case there is no social connectivity to other locations ( = 0 or m = 0 or n = 0 ), then the mobility SIR model for complex networks gives the reproduction number as R 0 = ⟨k 2 ⟩ ⟨k⟩ . Therefore, the basic reproduction number is directly proportional to social connectivity parameter , number of connected locations m, depends upon individuals' mobility during a pandemic, and degree of an individual in a complex network. In this section, we first explain our experimental setup, and next, we discuss the results of our simulation conducted using the proposed model for non-complex network on synthetic data. In addition, we also show results of our model when applied for predicting the number of COVID-19 cases at country level (Estonia) and regional level (Rhône-Alpes region in France). For the analysis, we created an aggregated flow matrix whose cells represent the number of trips of individuals per day from origins to destinations. We call this matrix Origin-Destination (OD) matrix. The synthetic OD matrix considered for our experiment follows a random distribution. Furthermore, three different techniques are considered for selecting the seed infection location: 1. Pandemics origin from a random location: In this, a random location is selected as seed infection location, and a small fraction of individuals were infected at that location. Here, the seed location is selected strategically, i.e., in a location which is weakly connected to other locations. This implies the least mobility of individuals from this location to other locations. In this also, the seed location is selected strategically, i.e., in a location which is strongly connected to other locations. This signifies that the highest mobility of individuals from a location is considered for the selection of the infection seed. Our simulation is oriented toward addressing the following questions: • How social connectivity parameter ' ' affects the fraction of individuals in different compartments (susceptible, infected and recovered) during a pandemic? • What are the outcomes of restricting the mobility (for top-X percentile) of strongly connected locations? • What is the relationship between social connectivity parameter ' ' and the mobility restriction (top-X percentile of strongly connected locations)? • How efficiently this model can perform in real scenarios? We answer this question by projecting the expected COVID-19 cases for Estonia and Rhône-Alpes region in France. We perform various simulation experiments to explain the proposed model on OD matrix by using previously discussed techniques for selecting the seed infection location. It is to be noted that the model will behave as a standard SIR model in two cases, (i) if = 0 , (ii) if the mobility is reduced to 100 percentile (that is no mobility allowed) from connected locations. The effect of restricting the mobility from the top-X percentile of highly connected locations with other locations is shown in Fig. 3. Figure 3a , d displays the pandemic dynamics with different percentile of mobility restrictions of highly connected locations starting with 0-30% (keeping = 0.5 ). We observe that in the case of a pandemic, restricting the mobility from the top-10 percentile of highly connected locations can reduce the number of individuals who can get infected to 27%. Therefore, quarantine plays a vital role during pandemics. In order to understand the relationship between and mobility restriction from strongly connected locations, we performed the numerical simulation of the proposed mean-field equations (see Fig. 4 ). It shows the fraction of population in various compartments at time t → ∞ for various value combinations of and mobility restriction. From Fig. 4 , we can infer that the social connectivity parameter ' ' and mobility plays both a fundamental role in determining the dynamics of the pandemics. Therefore, it is advisable to follow a dual strategy approach during a pandemic outbreak as controlling mobility reduces the fraction of infected individuals, and delays the peak. Furthermore, we analyzed the number of days required to reach the point where the highest fraction of individuals get infected (see Fig. 5 ). This indicates that mobility restrictions and minimal social contact will postpone the pandemic's peak and will give sufficient time for preparations, especially for the health sector. Figure 2 displays the influence of the social connectivity parameter ' ' while keeping the other parameters constant. Figure 2a , f shows the pandemic dynamics with different values of starting with = 1 to = 0.1 . We observe that the peak of the infected compartment decreases significantly, as the decreases, and it also takes longer to reach its peak. This indicates that there is a positive impact of lockdown in controlling a pandemic. Figures 6, and 7 display the influence of the social communication parameter ' ' while keeping the other parameters constant for weakly and strongly connected locations, respectively. Figure 6a , f shows the pandemic dynamics with different values of starting with = 1 to = 0.1 for weakly connected locations. Similarly, Fig. 7a , f shows the pandemic dynamics with different values of starting with = 1 to = 0.1 for strongly connected locations. It can be noted that when a pandemic originates from a weakly connected location, it takes longer to reach its peak compared to when it starts from a strongly connected location. This shows that the location of origin also plays an important role during a pandemic. Similar to a random location, reducing mobility from the highly connected locations by 10 percentile can reduce the number of infected individuals between 18 and 27% for weakly and strongly connected locations, respectively. To demonstrate the usability of the model, we applied it to real-time data of Estonia's COVID-19 cases. Figure 8 shows the actual number of cases and the cases forecast by the model using different values for and mobility percentile. For example, = 0.95 indicates that the social connectivity of individuals is reduced by 5% and also top-5 percentile of strongly connected locations are restricted from mobility. Similarly, = 0.7 implies that the social connectivity of individuals is reduced by 30%, and also the top-30 percentile of strongly connected locations have introduced restricted mobility. For simulation, we created the OD matrix between counties of Estonia using call data records Goel et al. 2021; Hiir et al. 2019 . Furthermore, these call interactions are converted into population mobility between counties using Estonian population data from census Estonia 2018. For the local transmission of the virus (within the county), we consider the reproduction number R 0 = 2.5 (Organization W.H.et al. 2020) . Cases reported until March 11, 2020 are considered as an initial condition for the model. The reason behind selecting March 11, 2020 as initial condition is that, till this date no local transmission of the virus was reported 3 . Till the day of initial condition, the Estonian Health Board confirmed 13 cases in Harju and two cases in Tartumaa and Saaremaa each 4 . During the simulation, the number of cases in all other counties are initialized to zero. The infection rate and recovery rate are adjusted according to the value of R 0 for COVID-19. The reported cases in Estonia as well as the forecast cases using the model are shown in Fig. 8 until April 9, 2020. It can be noticed that the model predicted much higher cases of COVID-19 if no restrictions are introduced ( = 1). However, as the restrictions were introduced by the Government, 5 the number of cases got damped (Actual). Thus, the applicability of this model is to forecast a range of predicted number of cases which can help the government and health agencies to understand the impact and introduce proportional interventions to restrict the spread of the epidemic. (b) Log-log degree distribution. The degree distribution of our synthetically generated network shows that it follows power law distribution We also applied our model to real-time data of Rhône-Alpes region's COVID-19 cases. Figure 9 shows the actual number of cases and the cases forecast by the model using different values for and mobility percentile. The region is divided into 14 sectors. For simulation purpose, we again considered the OD matrix between the sectors of the region obtained via network signaling data of Orange (the largest telecommunications provider in France) and census data. 6 This OD matrix has been built using the approach proposed by Fekih et al. in Fekih et al. (2020) . The latter uses a heuristic-based approach to extract trips between sectors at individual level from mobile phone passive traces. As the signaling data collected by the telecommuting operator covers around 30% of the whole population of the region, the resulting trips have been re-scaled using the census data. Finally, after an aggregation step, we obtain the regional OD matrix used in this study. For privacy matters, the number of COVID-19 cases is not reported in France at a fine spatiotemporal resolution in publicly available data. 7 Instead, the dataset only reports cumulative values of COVID-19 cases on a 7-day rolling window for each area of the administrative segmentation of the French territory. In addition, the number of cases is reported discretely, i.e., as a range of values between a lower and upper bound containing the real value. Therefore, in order to obtain a daily estimation of the number of COVID-19 cases per each sector of the analyzed Rhone-Alpes region, two main assumptions have been made. On the one hand, we consider the number of cases as the mid value between the lower and upper bound of the reported range for the given area on a specific day. On the other hand, we replaced the 7-day rolling time window by the median day (i.e., the 4th day of the time window). As a result, to obtain the daily estimation of the number of cases, the cumulative reported estimation provided on a 7-day rolling time window is divided by 7. After summing this estimation for all the administrative areas belonging to a given sector of the OD matrix, we finally obtain an estimation of the number of COVID-19 cases per sector and per day. COVID-19 data cover the period from November 2, 2020 toMarch 19, 2021. For our simulation, the number of cases in all sectors is initialized as on November 2, 2020. For the local transmission of the virus (within the sector), we consider the reproduction number R 0 = 2. 5 Organization W.H.et al. 2020 . The infection rate and recovery rate are adjusted according to the value of R 0 . The reported cases in Rhône-Alpes region in France as well as the forecast cases using the model are shown in Fig. 9 until March 22, 2021. It can be noticed that the model predicted much higher cases of COVID-19 if no restrictions are introduced ( = 1), while we can observe that, for = 0.5, the number of actual cases and forecast ones are quite close to each other. To explain this result, it is worth to remind that strong mobility restrictions were reintroduced in France by the end of October 2020, 8 after the first lockdown ended during summer. The new restrictions contributed to keep low the number of COVID-19 infections (Actual). Moreover, it is reasonable to assume that mobility and social interactions were already significantly reduced at the beginning of this second lockdown, with respect to prepandemic behaviors, as a consequence of the first COVID-19 wave and previously imposed restrictive measures. In conclusion, this second case study confirms the applicability of the model to forecast a range of predicted number of cases. The latter can thus help the government and health agencies to understand the impact and introduce proportional interventions to restrict the spread of the epidemic. In this section, we first explain our experimental setup, and next, we discuss the results of our simulation conducted using the proposed model for complex networks on synthetic network. For the analysis, we created a synthetic network using configuration model Newman 2003, which follows the power law distribution (see Fig. 10 ) with scale-free exponent (gamma) as 3. In particular, we utilize the The configuration model produces a random pseudograph (graph with parallel edges and self-loops) by randomly assigning edges to fit the given degree sequence. We removed all parallel edges and self-loops from our network. Table 3 provides the statistics of our synthetic network. Based on the definition of configuration model and various properties of our synthetic network, we can infer that this network reflects the real-world contact network Voitalov Our simulation is oriented toward addressing the following questions: • How social connectivity parameter ' ' affects the fraction of individuals in different compartments (susceptible, infected and recovered) for a complex network? • What are the outcomes of restricting the mobility (for top-X percentile) of strongly connected locations in a complex network? We perform various simulation experiments to explain the proposed model for complex network on ConNet by selecting the seed infection location randomly. It is to be noted that the model will behave as a standard SIR model in two cases, (i) if = 0 , (ii) if the mobility is reduced to 100 percentile (that is no mobility allowed) from connected locations. Pandemic Origins From a Random Location. Figure 11 displays the influence of the social connectivity parameter ' ' while keeping the other parameters constant. Figure 11a to 11i shows the pandemic dynamics with different values of starting with = 1 to = 0.2 . We observe that the peak of the infected compartment decreases significantly, as the decreases, and it also takes longer to reach its peak. This indicates that there is a positive impact of lockdown in controlling a pandemic. The effect of restricting the mobility from the top-X percentile of highly connected locations with other locations is shown in Fig. 12 . Figure 12a , f displays the pandemic dynamics with different percentile of mobility restrictions of highly connected locations starting with 10-30% (keeping = 0.5 ). We observe that in the case of a pandemic, restricting the mobility from the top-10 percentile of highly connected locations can reduce the number of individuals who can get infected to 15-21%. Therefore, quarantine plays a vital role during pandemics. In order to understand the relationship between and mobility restriction from strongly connected locations, we performed the numerical simulation of the proposed mean-field equations for complex network. The results are similar to the mean-field equations for non-complex network (see Fig. 4 ). Therefore, we can infer that the social connectivity parameter ' ' and mobility both plays an important role during pandemics. Classical compartmental epidemic models are unable to describe the spreading pattern of pandemics such as COVID-19 as they do not take into account the effect of social connectivity and mobility in the spreading of the virus. Our proposed mobility-based SIR models for fully mixed and complex networks shows the significance of social connectivity and mobility during pandemics by taking into consideration the local and the global transmission rate of the infection. We have simulated the fully mixed model by considering three different origins of the infection, namely random location, weakly connected location, and strongly connected location. Our simulation shows that limiting the social connectivity reduces and delays the peak of the infected compartment. Our analysis also shows that restricting the mobility from the top-10 percentile of connected locations can reduce the number of infected individuals between 18 and 27%. From the mathematical proof for our proposed models, we obtained that the reproduction number R 0 directly depends upon social connectivity of individuals, number of connected locations and individuals mobility between locations (and degree of the individual in complex networks model) which is in line with our simulation's results. This indicates that introducing isolation and quarantine is effective in fighting a pandemic crisis. Using the proposed model, we also simulated the real-world scenario by considering the COVID-19 cases in Estonia and Rhône-Alpes region in France. Simulation reveals that the mobility-based SIR model can be helpful to forecast the expected number of cases after some proportion of isolation and quarantine is introduced in the society. We plan to include various future directions for this work such as by simulating the model using additional temporal networks. Another direction would be to use additional mobility data such as transportation networks for better understanding the pandemic behavior. Importantly, we plan to introduce infection delay and recovery delay simultaneously in our future studies. Author Contributions Rahul Goel contributed to formal analysis, investigation, validation, visualization, and writing-original draft. Loïc Bonnetain contributed to data creation for Rhône-Alpes region and writing-review and editing. Rajesh Sharma contributed to supervision and writing-review and editing. Angelo Furno contributed to supervision and writing-review and editing. Funding This research is funded by ERDF via the IT Academy Research Programme, and H2020 framework project, SoBigData++, and CHIST-ERA project SAI. This work has also been supported by the French ANR research project PROMENADE (grant number ANR-18-CE22-0008). Availability of data and material Our CDR datasets are partly location data and cannot be shared due to privacy concerns. Additionally, despite the fact that the datasets are anonymized at two levels, there is still a small chance that specific persons can be identified. The dataset is owned by the two university laboratories involved in this paper and could be accessible for research purposes after signing an NDA (nondisclosure agreement) and on condition of acceptance by the network Statistical mechanics of complex networks Population biology of infectious diseases: part i Modeling the spatiotemporal epidemic spreading of covid-19 and the impact of mobility and social distancing interventions Modelling and analysis of delayed sir model on complex network Dynamical patterns of epidemic outbreaks in complex heterogeneous networks Agent-based modeling: methods and techniques for simulating human systems Individual-based computational modeling of smallpox epidemic control strategies The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak COVID-19 Dashboard by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University. 2020. Crooks AT, Hailegiorgis AB (2014) An agent-based modeling approach applied to the spread of cholera Coronavirus covid-19 global cases by the center for systems science and engineering (csse) at johns hopkins university First global estimates of 2009 h1n1 pandemic mortality released by cdc-led collaboration Modelling to contain pandemics Quarterly bulletin of statistics Estonia. An overview of social and economic developments in Estonia Modelling disease outbreaks in realistic urban social networks A data-driven approach for origin-destination matrix construction from cellular network signalling data: a case study of Lyon region (France). Transportation Mobility based sir model for pandemicswith case study of covid-19 Understanding gender segregation through call data records: an Estonian case study Modeling competitive marketing strategies in social networks Modelling mitigation strategies for pandemic (h1n1) Exploring network structure, dynamics, and function using networkx The mathematics of infectious diseases Impact of natural and social events on mobile call data records-an estonian case study Bayesian estimation of the dynamics of pandemic (h1n1) 2009 influenza transmission in queensland: a space-time sir-based model The basic si model An sirs model with a nonlinear incidence rate A three-scale network model for the early growth dynamics of 2014 west Africa Ebola epidemic Controlling the spreading in small-world evolving networks: stability, oscillation, and topology A delayed sir model with general nonlinear incidence rate Epidemic outbreaks in complex heterogeneous networks The quasi-stationary distribution of the closed endemic sis model The structure and function of complex networks Pandemic h1n1 2009 Coronavirus disease 2019 Epidemic processes in complex networks Dynamic models of segregation An sis model with infective medium on complex networks 22012) Nonlinear spread of rumor and inoculation strategies in the nodes with degree dependent tie strength in complex networks Antiviral treatment for pandemic influenza: assessing potential repercussions using a seasonally forced sir model Tikkanen T (2020) An agent-based epidemic model reina for covid-19 to identify destructive policies Modelling dynamical processes in complex sociotechnical systems A generalized-growth model to characterize the early ascending phase of infectious disease outbreaks Weighted hypersoft configuration model An sir model with infection delay and propagation vector in complex networks Analysis of a delayed sir epidemic model