key: cord-0051801-99h0x8n4 authors: Zemtsov, S. P.; Baburin, V. L. title: COVID-19: Spatial Dynamics and Diffusion Factors across Russian Regions date: 2020-10-20 journal: Reg DOI: 10.1134/s2079970520030156 sha: 7a44cbbd70a888c5011947d32081aa02e0fdd630 doc_id: 51801 cord_uid: 99h0x8n4 The observed spread of coronavirus infection across Russian regions, as a first approximation, obeys the classic laws of diffusion of innovations. The article describes in detail theoretical approaches to the analysis of the spread of social diseases and discusses methodological limitations that reduce the possibility of predicting such phenomena and affect decision-making by the authorities. At the same time, we believe that for most regions, including Moscow, until May 12, 2020, the dynamics of confirmed cases are a reduced and delayed reflection of actual processes. Thus, the introduced self-isolation regime in Moscow and other agglomerations affected the decrease in the number of newly confirmed cases two weeks after its introduction. In accordance with our model, at the first stage, carriers infected abroad were concentrated in regions with large agglomerations and in coastal and border areas with a high intensity of internal and external links. Unfortunately, the infection could not be contained, and it started growing exponentially across the country. By mid-April 2020, cases of the disease were observed in all Russian regions; however, the remotest regions least connected with other parts of Russia and other countries had only isolated cases. By mid-May, at least in Moscow, the number of new cases began to decline, which created the prerequisites for reducing restrictions on the movement of residents. However, the decrease in the number of new cases after passing the peak of the epidemic in May is slower than the increase at the beginning. These facts contradict the diffusion model; thus, the model is not applicable for epidemiological forecasts based on empirical data. Using econometric methods, it is shown that for different periods of diffusion, various characteristics of the regions affect the spread of the disease. Among these features we note the high population density in cities, proximity to the largest metropolitan areas, higher proportion of the most active and frequently traveling part of the population (innovators, migrants), and intensive ties within the community, as well as with other regions and countries. The virus has spread faster in regions where the population has a higher susceptibility to diseases, which confirms the importance of the region’s health capital. The initial stage was dominated by random factors. We conclude this paper with directions for further research. Recent decades, in Russia and abroad, have been marked by the increasing frequency of natural hazards, 1 including epidemics. However, the COVID-19 pandemic is distinguished by its global scale, the unprecedented actions undertaken by the authorities, and the public's response. We performed our calculations in the initial stages of the epidemic. As of mid-May 2020, there were more than 4.5 mln confirmed 1 IPCC Fifth Assessment Report (AR5) (2014): https:// www.ipcc.ch/assessment-report/ar5/. Accessed May 11, 2020. cases of infection globally, including more than 1.56 mln cases, where the patients have recovered, and 0.29 mln fatal cases. 2 As of May 12, 2020, Russia had more than 230 000 confirmed cases of the SARS-CoV-2 coronavirus infection, including about 43000 people who were discharged from hospitals and more than 2100 deaths. 3 According to new data after the article was accepted for publication as of October 5, 2020, 1225 889 cases of coronavirus infection were registered in Russia (35.4 mln in the world), 982 324 people (80.1%) were discharged for recovery. 21 475 patients with coronavirus died in Russia (1.75%), 1.036 mln died in the world (2.95%). 4 We can identify several global trends that determine the growing vulnerability of the population against various diseases: the climate change and the related invasive spread of infections and parasites; the growing population density (urbanization) and mobility, which lead to the rapid spread of diseases; population aging and environmental degradation, which result in lower health (immunity) and higher susceptibility to diseases. Therefore, it is reasonable to believe that the number of future local outbreaks and epidemics will increase. In our opinion, the geography of the distribution of confirmed cases of the disease is characterized by the classic factors of the diffusion of innovations, where the diffusion is affected by the structure of local communities in terms of their population density, connections, social activity, income, etc. Therefore, we find it relevant to apply the corresponding models of spatial diffusion [1, 2] with certain limitations, discussed in detail in the methodological part. In this work, the application of the diffusion model is not aimed at studying new technologies post-factum but at identifying factors and scenarios related to the spread of the disease. The hypothesis of our study is that the spatial distribution of COVID-19 in Russia can be correctly modeled using diffusion models of social innovations, where deviations from the standard model are determined by random factors, limited statistical data, validity of quarantine and other (testing) measures undertaken by the authorities, and society's response to these measures. In regions with a small number of identified cases, the stochastic component and subjective factors can lead to significant deviations from the model. However, as the virus spreads in the regions, the importance of random factors decreases, and the effect of the authorities actions grows. The aim of this study is to describe the geography and dynamics of the spread of confirmed (registered) cases of COVID-19 in Russia using the model of the diffusion of innovations and to estimate the possible factors affecting this process in the regions. The diffusion of innovations is a process of spreading new phenomena in a community through various communication channels [27] . Traditionally, the distribution of innovations in communities was studied by sociologists [27] , economists [15, 18] , and geographers [20] , who had gained extensive experience in terms of modeling social changes (revolutions, new institutions, laws, etc.), introduction of new technologies (Internet, mobile communications, etc.), development of new products and services (computers, smartphones, Internet banking, etc.), and dissemination of information (advertising, propaganda, etc.) 5 . Some works discuss the modeling of social diseases, such as HIV/AIDS [16] , which not only spread from one carrier to another by the principle of infection (neighborhood) but also account for the structure of communities. As we show below, COVID-19 can also be related to this type of disease in the early stages of its spread. According to the classic works [20, 27] , which remain relevant, innovations, including social diseases, spread in the community in its ascending part (up to the saturation point α) following the logistic curve in cumulative form (the S-shaped light curve in Fig. 1a , the proportion of those infected at certain points in time) and obeys the normal distribution law for new infections (the bell-shaped dark curve) [1] . In terms of the time of the spread of the disease since the first confirmed case in the community, several groups in the population can be distinguished [27] (the X axis in Fig. 1a ; Table 1 ), which differ in their ability to contract the disease (age) and transmit it, primarily, based on their social activity: innovators (1), early adopters (2), early majority (3), late majority (4), and laggards (5) . Innovators are active and open members of the community, prone to moving outside the country. In Russia, these are often young people, students, representatives of creative professions, athletes, programmers, etc. Since they often visit other countries, regardless of the size of their income, they could contribute to the spread of the virus in the initial stages. Note that in terms of the diffusion of the disease, this category can also include individual foreign workers from countries-centers of infection, as well as representatives of professions involved in transport services, tourism, and other areas of activity related to international contacts. Early adopters are most often middle-aged people with a high income and social status, such as politicians, top managers, senior officials, famous scientists, doctors, popular representatives of creative professions (actors, directors, musicians, writers, etc.), and others. They actively participate in various public events, visit foreign countries, including for business trips. Due to their status, they are able to ignore the demands of the government. The early adopters themselves are able to exert pressure on the federal and regional authorities with regard to taking certain measures. They are able to infect the largest number of people in the initial stages of distribution, as they represent the center of social networks. The spread of the disease can be stopped, primarily, by isolating the innovators and early adopters; however, this is the most challenging task. Note that the capacities of healthcare systems around the world are not designed to cope with even innovators alone; for example, the number of beds does not cover more than 1.5% of the population in the leading country in terms of this indicator, Japan, and the number of beds for infected patients is even less. 6 Therefore, complete hospitalization of confirmed infected people is not possible in most countries; accordingly, government measures are limited to recommendations for quarantine and self-isolation, which may be ineffective for the reasons given above. Various forms of personal contact represent an important factor in the spread of infectious diseases. After innovators and early adopters became infected abroad, they carried over the infection to their communities (often without knowing that they were infected 7 ). The higher the number of potential carriers of infection the higher its spread rate and the greater the number of new carriers. This process accelerates exponentially until the number of infected members of society reaches half of the maximum possible (the inflection point α of the S-shaped curve in Fig. 1 ). After this, the trend reverses direction. 6 F. Bass [15] divided community representatives into two groups depending on the model of the spread of innovations (Fig. 1b, formula (1)). The first group includes innovators who have adopted the innovation without interacting with members of their community. In our case, we are talking about the first confirmed infected people who visited the foci of the disease abroad (China, Italy). The second group consists of imitators, who have adopted the innovation after personal contact with the innovators; i.e., they were infected in Russia. Thus, the probability of getting infected with coronavirus at time t can be determined by the linear dependence on the number of virus carriers in the community: (1) where p is the coefficient of primary infection or the proportion of innovators, q is the coefficient of imitation or the percentage of those infected from primary carriers, and F(t) is the proportion of infection carriers at time t. In this case, the probability function is a distribution close to normal (Fig. 1b) . Having calculated the derivative, we obtain the probability density function for a newly infected patient at time t (f(t)): (2) where is the maximum potential number of cases. The development of the discussed approach resulted in new models [25] that may be applicable to the analysis of the spread of diseases. For example, spatial models describe the characteristics of the spread of the virus depending on the proximity of the source of infection (China or Moscow) (Fig. 1c) : the source itself is characterized by the maximum number of infected people and the highest rate of spread, near the source these parameters are reduced, and at the -Cascading (hierarchical) mechanism, which suggests the primary infection in the largest agglomerations stemming from innovators who have visited other countries; the disease spreads down the hierarchy of cities (Fig. 2b) ; -Network (chain) mechanism, which combines the first two models and suggests the spread of the disease along transport networks (Fig. 2c) ; in our case, airports, air communications, and sea ports; i.e., the infection is first concentrated in transport hubs. 8 The contagious (contact) model is predominantly valid at microlevels (within urban communities), while the hierarchical and network models are more applicable at the macrolevels (regions and countries). 8 Confined spaces with a large number of potential carriers (aircraft, buses, ships, etc.) contribute to the rapid spread of the virus. To apply the model, we assumed that the curve in Fig. 1b can be described by a parabola's equation, and then, transforming formula (2) into a nondifferential form, we obtain the following expression 9 : where ΔF(t) is the number of new infections per capita at time t and ε(е) is the residual. In this case, it is easy to find the main parameters of the quadratic equation: the inflection point (extremum α in Fig. 1 ) and the maximum number of infected people ( in Fig. 1 ). Nevertheless, modeling itself is possible only after reaching the first stage (the emergence of imitators), and its accuracy becomes sufficient to make reliable forecasts only after reaching the inflection point (extremum of the parabola). The classic model of the spread of infectious diseases, SIR, in contrast to the one that we described, takes into account not only infectious individuals but also other distribution-related components: susceptible and recovered [22, 29] . The SIER model also considers exposed (potentially susceptible, latent) patients and the SEIRD model additionally takes into account 9 Also, a more accurate method based on an estimate of the maximum likelihood can be used for modeling. dead patients. Thus, these models make it possible to simulate the most important parameter, the mortality rate, and estimate the burden on the health system. They are not perfect and can underestimate the peak of the disease and overestimate its duration [19, 23] . The contagiousness of the virus could be underestimated [24] . One of the difficulties in using epidemiological models is related to the need to determine the potential number of exposed people and the contact coefficients in society, in hospitals, etc. Due to the lack of high-quality data, these parameters are often defined based on previous epidemics. In this case, modeling concerns the real number of infected patients, not the statistically confirmed one. Any model is always a simplified reflection of only a part of the factors affecting the process. The medical component is rather successfully modeled by the corresponding epidemiological models. The specification of the model of the diffusion of innovations (3) proposed by the authors can serve the purpose of forecasting only with limitations, since it depends just on the previous rate and the number of those infected. These values are highly volatile, 10 and the differences population structure, advances in medicine, availability of testing, government regulation, etc. Applying the diffusion model, it is necessary to take into account a number of limitations when interpreting the results ( Table 2 ). The mismatch between real cases of infection and those confirmed by statistics is especially important, since in fact, in the data of Rospotrebnadzor and the World Health Organization, we observe underestimated and delayed statistics. Moreover, there are reasons to believe that the extent of these distortions varies between the phases of the epidemic, from country-to-country, from region-toregion, etc., depending on the state of medicine, availability of tests, general public health, etc. However, in our opinion, this number is proportional in regions to the real number of those infected with a certain lag (equal, as will be shown later, to approximately two weeks), and as the number of tests increases, these discrepancies will fade. Due to the lack of more reliable statistics, we assume that modeling concerns only the most severe forms of the disease, which led the person to turn to specialists and take tests; i.e., latent forms are not taken into account. At the same time, the scale of testing and the manifestation of latent forms of the disease are interrelated with the geographical factors of the spread of the epidemic (including the city-village dichotomy). In other words, the confirmed incidence is a reflection of these processes. There are reasons to believe that the correlation between the confirmed incidence and the real rate is rather high at the regional level in the later stages, when random factors and testing issues play a lesser role. 12 In contrast to epidemiological models, we assume the symmetry of the infection rates after the inflection point, which, for example, is not typical for mortality 12 More than 5.4 million laboratory tests for COVID-19 were carried out in Russia as of May 11, 2020, which can be considered a fairly high coverage of the population (https://rospotrebnadzor.ru/about/info/news/). The question of causal relationships between the confirmed morbidity and the number of tests is discussed in more detail in [10] . 1 The first antibody tests in the United States and Italy showed that most carriers did not seek help and were not tested, since the disease was mild. 2 As of mid-April 2020, this limitation in Russia has been overcome [10] ; however, regional imbalances are still possible, especially taking into account the uneven representation of private laboratories. 3 Different regions may simultaneously apply different approaches: testing only those hospitalized with suspicious symptoms, all those who have contacted them, risk groups, everyone, or random testing. Approaches are rapidly changing as the number of cases increases, tests become more available, etc. In particular, on April 8, Russia allowed testing in private laboratories. Even with a similar number of tests per capita, different approaches will yield different results. However, when analyzing Russian regions, we can assume similar requirements, at least in state medical institutions. 4 Since April 8, 2020, the methodology for confirming the diagnosis has changed. According to the recommendation of the Ministry of Health of Russia, not only positive tests but also clinical symptoms were sufficient to confirm the infection. Statistics reflect the spread of the innovation accurately, consistently, and without time lags The ratio of actual infection cases (able to transmit the virus further) and confirmed cases (using which we make model estimates) may differ for several reasons: -the lack of universal testing; therefore, the potential number of infected can be significantly greater 1 ; -delay in the diagnosis, visit to the doctor, and reflection in statistics [10] ; -unreliable existing tests [10] , high proportion of false results; -low availability of tests in the early stages 2 ; -different ways of organizing testing in terms of the scale, obligation, and duration 3 ; -different methods for identifying those infected 4 Diffusion begins around the same time everywhere In cities and regions of Russia, the coronavirus starts spreading at different times Diffusion reaches approximately even saturation Proportions of the maximum number of confirmed cases in all cities and regions are different The distribution shape is close to the logistic curve It is not typical of all cities and regions, as it depends on actions of authorities, community structure, etc. Binary process of adopting the innovation (adopted or not adopted) The patient may recover, may not show symptoms, the disease may not be confirmed from COVID-19 13 (the actual number of deaths decreases rapidly after the inflection point). In accordance with the diffusion model of innovations, in contrast, the downward wave of infections in the regions may be more extended (Fig. 1c) ; i.e., attenuation can take a longer time than during the rising wave. At the same time, we model only the first active part of the epidemic with a large number of new cases. It is obvious that individual outbreaks of the disease will continue to be recorded for a long time in the absence of vaccination of the population. In addition, second and further waves of the disease are possible. To overcome these and other limitations, multifactor models can be used that take into account the advances made in medicine, incidence, testing, and other factors as control variables. However, such models, in the conditions of the rapid spread of the disease, do not actually allow taking into account the daily change in factors. Most indicators in Russian statistics lag by one or two years and do not have a monthly or quarterly breakdown even by region, not to mention municipalities. Therefore, we are forced to use the least squares method for a specific period of the spread of the disease in the regions of Russia, making the assumption that the factors themselves (independent variables) have not changed since the time they were reflected in the statistics and do not significantly distort the intra-annual dynamics. 14 Partially, the choice of the regional level of analysis was determined by the available statistics, although this yields generalized results for modeling a phenomenon that spreads through the close contact between people. At the same time, this kind of modeling is possible only at the regional level, sufficient to identify geographical patterns of diffusion. For example, the intracity level is characterized by mainly microgeographic and random factors related to the location of hospitals, nursing homes, crowded places, etc. Of course, the authors understand the role of the subjective factor; however, it is also most significant at the micro level. In order to identify factors that influenced the spread of infection, in our study we used the following model [1] : (4) where ΔF is the number of new infections per capita per day; i is a Russian region; t is the number of days; F (t -1) is the number of carriers per capita on the previous day; and n is the maximum potential number of infected patients, which depends on n factors in period j: -basic geographical charateristics of area: population density, concentration in one city, density of the transport network, distance to the center of the outbreak, etc.; -characteristics of the regional community: proportion of innovators, income, education, intensity of interaction within the community and with other communities (tourists), anti-contagion actions, etc.; -characteristics of the spread of the disease: the number of days from the first infection, the total number of people infected in the region, the susceptibility of the population to the disease (age, general health, medical infrastructure), etc. As a respiratory disease, COVID-19 spreads most rapidly through personal contact. To account for this, we need to understand how many people in the region reside in territories with certain population densities, which partially overcomes the disadvantage of an excessively generalized level of analysis. Other things being equal, a higher density leads to a greater number of contact between people and a higher rate of the spread of the disease. The hypotheses and all variables are described in Table 3 . Each stage is characterized by its own combination of factors [1, 2] . At the initial stage, the position relative to the source of innovations (according to the neighborhood-based model) and transport centers (the network diffusion model) and the proportion of potential innovators may be more important. In accordance with the hierarchical model of diffusion, large cities and densely populated centers with large airports, where the proportion of community members and migrants frequently traveling abroad is higher, are most affected. Most of the variance remains unexplained due to random factors. At the stage of rapid spread, a key role can be played by the structure of the community and connections between the innovators and the majority. Infection encompasses regions near the foci of the disease, and accordingly, the proportion of temporary migrants is significant here. Poorer regions with high overall morbidity are generally can be more susceptible to the spread of COVID-19. We used crime rates and unofficial employment as an inverse indicator of the trust of community members to each other, and, accordingly, the development of small and medium-sized businesses (attraction points for residents) as an indirect indicator of potential internal ties [3] . At the stage after the inflection point, the role of random factors decreases and the number of new infections in remote regions grows. The earlier the first case is confirmed in the region the lower the infection rate at this stage. Several strongly correlating variables were chosen to describe each factor (and hypothesis); therefore, the authors tested the indicators one at a time, choos- ing the model with the best characteristics: maximum R 2 , minimum Schwarz information criterion, and significance (*) of a larger number of variables. Independent variables are taken with a time lag, which partially solves a possible endogenous problem. Nevertheless, in the regressions, we used only spatial data without taking into account the change in variables over time, therefore it is more correct to speak not about the factors determining the spread of the disease, but about the characteristics of regions where the incidence was higher. Considering the dynamics of the spread of the disease in the Russian regions (Figs. 3 and 4) , we can trace the classic stages of spatial diffusion [2, 20] . The first stage of origin before March 7, 2020: more than 80% of the first recorded infected patients (innovators) were concentrated near the source of the infection (Chinese workers in Zabaykalsky krai) and in the center of the city hierarchy: Italian workers and Russian tourists coming from Italy to Moscow. The rest were randomly shared between transport hubs and near the foreign source of infection. The source of infection (Moscow) and innovators (foreign workers) may have been weakly related to the other regional communities. It was possible to stop the disease under a strict quarantine and forced hospitalization of the arrivals [13] . The Russian authorities evacuated Russian citizens from the Chinese province of Hubei, the one most affected by the epidemic. All visitors from the foci of the disease were quarantined. The government temporarily suspended the issuance of work visas to Chinese citizens and limited air traffic with China and the crossing of the Russian-Mongolian border. However, these measures turned out to be insufficiently effective and belated, 15 since not everyone who arrived from abroad followed the quarantine regulations. The second stage of emergence and slow spread lasting from March 7 to March 15, 2020 was marked by the first confirmed infected cases inside the country due to the neighborhood effect: imitators. 16 From this point on, stopping the spread of the disease became problematic. About 80% of the carriers were concen- 15 At least compared to the measures to combat the outbreak of the cholera epidemic in the south of the former Soviet Union in 1970 [13] . 16 Rospotrebnadzor reported three coronavirus infection cases in Russia: https://tass.ru/obschestvo/7983369. Accessed May 11, 2020. Petersburg, Moscow oblast, Nizhny Novgorod oblast, Sverdlovsk oblast, Rostov oblast, and Tatarstan), introduced the self-isolation regime, which was supposed to slow the spread and allow the healthcare system to cope with the influx of patients and minimize the number of additional deaths. We note that, at the exponential stage, Murmansk oblast and Komi Republic had high infection rates (see Fig. 4 ): in the northern regions, the population is concentrated in several cities, which are characterized by a high proportion of migrants and active interaction with other cities and countries. Temporary workers come from Moscow, for example, to PAO NOVATEK construction sites in Murmansk, while local residents go on vacations to the south. Presumably, in early May, the spread of the disease is before the stage of accumulation near the inflection point. There has been widespread distribution at a decreasing rate in the largest centers, the proportion of infected regions without large agglomerations is growing, and the percentage of people who recovered is increasing (see Fig. 3 ). At the same time, the number of new infections is minimal in the largest cities (with the exception of the two capitals and Nizhny Novgorod) and the peripheral remote regions. In the case of a steady decrease in the number of new cases within the next two weeks, the first restrictions have been lifted [11] . On May 7, 2020, the maximum number of newly confirmed cases of the disease in Moscow was recorded, after which it decreased. The inflection point in Russia was passed on May 11 (11 656 new cases). The saturation stage lied ahead: decay of the spread, absence of large numbers of new infections in multiple regions, and an exponential increase in the proportion of survivors. It is possible to lift almost all restrictions. Unfortunately, unlike the diffusion of technologies, COVID-19 will not disappear completely until a vaccine is found and universal vaccination is carried out. Therefore, the number of new cases began to rise again from September 2020, reaching a new pick of 10888 new cases on October 5. To construct the model of the spread of COVID-19 in Russia using formula (3), we distinguished several periods (Fig. 5) . The first period is the time before March 21, 2020, when the number of confirmed infected cases decreased slightly immediately after the introduction of additional antiepidemic measures in Moscow. According to the calculations for that date, in accordance with the assumptions of the model, the number of confirmed cases would have not exceed 1500, and the end of the active stage of the epidemic would have been in mid-April (Fig. 5a) . Apparently, this period was characterized by multiple random factors and low testing coverage, which could have easily misled the authorities and scientists about the future extent and danger of the disease. It is possible that if the strictest quarantine measures [13] with respect to visitors from abroad had been introduced immediately throughout the country in March, the described scenario could have been realized. However, this was not possible politically with such few infected cases. Even today some members of Russian society consider the measures taken to be an excessive restriction of their freedom. The scenario contradicts the experience of China, where quarantine measures were stricter but the epidemic lasted more than three months. Although according to the assumptions of the spatial diffusion model (see Fig. 1c ), the swiftness and scale of the spread in Russia could have decreased as the disease was spreading from the source of the outbreak (China) due to the growing experience in containment, the health system being more prepared, etc. This scenario was not implemented and the number of confirmed cases in the regions began to grow exponentially. On March 29, the self-isolation regime was introduced in Moscow and many regions, and the population of the largest cities minimized their mobility 18 and social contacts. Until April 15, 2020, the number of new cases was increasing exponentially (Fig. 5b) ; with such dynamics, without the actions of the authorities, the total number of confirmed cases could have approached half a mln by the end of June. But the rate of the spread was reduced, and according to the data of April 27, 2020, we can see the trend deviating from the exponential, which began two weeks after the introduction of the self-isolation regime (Fig. 5c) . In our opinion, this was due to the introduced restrictions and the proper response of citizens. The new scenario assumed about 150 000 confirmed cases across the country and the expected end of the active stage of the epidemic by mid-June. Since in the calculations the majority of those infected are actually Moscow resi- 18 dents, it should be said that these assumptions are more applicable to the capital and large metropolitan areas, while in other regions the epidemic may continue. However, this optimistic scenario was not realized. In the first days of May, the proportion of infections in Moscow began to grow again (see Fig. 3 ), with actual stabilization in half of the regions for a week. Perhaps this is partially due to the introduction of checkpoints in Moscow, when there were queues in the subway in mid-April. Changes in the infection rates and the dynamics of confirmed cases in statistics are separated by about a two-week lag. Extending the general testing and the initiated antibody testing could also affect the dynamics. A negative course of events might lead to the need to expand the regime of self-isolation and extend it until July, which in turn would have a devastating effect on small and medium-sized businesses and the entire economy in Russia [7] . Fortunately, the number of new cases in Russia began to decline slowly at the end of May, mainly due to a decrease in Moscow. As can be clearly seen from the plots (see Fig. 5 ), the diffusion models constructed using real data, rather poorly predict the development of events in the early stages of the spread of the disease before the inflection point, since they do not take into account the actions of the authorities and the public response. In addition, the smaller the number of cases the higher the stochastic component. The estimated evidencebased scenarios will be significantly differ for individual regions (see Fig. 1c) , and for the majority it is not applicable at all due to the prevalence of random factors and differences in conditions that we sought to account for in the next section of this paper. Although the number of new cases in Russia began to decline at the end of May, and many restrictions were lifted, the rate of these changes was significantly lower than anticipated in the model. And the graph in Figure 1a is asymmetric. Taking into account the second wave of the epidemic, the proposed model is not applicable for forecasting due to significant distortions in statistics, unpredictability of anti-epidemic measures and the reaction of residents. In order to identify the most significant regional characteristics affecting the rate of spread of the disease, we first tested the hypothesis on the high significance of the population density (Table 4) , taking into account the control variables: the cumulative number of carriers per capita in the region the previous day (Corona t -1 ) and the number of days since the first confirmed case (Day) [28] . In the first period, before the introduction of restrictions on movement (as of March 15, 2020) , the only positive significant factor was the proportion of residents living in municipalities with the population density above 3000 people/km 2 (Pop10000). These are mostly the highly densely populated areas of the country's largest agglomerations (for example, the central regions of Moscow) and transport hubs, as described above. In the second period, two weeks after the introduction of restrictions (as of April 13, 2020) , the significant positive factors included the saturation level achieved (Corona t -1 ) and the number of days since the beginning of the epidemic (Day), that is, the number of previously infected, due to the neighborhood effect and exponential growth. At the same time, the epidemic did not reach the least densely populated municipalities; therefore, the spread of the disease was lower in regions where the proportion of residents living in municipalities with the population density of less than 8 people/km 2 was high (Pop8). In the period of exponential growth, the process covered the whole country, and in the regions where it started earlier, the rate of the spread decreased (negative coefficient in variable Day). In the last period, on approaching the inflection point, the number of cases in the most densely populated areas again increased (Pop10000), where the density of confirmed carriers per capita was high (Corona t -1 ). This was due to the growing number of new cases in Moscow, Moscow oblast, and Nizhny Novgorod oblast, which may be explained by the expanding testing 19 . It can be clearly seen from the increasing R 2 that the stochastic process, almost inexplicable at the initial stage (R 2 is slightly above zero), was becoming increasingly deterministic (R 2 is above 0.6), except for the last period when the proportion of unexplained dispersion again increased (see Tables 4, 5) . Then we tested all the hypotheses given in Table 3 , alternately testing indicators for each of our favorite factors, and selected models with the best characteristics, where all variables are significant (Table 5) . At the first stage of diffusion, until March 15, one of the most significant factors is the total annual morbidity of the population. If the total morbidity in the region (Desgov) in previous year was 1% higher than in other regions, the number of newly confirmed cases of COVID-19 per mln inhabitants was 0.042% more. At the same time, it is not clear what effect prevails here, the low public health [8] and susceptibility to infections or the proper development of the healthcare system and testing that is able to detect a new infection. As expected, the positive significant factor at the ini-REGIONAL RESEARCH OF RUSSIA Vol. 10 No. 3 2020 ZEMTSOV, BABURIN tial stage is the higher proportion of innovators, those visiting countries-foci of the disease (Tur2) every year and having technological skills (Online). The share of innovators is higher in the largest agglomerations, northern mining centers and near major Russian and foreign markets (EGP) [6] . Due to the epidemic outbreak in China (the largest market in the world) and in Moscow, the favorable economic and geographical position might contribut to the primary infection of the public. The rate of the spread is lower in regions where the level of trust in society is lower due to high crime rates (Crime) and, accordingly, where the intensity of interaction within the community, the density of small businesses, and the interpersonal communication are lower [3] . All other things being equal, none of the models had significant characteristics of transport accessibility (Aero), although it was assumed that large airports could be sources of infection. Most likely, this actually was the case; however, the presence and the proximity of densely populated (Pop10000) and large center, as measured by various indicators of geographical location (EGP, Demo), were more significant. In other words, it is not the presence of the airport that is important but the size of the center in which it is located, because infection rates were affected by the high density of social contacts, while air traffic was gradually restricted. The significance of the presence (Urb) and proximity of large cities (Demo) in the region was higher at the exponential stage of growth. This was the neighborhood effect: the infection spread from large centers (Moscow) to neighboring regions due to the return of temporary labor migrants and migration to summer cottages, which is clearly seen in Fig. 4 . At the stage of exponential growth (as of April 13, 2020) , despite the introduced self-isolation measures, the number of new infections per capita was 0.14% higher in regions where the proportion of Russian citizens accommodated in hotels was 1% higher (Rostur), that is, in regions with a high intensity of business trips from other regions. In regions with the higher annual overall morbidity of the population (Desgov) and higher health care costs (Medexp), the number of new cases was also higher. Such regions have a higher density of laboratories, better quality of equipment for detecting the disease, potentially lower public health, and a higher proportion of older people. In regions where the crime rate (Crime) is higher and, accordingly, the density of small business and social interaction is lower, the penetration rate of COVID-19 is lower. The number of new cases is 2.2% higher in regions where the poverty level (Poverty) is 1% higher, as the infection has spread to poor regions, and the poor population cannot fully adapt: stop working or switch to remote working, move to a summer house, order goods online, etc. At the late exponential stage (according to the data on April 27, 2020), the influence of some factors has changed to the opposite direction. For example, in regions with a higher overall incidence rate (Desgov) and medical staff availability (Medpers), the spread rate has become lower. Apparently, diffusion in large cities has slowed, and the disease has spread to regions with less developed health care systems. In the northern regions with high per capita incomes (Income_Min) and a high concentration of the population in certain municipalities, the number of new cases is greater. At the last stage, diffusion has spread to the northern large urban regions with a high proportion of residents with higher education (UrbHEd) and to the least developed regions with high levels of unofficial employment (Unform) and poverty (Poverty). According to this study, the diffusion model of innovations with a number of assumptions and limitations can be applicable for the spatial analysis of the spread of confirmed COVID-19 cases. We considered only severe statistically confirmed cases. The real extent of the spread is unknown due to the supposedly two-week lag between contracting the disease and diagnosis, the lack of universal antibody testing, 20 and other limitations described in detail in the methodology. The diffusion of COVID-19, in the early stages discussed in the article, was mainly determined by the situation in Moscow; thus, it should be noted that the scenarios considered by the authors characterized the situation in the capital, where the end of the active phase of the epidemic was expected by early July. In fact, increase in the number of contacts as a result of lifting quarantine measures, did not lead to a possible jump in the incidence rate in Moscow, where, according to the data as of July 10, 2020, only 637 new cases were recorded (a close value at the beginning of the epidemic is typical for April 7, 2020) . Unfortunately, the number of new cases is declining slowly in other regions with large agglomerations: St. Petersburg, Nizhny Novgorod, Omsk, Sverdlovsk, Novosibirsk, Voronezh regions, as well as in the centers of labor migration: Tyumen region, Khanty-Mansiysk and Yamalo-Nenets autonomous regions. At the same time, as a number of factors affect the spread and confirmation of the disease in the Russian regions, the disease continue spreading in the peripheral and semiperipheral regions: Magadan, Amur, Sakhalin, Kurgan regions. New cases will be observed until the formation of collective immunity and / or the full vaccination of the population. The first confirmed carriers of coronavirus came from abroad and were initially concentrated in the largest metropolitan areas, maritime centers, and regions neighboring China; however, the spread of the disease could not be contained. The experience of countries such as the Republic of Korea, Taiwan, Japan, and Singapore shows the importance of massive, prompt, and high-quality testing and tracing of all contacts of those infected at the initial stage. Such an approach, when properly implemented, can contain the spread of the epidemic without large-scale quarantine measures. However, at the initial stage, such measures were not sufficiently effective in Russia. By mid-April, the first cases were confirmed in all regions of the country. Therefore, the self-isolation regime that allows reducing the burden on the healthcare system during the period of the exponential growth of infections became necessary. However, its effectiveness was confirmed in statistics only after two weeks as a deviation from the exponential trend. At the same time, Sweden (at the initial stage) did not introduce harsh restrictive measures; instead, their authorities only recommended reducing social and other contacts. However, in this case, the level of confidence of citizens in the government and their ability and willingness to follow the recommendations are important. In Russia, with a low level of general trust in most regions [9] , such a policy did not work before the introduction of prohibitive measures. This is clearly seen in the dynamics of the self-isolation index of Yan- 20 The domestic antibody testing system was registered on April 10, 2020, and medical personnel began to be tested on April 15 [10] . It is known that the majority of those infected are asymptomatic. dex, which shows how light the road traffic is. This index rose only on March 19 after the introduction of the self-isolation regime. Among other regional characteristics determining the spread of the virus in the Russian regions, we note the high population density in individual municipalities, as an indirect indicator of social contacts in transport, in residential buildings, etc., proximity to the largest metropolitan areas as foci of infection, high proportion of the most active and frequently traveling part of the population, and the intensity of ties within the community and with other countries and regions. The rate of exponential spread is higher in regions with a high susceptibility of the population to diseases, which confirms the corresponding statements on the importance of the region's health capital. At the same time, the factors differ significantly at different stages of the spread. In the initial stages, the effect of random events is high, which reduces the ability to correctly model processes. We only considered confirmed infection cases. Without the data on the excess mortality in March-April over the mortality in these months on average in previous years, we cannot talk about the most significant effects of the epidemic due to statistical and other distortions in the current period [17, 21] . For example, many who died from COVID-19 had chronic illnesses. There are some data on regions in Italy and the UK, 21 where a significant (up to 60%) excess of the total mortality in March-April was observed. At the same time, the correct and final data on mortality in the European Union will not be available for at least another two months and that for Russia not until the end of the year. According to the preliminary data for Moscow, mortality in April 2020 is 20% higher than the average for the previous decade: 11 846 deaths compared to 9866. 22 It can be expected that the additional mortality rate from all causes in Russia in comparison with previous years will also increase. For example, mortality from heat waves and smoke in the European part of the country in 2010 was previously underestimated, while according to the results of the year, the additional mortality amounted to more than 55000 cases, including 11000 cases, or 60% increase compared to previous years , in Moscow [14] . This can also be the subject of further research. The socioeconomic consequences of the COVID-19 epidemic in 2020 in Russia still have to be estimated; however, the increase in morbidity, mortality, and the introduction of quarantine measures are resulting in higher unemployment and lower economic growth worldwide. The most negative impact will be on the business sector [7] . Although in Moscow most of those employed can switch to working remotely and in industrial areas with continuous manufacturing, people can and are obliged to work, in regions that depend on tourism and on small businesses in retail trade and household services, the consequences may be more severe. Moreover, in large agglomerations, the opportunities for adaptation of both the population and business are high due to the availability of various forms of remote work and a high degree of Internet penetration, as well as its use for ordering goods and services. The level of digitalization of the economy will dramatically increase. Estimates for the opportunities and risks of the digital economy were discussed earlier [30] . This is another prospective subject for research, which can clarify which regions and cities may experience more noticeable social changes in the event that the self-isolation regime is extended. Regions-innovators and innovative periphery of Russia. Study of ICT-products diffusion Innovatsionnyi potentsial regionov Rossii (Innovative Potential of Russian Regions) Entrepreneurship and institutions: Does the relationship exist at the regional level in Russia? Postal innovations in pre-Soviet Siberia Spatial diffusion of innovations: the scope of uncertainty and the network model Assessing the potential of economic-geographical position for Russian regions Development trends of small and medium-sized enterprises amid pandemicinduced crisis The impact of public health capital on the economic growth in Russian regions Social capital as a factor in the economic growth of Russian regions Comparative analysis of approaches to testing for Covid-19 in Russia and foreign countries Approaches to elimination of restrictions accepted to prevention of distribution of new coronavirus infection Infodemia: existing approaches to the analysis of panic, phobias, rumors, and fakes during epidemics and proposals to prevent them The cholera epidemic in the USSR in 1970 Heatwaves, air quality, and mortality in the European part of Russia in the summer of 2010: preliminary assessment results A new product growth for model consumer durables Diffusion of innovations and HIV/AIDS Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study The Spatial Diffusion of Technology Dynamics of COVID-19 epidemics: SEIR models underestimate peak infection rates and overestimate epidemic duration, medRxiv Innovation Diffusion as a Spatial Process Estimation of SARS-CoV-2 mortality during the early stages of an epidemic: a modeling study in Hubei, China and Northern Italy, medRxiv Early dynamics of transmission and control of COVID-19: a mathematical Identification and estimation of the SEIRD epidemic model for COVID-19 The reproductive number of COVID-19 is higher compared to SARS coronavirus New-Product Diffusion Models Modeling and forecasting the diffusion of innovation-A 25-year review Diffusion of Innovations Differential COVID-19-attributable mortality and BCG vaccine use in countries Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modeling study The risks of digitalization and the adaptation of regional labor markets in Russia An assessment of regional innovation system efficiency in Russia: the application of the DEA approach An extended SEIR model considering homepage effect for the information propagation of online social networks The authors thank V.M. Kidyaeva, N.K. Kurichev, T.N. Mikhailova, I.S. Kashnitsky, the anonymous reviewers, and other colleagues for their discussion and valuable comments. This study was supported as part of the 2020 RANEPA state task and Program 5-100 of Kant Baltic Federal University on Increasing Competitiveness. The authors declare that they have no conflict of interest.