key: cord-0233604-aubyej23 authors: Volov, V. T.; Zubarev, A. P. title: Toward ultrametric modeling of the epidemic spread date: 2020-05-15 journal: nan DOI: nan sha: 78f0c72284af8ea42f4fba96db3929ed5a00ec8b doc_id: 233604 cord_uid: aubyej23 An ultrametric model of epidemic spread of infections based on the classical SIR model is proposed. Ultrametrics on a set of individuals based on theire hierarchical clustering relativly to the average time of infectious contact is introduced. The general equations of the ultrametric SIR model are written down and their particular implementation using $p$-adic parameterization is presented. A numerical analysis of the $p$-adic SIR model and a comparison of its behavior with the classical SIR model are performed. The concept of hierarchical isolation and the scenario of its management in order to reduce the level of epidemic spread is considered. Recently, there has been activity in the development of mathematical models of the spread of infectious diseases (see, for example, [1, 2, 3, 4, 5] )). This is primarily due to both the detection of old foci of existing viral infections, such as plague, anthrax, Ebola, and the emergence of the new infections caused by coronaviruses such as SARS-CoV (2002) , MERS-CoV (2015) , and SARS-CoV-2 (2019). Almost all mathematical models describing the development of epidemics are based on the model proposed in 1927 by Kermack and McKendrick in work [6] , which is now known as the classic SIR model. This model is based on 3-step development of the epidemic, in which healthy but susceptible individuals (susceptible) S as a result of infection go to the infected class (infected) I individuals who, in turn, move to class R (removed), i.e. recover by acquiring immunity (recovered), or die (dead). Note that the SIR model was constructed by analogy with the theory of homogeneous chemical reactions. This model has many generalizations that include additional stages of individuals known as SIRS, SEIR, SEIRS, and others. The essence of the simplest SIR model is as follows. Let S (t) is the number of susceptibles, I (t) is the number of infected, I (t) is the number of removed individuals. The following simplifying assumptions are accepted: 1) the set of individuals is homogeneous; 2) the number of individuals under consideration (susceptible, infected and removed) is constant, i.e. S (t) + I (t) + R (t) = N; 3) the probability of transmission per unit time from an infected individual to a susceptible is constant; 3) the probability of reinfection is zero. Given these simplifying assumptions the SIR model equations are written aṡ Here β is infection rate (or the average number of contacts with susceptible individuals that leads to the new infected individuals per unit of time per infectivee), and γ is removing rateis (or the average rate of removal of infectives per unit of time per infectives in the population). The joint solution of equations (1) and (3) gives where S (0) ≡ S 0 and R 0 = β γ is reproductive ratios. From (4) follows where R ∞ ≡ lim t→∞ R (t). Equation (5) with S 0 N = 1 is solvable for R ∞ N on the segment (0, 1) only under condition R 0 > 1 and there is no solutions on this segment when R 0 < 1. For this reason , conditionR 0 > 1is interpreted in this model as an outbreak condition of epidemic. Today, many generalizations of the classic SIR model have been proposed, which use more complex scenarios for the development of the epidemic, taking into account the latency period, the finiteness of the time when individuals are in the stage of immunity, the impact of vaccination, etc. The classical SIR model, as well as most of its generalizations, are essentially homogeneous models, i.e. they assume that the intensity of infection does not depend on any relationship between healthy and infected individuals. Nevertheless, even in such homogeneous models, adequate estimates of the scenario for the development of real epidemics are possible. Recently, a number of heterogeneous generalizations of the SIR models based on deterministic or random networks have also been proposed (see, for example, [7, 8, 9, 10] ), taking into account the heterogeneous nature of the populations in which the epidemic is spreading. As a rule, this heterogeneity is associated with the difference in the characteristics of individual individuals, or with the heterogeneity of territorial population. Meanwhile, the most important characteristic that plays a crucial role in the speed of spread of the epidemic is the distribution of the duration of infectious contact between pairs of individuals. This value can be a characteristic of population heterogeneity, and it allows us to introduce a distance function on a set of individuals (metric) in terms of which it is possible to generalize the classical SIR model. In this paper, we propose an inhomogeneous generalization of the classical SIR model, which is based on the hierarchical clustering of the population according to the degree of potential infectious influence of individuals on each other. We describe the procedure for such hierarchical clustering of a set of individuals and show that such clustering involves the introduction of a distance function on the space of individuals, which is ultrametrics. This distance function is not directly related to the spatial distance between individuals, it is determined only by the potential for transmission of infection from one individual to another per unit of time. Using the ultrametrics function on a set of individuals, we generalize the equations of the classical SIR model and arrive at a model that we call the basic ultrametric SIR model. As you know, a convenient tool for parameterizing ultrametric spaces is the field of p-adic numbers. We consider the ultrametric set of individuals that maps to the boundary of a finite hierarchical graph with a constant number of branches. Since such a set is parametrized by the factor set of a p-adic ball of radius greater than 1 over the ring of p-adic integers. The corresponding ultrametric SIR model, we call the p-adic SIR model. We present a numerical analysis of the p-adic SIR model and compare its behavior with the classical SIR model. We also introduce the concept of hierarchical isolation index and consider the simplest scenario for managing this parameter in order to reduce the spread of the epidemic. Let U = {x} be a set of individuals in number N. Infectious contact A of two individuals x and y we will call any continuous time interval A = (t 1 , t 2 ) which satisfies the condition: during interval A one of the individuals (for example, x) can be infected by another individual y, provided that individual y is infected throughout the entire interval A, and the probability of infection per unit of time f (t) differs from zero on interval A almost everywhere. Note that the concept of infectious contact is a purely model concept. In almost every specific case, the value of the infective contact is extremely difficult to calculate accurately,but it is always possible to approximate it. However, in any real assessment of the duration of infectious contact, it must be understood that this concept does not necessarily mean physical contact between two individuals. The probability of infection of one individual by another individual may occur indirectly, i.e. through a physical contact with surrounding bodies (air, objects) that became sources of infection after physical contact with an infected individual. However, infectious contact does not imply the possibility of infection of a healthy individual by an already infected individual through any other individuals. Intuitively speaking, the term "infectious contact" is a continuous period of time during which a particular individual has the potential for infection (no matter how small it is) at any time, and this possibility of infection is caused by a particular individual. Let T 0 be the value of a certain time interval during which we observe the behavior of individuals. Then for individual x, there is a finite number of infectious contacts A 1 , A 2 , . . . , A k , k ≥ 0 with individual y during this interval. Denote by T (x, y) the sum of the interval lengths A 1 , A 2 , . . . , A k . Obviously always T (x, y) ≤ T 0 . Since the behavior of many individuals is random, T (x, y) is a random function which depends on the specific implementation of an ensemble of populations of individuals. Expected value of τ (x, y) = E [T (x, y)] is the average time of infectious contact between two individuals during interval T 0 . It is obvious that the function τ (x, y) is symmetric. Function τ (x, y) plays the main role in the hierarchical clustering of individuals, which we describe below. Consider an infinite decreasing sequence of numbers of real positive integers t 1 , t 2 , t 3 , . . ., 1 , ..., = U as follows. We will require that for any i and for any individual x ∈ B i 1 there exists another individual y ∈ B i 1 from the same group for which inequality τ (x, y) ≥ t 1 holds. Obviously, this partition is the only one. It follows from this partition that if i = j, then ∀ x ∈ B i 1 and ∀ y ∈ B j 1 then we have τ (x, y) < t 1 . Groups B (i) 1 will be called the first level clusters. can be combined into larger sets of individuals -the second level clusters, which we will denote by B In this case, we require that for any individual x ∈ B i 2 there exists another individual y ∈ B i 2 from the same cluster for which inequality τ (x, y) ≥ t 2 holds. It is x and y belong to the same first level cluster and t 2 ≤ τ (x, y) < t 1 if x and y belong to different first level clusters. Similar to the above mentioned clusters of the second level can be combined into third level clusters B 3 from the same cluster for which inequality τ (x, y) ≥ t 3 holds. The third level clusters can be combined into fourth level clusters B , and so on. This procedure of hierarchical clustering of individuals must eventually be interrupted, because at some n-th step we will get a single cluster of n-th level B (1) n that will coincide with the set of individuals U. For any individual x ∈ B i n ≡ U, there exists an individual y ∈ B i n ≡ U for whisch τ (x, y) ≥ t n holds. So, the hierarchical procedure for clustering individuals described above allows us to introduce on set U the structure of hierarchically nested clusters B (j) i . By construction, any two clusters of arbitrary levels either do not intersect, or one is contained in the other. The resulting hierarchical structure of clusters at various levels can be described by a hierarchical graph. An example of such a graph is shown in Figure 1 . Hierarchical clustering of the set of individuals U allows us to introduce the ultrametrics d (x, y) on this set. We define function d (x, y) in the following way. If individuals x and y belong to the same level i cluster, but belong to different level i − 1 subclusters, Proof. To prove that function d (x, y) is ultrametric, it is sufficient to prove that any "triangle" (x, y, z), where x ∈ U, y ∈ U, z ∈ U is isosceles, and the two largest "sides" , individuals x and y belong to some level i cluster B (i) , but they belong to different subclusters B y) . Suppose also that d (y, z) < d (x, y) and d (x, z) < d (x, y). Then each of the pairs (y, z) and (x, z) belongs to some cluster of a level smaller than i. Let (y, z) ∈ B j 1 , (y, z) ∈ B k 1 , where j, k < i. Both clusters B j 1 and B j 1 contain the same element y . Therefore, one of them is contained in the other. Then the larger cluster must contain all three individuals x, y, z. But this means that x and y are contained in a cluster of a level smaller than i. Therefore, simultaneous execution of inequalities d (y, z) < d (x, y) and d (x, z) < d (x, y) is not possible. Hence either d (y, z) = d (x, y), or d (x, z) = d (x, y). The claim is proven. In its meaning, ultrametrics d (x, y) on a set of individuals is a value equal to the ratio of the time of observation of the population to the minimum boundary value of the average times of infectious contact of any two individuals belonging to the same minimal cluster as individuals x and y. We make the following model assumption, which is that the probability of infection of a healthy individual x by an infected individual y per unit of time is proportional to value 1 d (x, y) . Naturally, this assumption is a rather rough model approximation, since it assumes that during the infectious contact of individuals, the probability of infection per unit of time f (t) of a healthy individual infected is constant and does not depend on individuals. Naturally, real function f (t) is not constant and depends on many factors (contact space, pathogen concentration, individual susceptibility, etc.). However, this approximation allows us to simplify the model significantly, while preserving its main quality property -the hierarchical nature of the development of the epidemia. Let P I (x, t), P R (x, t),P S (x, t) be the probability that at time t, individual x is correspondingly healthy, infected, or removed. Then we can write the following ultrametric generalization of the equations (1) -(3) Function makes sense of the probability of infection by an infected individual y of a healthy individual x per unit of time. For large N, the total number of healthy, infected, and retired individuals is Then in the case of the triviality of ultrametrics d (x, y) = 1, x = y, 0, x = y, and the independence of the functions P I (x, t), P R (x, t), P S (x, t) from variable x, taking into account S(t) = NP S (x, t), S(t) = NP S (x, t), R(t) = NP R (x, t), we arrive at a system of equationsṠ which for large N in approximation N − 1 N ≈ 1 coincides with the system (1) -(3). We will call the system of equations (7) -(9) equations of the basic ultrametric SIR model. Recall the definition of p-adic number. Let Q be a field of rational numbers and let p be a fixed prime number. Any rational number x = 0 is uniquely represented as where γ is an integer, and a, b are natural numbers that are not divisible by p and have no common multipliers. p-Adic norm |x| p of number x ∈ Q is defined by the equalities |x| p = p −γ , |0| p = 0. The field of p-adic numbers Q p is defined as a completion of the field of rational numbers Q by p-adic norm |x| p . The norm on Q p induces the metric d(x, y) = |x − y| p which is ultrametric, i.e. satisfies the strong triangle inequality (6). We will denote: B i (a) = {x ∈ Q p : |x − a| p ≤ p i } -a ball of radius p i centered at point a, On Q p there exists a unique (up to a factor) Haar measure d p x which is invariant with respect to translations d p (x + a) = d p x. We assume that d p x is a full measure; that is,Ẑ Under this hypothesis the measure d p x is unique. For more information about p-adic numbers, p-adic analysis and its applications, see [11, 12, 13] . For our purposes, we can assume that the number p is a natural number p = m > 2. In this case Q p is a ring of m-adic numbers Q m with the pseudonorm |x| m , which also induces on Q m the ultrametrics d(x, y) = |x − y| p [14] . So let p = m ≥ 2 be a natural number. Further, let each level i cluster contains exactly p level i−1 clusters, the number of levels is n, and the total number of individuals is N = p n . In this case, the set of individuals U can be parameterized by the factor set B n /Z p and we assume that U = B n /Z p . Alternatively, we can describe the set of individuals by B n ⊂ Q p , but assume that each individual is described by a ball of unit radius. Let D (λ) be an arbitrary non-negative non-decreasing function defined on R + and satisfy the condition D (0) = 0. Then function D |x − y| p is also an ultrametrics on B n and we can write the equations of the basic ultrametric SIR model (7) -(9) in the form: where Ω(|x| p p i ) = 1, |x| p p i ≤ 1, 0, |x| p p i > 1. As function D |x − y| p in equations (14) -(16), we choose To preserve the interpretation, we will assume that functions P S (x, t), Here W 0 is the class of functions that are constant on any ball of unit radius, L 1 (B r , d p x) is the class of functions that are integrable on B r and C 1 (R + ) is the class of functions that are differentiable with respect to t. As each individual is described by a ball of unit radius. In this case, the possible nonzero values of ultrametrics (17) are p iα , i = 1, 2, . . . n. This means that the boundaries of the average infection contact times t i of individuals belonging to different maximum subclusters of level i and i + 1 clusters are correlated as In the p-adic model under consideration, we will call value r the index of hierarchical isolation. We will assume that the coefficient of β depends on α. We fix this dependence by requiring that the average value of W of function W (x, y) (10), averaged over all pairs of individuals, coincides with β N , where β is infection rate of classis SIR model. In this case we have Imposing the requirement W = β N we get and β (0) = βp −n . Thus, the p-adic SIR model is characterized by the following parameters: p -the number of maximum subclusters for each cluster; n is the number of levels for hierarchical clustering; β -infection rate of classical SIR model; γ -removing is the ratio of the boundaries of the average times of infectious contact of individuals belonging to different maximum subclusters of clusters of neighboring levels. (17), it follows that functions P S (x, t), P I (x, t), P R (x, t) will be constant on subsets S i = x ∈ B n /Z p : |x| p = p i for i = 1, 2, . . . , n. We denote by e i (x) a characteristic function of a subset of S i . Then functions P S (x, t), P I (x, t), P R (x, t) can be decomposed by the basis of functions {e 0 (x) , e 1 (x) , . . . , e n (x)}: Substituting this expansion into equations (14) -(16) we get the following systeṁ The total number of susceptible, infected, and retired individuals is Below we investigate numerical solutions of equation (19) -(21) by Runge-Kutta-Fehlberg 45 method. Figure 2 shows dependence of the cumulative number of infected individuals C = I + R on time in the classical and p-adic SIR models. In doing so, we have choosen p = 5 (the number of individuals in the minimal cluster), n = 8 (the number of clusters), and α = 0.5. Value of infection rate for the classical SIR model is chosen as β = 0.2. We also choose the average time of the disease course is equal 10 days, which corresponds to value γ = 0.1. Accordingly, the reproduction number for the classical SIR model is R 0 = 3 (for comparison, the reproductive ratios for flu is R 0 = 1.3 ÷ 2.8, for COVID-19 R 0 = 2.2 ÷ 5.7 [15] ). For these parameters, we have r =2.24 and the ratio of the maximum and minimum average time of infection contact between individuals is Figure 4 corresponds to the case of a very weak spread of the epidemic in p-adic model.. Fig. 5 show dependence of the cumulative number of infected individuals s in p-adic SIR model for paramiters p = 5, n = 8, β = 0.3, γ = 0.1 and different α. As it can be seen from these dependences there must exist critical α = α 0 (p, n, β, γ) such that at α < α 0 the spread of the epidemic takes place, but at α > α 0 the epidemic does not spread. For values p = 5,n = 8, β = 0.3, γ = 0.1, we have the numerical value of α 0 ∼ 1. Unfortunately,we could not get an exact analytical expression for function α 0 (p, n, β, γ). In p-adic SIR model, hierarchical isolation index (18) can be considered as a control parameter that can be changed to control the spread of an epidemic. To illustrate, we will consider a situation where, as the epidemic grows, enforcement restrictions are introduced to redistribute the time of infectious contact between individuals in clusters of different levels, while maintaining the average contact time is constant. In reality, the introduction of such restrictions means that infectious contact between pairs of individuals with a small ultrametric distance must de increased, and infectious contact between pairs of individuals with a large ultrametric distance must be decreased. In the p-adic model, the introduction or strengthening of already introduced restrictions means an increase in the α parameter. In Figure 6 , we present dependence of the cumulative number of infected individuals in the case of controlling their hierarchical isolation, i.e., increasing the value of parameter α at time t 1 from value α 1 = 0.8 to value α 2 = 1.1 with other parameters equal to p = 5, n = 8, β = 0.2, γ = 0.1. We see that taking adequate enforcement restriction to increase the hierarchical isolation index from r 1 = 3, 62 to r 2 = 5, 87 reduces the cumulative number of infected by more than 2 times. In this paper, we have developed an ultrametric model of the epidemic spread of infection in the population, based on the classical SIR model. This model is also based on the concept of infectious contact between any two individuals in the population, which determines the potential for transmission of infection from one individual to another. The formalization of this concept leads to hierarchical clustering of the population and the introduction of ultrametrics distance on a set of individuals. The ultrametric distance between individuals reflects the measure of transmission of infection between individuals and is included in the ultrametric generalization equations of the classical SIR model. Changing the ultrametric structure of the proposed model can affect the spread of the epidemic in the population. Therefore, in contrast to the classic SIR model and its modifications, the proposed model can provide recommendations for avoiding a cumulative scenario of epidemic development based on managing the process of infectious interaction between clusters of individuals at various levels. This corresponds to a certain algorithm for isolating isolated strata of the population at the micro, meso, and macro levels. In conclusion, we note that the ultrametric model proposed in this paper is the basic model. It can be supplemented with various additional scenarios, such as the possibility of re-infection, the latent period, the time spent by individuals in the immune stage, the impact of vaccination, etc. We reserve the implementation and application of numerous extensions of the basic ultrametric model for future research. Infection Deseases of Humans: Dynamics and Control Mathematical Epidemiology of Infectious Diseases: Model Building, Analysis and Interpretation Modeling Infectious Diseases in Humans and Animals Mathematical models in population biology and epidemiology Statistical physics of vaccination Contribution to the Mathematical Theory of Epidemics How viruses spread among computers and people Epidemic outbreaks in complex heterogeneous networks Epidemic spreading on heterogeneous networks with identical infectivity SIR dynamics in random networks with heterogeneous connectivity On p-adic mathematical physics, p-Adic Numbers p-Adic analysis and mathematical physics Ultrametric calculus. An Introduction to p-adic Analysis Some aspects of the m-adic analysis and its applications to $m$-adic stochastic processes, p-Adic Numbers High Contagiousness and Rapid Spread of Severe Acute Respiratory Syndrome Coronavirus 2