key: cord-0047813-d04881lq authors: Volov, V. T.; Zubarev, A. P. title: Toward Ultrametric Modeling of the Epidemic Spread date: 2020-07-13 journal: P-Adic Num Ultrametr Anal Appl DOI: 10.1134/s2070046620030061 sha: cbdb14873c6307ee99ea9b754069aa394eee5051 doc_id: 47813 cord_uid: d04881lq An ultrametric model of epidemic spread of infections based on the classical SIR model is proposed. Ultrametrics on a set of individuals is introduced based on their hierarchical clustering relative to the average time of infection contact. The general equations of the ultrametric SIR model are written down and their particular implementation using the [Formula: see text]-adic parametrization is presented. A numerical analysis of the [Formula: see text]-adic SIR model and a comparison of its behavior with the classical SIR model are performed. The concept of hierarchical isolation and the scenario of its management in order to reduce the level of epidemic spread is considered. Recently, there has been activity in the development of mathematical models of the spread of infectious diseases (see, for example, [1] [2] [3] [4] [5] ). This is primarily due to both the detection of old foci of existing viral infections, such as plague, anthrax, Ebola, and the emergence of the new infections caused by coronaviruses such as SARS-CoV (2002) , MERS-CoV (2015) , and SARS-CoV-2 (2019). Almost all mathematical models describing the development of epidemics are based on the model proposed in 1927 by Kermack and McKendrick in work [6] , which is now known as the classic SIR model. This model is based on 3-step development of the epidemic, in which healthy but susceptible individuals (S) as a result of infection go to the infected class (I) individuals who, in turn, move to the removed class (R), i.e. recover by acquiring immunity, or die. Note that the SIR model was constructed by analogy with the theory of homogeneous chemical reactions. This model has many generalizations that include additional intermediate stages of individuals known as SIRS, SEIR, SEIRS, and others. The essence of the simplest SIR model is as follows. Let S (t) is the number of susceptible, I (t) is the number of infected, R (t) is the number of removed individuals. The following simplifying assumptions are accepted: 1) the set of individuals is homogeneous; 2) the number of individuals under consideration (susceptible, infected and removed) is constant, i.e. S (t) + I (t) + R (t) = N ; 3) the probability of transmission per unit time from an infected individual to a susceptible individuals is constant; 3) the probability of reinfection is zero. Given these simplifying assumptions the SIR model equations are written asṠ Here β is infection rate (or the average number of contacts with susceptible individuals that leads to the new infected individuals per time unit per infective), and γ is removing rates (or the average rate of removal of infective per unit time per infective). The joint solution of equations (1.1) and (1.3) gives where S (0) ≡ S 0 and R 0 = β γ is reproductive ratio. It follows from (1.4) that only under condition R 0 > 1 and there is no solutions on this segment when R 0 < 1. For this reason, condition R 0 > 1 is interpreted in this model as an outbreak condition of epidemic. Today, many generalizations of the classic SIR model have been proposed, which use more complex scenarios for the development of the epidemic, taking into account the latency period, the finiteness of the time when individuals are in the stage of immunity, the impact of vaccination, etc. The classical SIR model, as well as most of its generalizations, are essentially homogeneous models, i.e. they assume that the intensity of infection does not depend on any relationship between susceptible and infected individuals. Nevertheless, even in such homogeneous models, adequate estimates of the scenario for the development of real epidemics are possible. Recently, a number of heterogeneous generalizations of the SIR models based on deterministic or random networks have also been proposed (see, for example, [7] [8] [9] [10] ), taking into account the heterogeneous nature of the populations in which the epidemic is spreading. As a rule, this heterogeneity is associated with the difference in the characteristics of individuals, or with the heterogeneity of population density. Meanwhile, the most important characteristic that plays a crucial role in the speed of spread of the epidemic is the distribution of the time duration of infectious contact between pairs of individuals. This value can be a characteristic of population heterogeneity, and it allows us to introduce a distance function on a set of individuals (metric) in terms of which it is possible to generalize the classical SIR model. In this paper, we propose an inhomogeneous generalization of the classical SIR model, which is based on the hierarchical clustering of the population according to the degree of potential infectious influence of individuals on each other. We describe the procedure for such hierarchical clustering of a set of individuals and show that such clustering involves the introduction of a distance function on the set of individuals, which is ultrametrics. This distance function is not directly related to the spatial distance between individuals, it is determined only by the potential for transmission of infection from one individual to another per unit time. Using the ultrametric distance on a set of individuals, we generalize the equations of the classical SIR model and arrive at a model that we call the basic ultrametric SIR model. As known, a convenient tool for parameterizing ultrametric spaces is the field of p-adic numbers. We consider the ultrametric set of individuals that can be mapped to the boundary of a finite hierarchical graph with a constant number of branches. Such a set can be parameterized by the set of a p-adic balls of unit radius contained in a p-adic ball of radius greater than 1. We call the corresponding ultrametric SIR model the p-adic SIR model. We present a numerical analysis of the p-adic SIR model and compare its behavior with the classical SIR model. We also introduce the concept of hierarchical isolation index and consider the simplest scenario for managing this parameter in order to reduce the spread of the epidemic. Let U = {x} be a set of N individuals. Infectious contact A of two individuals x and y we will call any continuous time interval A = (t 1 , t 2 ) which satisfies the condition: during interval A one of the individuals (for example, x) can be infected by another individual y, provided that individual y is infected throughout the entire interval A, and the probability of infection per unit time f (t) is non zero on interval on A almost everywhere. Note that the concept of an infectious contact is a purely model. In almost every specific case, the value of the infective contact is extremely difficult to calculate accurately, but it is always possible to approximate it. However, in any real assessment of the duration of an infectious contact, it must be understood that this concept does not necessarily mean physical contact between two individuals. The probability of infection of one individual by another individual may occur indirectly, i.e. through a physical contact with surrounding bodies (air, objects) that became sources of infection after a physical contact with an infected individual. However, an infectious contact does not imply the possibility of infection of a susceptible individual by an already infected individual through any other individuals. Intuitively speaking, the term "infectious contact" is a continuous period of time during which a particular individual has the potential for infection (no matter how small it is) at any time, and this possibility of infection is caused by a particular other individual. Let T 0 be the value of a certain time interval during which we observe the behavior of individuals. Then for individual x, there is a finite number of infectious contacts Since the behavior of individuals is random, T (x, y) is a random function which depends on the specific implementation of an ensemble of populations of individuals. Expected value τ (x, y) = E [T (x, y)] is the average time of an infectious contact between two individuals during interval T 0 . It is obvious that the function τ (x, y) is symmetric. Function τ (x, y) plays the main role in the hierarchical clustering of individuals, which we describe below. Consider an infinite decreasing sequence of numbers of real positive numbers as follows. We will require that for any i and for any individual x ∈ B can be combined into larger sets of individuals -the second level clusters, which we will denote by B In this case, we require that for any individual x ∈ B x and y belong to the same first level cluster and t 2 ≤ τ (x, y) < t 1 if x and y belong to different first level clusters. Similarly the clusters of the second level can be combined into third level clusters B 3 from the same cluster for which inequality τ (x, y) ≥ t 3 holds. The third level clusters can be combined into fourth level clusters B 3 , and so on. This procedure of hierarchical clustering of individuals must eventually be interrupted, because at some n-th step we will get a single cluster of n-th level B (1) n that will coincide with the set of individuals U . For any individual x ∈ B (i) So, the hierarchical procedure for clustering individuals described above allows us to introduce on set U the structure of hierarchically nested clusters B (j) i . By construction, any two clusters of arbitrary levels either do not intersect, or one is contained in the other. The resulting hierarchical structure of clusters at various levels can be described by a hierarchical graph. An example of such a graph is shown in Figure 1 . Hierarchical clustering of the set of individuals U allows us to introduce the ultrametrics d (x, y) on this set. We define function d (x, y) in the following way. If individuals x and y belong to the same level i cluster, but belong to different level i − 1 subclusters, then we put d (x, y) = T 0 t i . If x = y, we will put d (x, y) = 0. Proof. To prove that function d (x, y) is ultrametric, it is sufficient to prove that any "triangle" (x, y, z), where x ∈ U , y ∈ U , z ∈ U is isosceles, and the two largest "sides" are equal. Let d (x, y) = max {d (x, y) , d (y, z) , d (x, z)}. Then by definition d (x, y), individuals x and y belong to some level i cluster B (i) , but they belong to different subclusters B y) . Suppose that inequalities d (y, z) < d (x, y) and d (x, z) < d (x, y) hold. Then each of the pairs (y, z) and (x, z) belongs to some cluster of a level smaller than i. Let approximation allows us to simplify the model significantly, while preserving its main quality propertythe hierarchical nature of the development of the epidemic. Let P I (x, t), P R (x, t), P S (x, t) be the probability that at time t, individual x is correspondingly susceptible, infected, or removed. Then we can write the following ultrametric generalization of the In t) , we get the system of equationṡ Recall the definition of p-adic number. Let Q be a field of rational numbers and let p be a fixed prime number. Any rational number x = 0 is uniquely represented as where γ is an integer, and a, b are natural numbers that are not divisible by p and have no common multipliers. The p-adic norm |x| p of number x ∈ Q is defined by the equalities |x| p = p −γ , |0| p = 0. The field of p-adic numbers Q p is defined as a completion of the field of rational numbers Q by p-adic norm |x| p . The norm on Q p induces the metric d(x, y) = |x − y| p which is ultrametric, i.e. satisfies the strong triangle inequality (2.1). We will denote: On Q p there exists a unique (up to a factor) Haar measure d p x which is invariant with respect to translations d p (x + a) = d p x. We assume that d p x is a full measure; that is, Under this hypothesis the measure d p x is unique. For more information about p-adic numbers, the padic analysis and its applications, see [11] [12] [13] . For our purposes, we can assume that number p is a natural number p = m > 2. In this case Q p is a ring of m-adic numbers Q m with the pseudo-norm |x| m , which also induces on Q m the ultrametrics d(x, y) = |x − y| p [14] . So let p ≥ 2 be a natural number. Further, let each level i cluster contains exactly p level i − 1 clusters, the number of levels is n, and the total number of individuals is N = p n . In this case, the set of individuals U can be parameterized by the factor set B n /Z p and we assume that U = B n /Z p . Alternatively, we can describe the set of individuals by B n ⊂ Q p , but assume that each individual is described by a ball of unit radius. Let D (λ) be an arbitrary non-negative non-decreasing function defined on R + and satisfies the condition D (0) = 0. Then function D |x − y| p is also an ultrametrics on B n and we can write the equations of the basic ultrametric SIR model (3.1) -(3.3) in the form: where To preserve the interpretation, we will assume that functions P S (x, t), P I (x, t), P R (x, t) lie in the class Here W 0 is the class of functions that are constant on any ball of unit radius, L 1 (B r , d p x) is the class of functions that are integrable on B r and C 1 (R + ) is the class of functions that are differentiable with respect to t. As each individual is described by a ball of unit radius. In this case, the possible non-zero values of ultrametrics (4.6) are p iα , i = 1, 2, . . . n. This means that the ratio of boundaries of an average infection contact times of individuals belonging to different maximum subclusters of level i and i + 1 clusters is In the p-adic model under consideration, we will call value r the hierarchical isolation index. We will assume that coefficient β depends on α. We fix this dependence by requiring that the average value of W of function (3.4) over all pairs of individuals, coincides with β N , where β is an infection rate of classic SIR model. In this case we have Imposing the requirement W = β N we get and β (0) = βp −n . Thus, the p-adic SIR model is characterized by the following parameters: p is the number of maximum subclusters for each cluster; n is the number of levels for hierarchical clustering; β is an infection rate of classical SIR model; γ is an removing rate; α = log r log p , where r = t i t i+1 is the ratio of the boundaries of the average times of an infectious contact of individuals belonging to different maximum subclusters of clusters of neighboring levels. Let there be a single infected individual at the initial time t = 0. This means the following choice of initial conditions of the Cauchy problem for equations (4.3)-(4.5): From the structure of equations (4.3)-(4.5) and the type of function (4.6), it follows that functions P S (x, t), P I (x, t), P R (x, t) will be constant on subsets S i = x ∈ B n /Z p : |x| p = p i for i = p-ADIC NUMBERS, ULTRAMETRIC ANALYSIS AND APPLICATIONS Vol. 12 No. 3 2020 1, 2, . . . , n. We denote by e i (x) a characteristic function of a subset of S i . Then functions P S (x, t), P I (x, t), P R (x, t) can be decomposed by the basis of functions {e 0 (x) , e 1 (x) , . . . , e n (x)}: Substituting this expansion into equations (4.3)-(4.5) we get the following systeṁ The total number of susceptible, infected, and removed individuals is Figure 2 shows the dependence of the cumulative number of infected individuals C = I + R on time in the classical and p-adic SIR models. In doing so, we have chosen p = 5 (the number of individuals in the minimal cluster), n = 8 (the number of clusters), and α = 0.5. The value of an infection rate for the classical SIR model is chosen as β = 0.3. We also choose the average time of the disease course is equal 10 days, which corresponds to value γ = 0.1. Accordingly, the reproductive ratio for the classical SIR model is R 0 = 3 (for comparison, the reproductive ratio for flu is R 0 = 1.3 ÷ 2.8, for COVID-19 R 0 = 2.2 ÷ 5.7 [15] ). 5 shows dependence of the cumulative number of infected individuals C (t) in p-adic SIR model for parameters p = 5, n = 8, β = 0.3, γ = 0.1 and different α. As it can be seen from these dependencies there must exist critical α = α 0 (p, n, β, γ) such that at α < α 0 the spread of the epidemic takes place, but at α > α 0 the epidemic does not spread. For values p = 5, n = 8, β = 0.3, γ = 0.1, we have the numerical value of α 0 ∼ 1. Unfortunately, we could not get an exact analytical expression for function α 0 (p, n, β, γ). In the p-adic SIR model, the hierarchical isolation index (4.7) can be considered as a control parameter that can be changed to control the spread of an epidemic. To illustrate, we will consider a situation where, as the epidemic grows, enforcement restrictions are introduced to redistribute the time of an infectious contact between individuals in clusters of different levels, while maintaining the average contact time is constant. In reality, the introduction of such restrictions means that an infectious contact between pairs of individuals with a small ultrametric distance must be increased, and an infectious contact between pairs of individuals with a large ultrametric distance must be decreased. In the p-adic model, the introduction or strengthening of already introduced restrictions means an increase in the α parameter. In Figure 6 , we present dependence of the cumulative number of infected individuals in the case of controlling their hierarchical isolation, i.e., increasing the value of parameter α at time t 1 = 85 from value α 1 = 0.5 to value α 2 = 0.9 with other parameters equal to p = 5, n = 8, β = 0.3, γ = 0.1. We see that taking an adequate enforcement restriction to increase the hierarchical isolation index from r 1 = 2.24 to r 2 = 4.26 reduces the cumulative number of infected by more than 2 times. In this paper, we have developed an ultrametric model of the epidemic spread of infection in the population, based on the classical SIR model. This model is also based on the concept of an infectious contact between any two individuals in the population, which determines the potential for transmission of infection from one individual to another. The formalization of this concept leads to hierarchical clustering of the population and the introduction of the ultrametric distance on a set of individuals. The ultrametric distance between individuals reflects the measure of transmission of infection between individuals and is included in the ultrametric generalization equations of the classical SIR model. Changing the ultrametric structure of the proposed model can affect the spread of the epidemic in the population. Therefore, in contrast to the classic SIR model and its modifications, the proposed model can provide recommendations for avoiding a cumulative scenario of epidemic development based on managing the process of an infectious interaction between clusters of individuals at various levels. This corresponds to a certain algorithm for isolating separate social strata of the population at the micro, meso, and macro levels. In conclusion, we note that the ultrametric model proposed in this paper is the basic model. It can be supplemented with various additional scenarios, such as the possibility of re-infection, the latent period, the time spent by individuals in the immune stage, the impact of vaccination, etc. We reserve the implementation and application of numerous extensions of the basic ultrametric model for future research. The study was supported in part by the Ministry of Education and Science of Russia by State assignment to educational and research institutions under project FSSS-2020-0014. Infection Deseases of Humans: Dynamics and Control Mathematical Epidemiology of Infectious Diseases: Model Building, Analysis and Interpretation Modeling Infectious Diseases in Humans and Animals Statistical physics of vaccination Contribution to the mathematical theory of epidemics How viruses spread among computers and people Epidemic outbreaks in complex heterogeneous networks Epidemic spreading on heterogeneous networks with identical infectivity SIR dynamics in random networks with heterogeneous connectivity On p-adic mathematical physics Adic Analysis and Mathematical Physics Ultrametric Calculus. An Introduction to p-Adic Analysis Some aspects of the m-adic analysis and its applications to $m$-adic stochastic processes High contagiousness and rapid spread of severe acute respiratory syndrome coronavirus 2