key: cord-0922825-9kls0o3y authors: Merchuk, J. C.; Garcia-Camacho, F.; Lopez-Rosales, L. title: Infection Units: A novel approach to the modeling of COVID-19 spread date: 2021-05-04 journal: nan DOI: 10.1101/2021.05.01.21256433 sha: 4b035c5ba0c6100944f69351f55bba7f8bce9576 doc_id: 922825 cord_uid: 9kls0o3y A novel mechanistic model describing the rate of COVID-19 spread is presented, that differs conceptually from previously published deterministic models. One of its main characteristics is that the pool of infected people is not assumed to be homogeneously mixed, but rather as a passage into which individuals enter upon contagion, move within it in a plug-flow manner and leave at recovery, within a fixed time period. So, the present model differs conceptually in the way it describes the dynamics of infection. An infection unit is defined as the amount of COVID-19 virus that generates contagion, if it reaches a susceptible individual. This model separately considers various pools: symptomatic and asymptomatic infected patients; three different pools of recovered individuals; pools of assisted, hospitalized patients; the quarantined and, finally, those who died from COVID-19. The transmission of the disease from an infected person to others is described by an infection rate function, while an encounter frequency function modulates the frequency of effective encounters between the infected and the susceptible. The influence of the model parameters on the predicted results is presented. The effect of social restrictions and of quarantine policy on pandemic spread is shown. For model calibration, a set of experimental data is used. The model enables the calculation of the actual behaviour of the studied pools during pandemic spread. Keywords Symptomatic and asymptomatic COVID-19, Pandemic spread, Mechanistic models It seems that COVID-19 is one of the hardest health problems that humanity has had to deal with throughout its history, not so much because of the severity of the disease, nor its rate of spreading, but because of its global impact as the most rapidly widespread pandemic. This is the case due to the 21 st century combination of accessible, advanced transportation technology and the large volume of international travel for both business and pleasure-a blatant feature of modern consumer societies. Currently, almost all the countries in the world are engaged in trial-and-error processes, in which sanitary measures (including vaccination rate) are competing with economic activities of all kinds in battles between public health management and sustainable population maintenance (Ceylan, 2020) . There is need of a macro-model that describes, as closely as possible, the whole of this complex issue (Acemoglu et al., 2020; Alvarez et al., 2020) . Within such a macro-model, the modelling of the epidemiologic aspect and its particularities is of extreme relevance. The number of research papers published on COVID-19 is vast, and their scope is broad (Fraser et al., 2021) . Sometimes, the spread of a pandemic has tendencies that seem random. Therefore, statistical methods have been applied to predict the spread of such diseases, since they take multiple factors into account by means of time-series models, multivariate linear regressions, grey forecasting models, backpropagation neural networks, etc. However, the aforementioned statistical tools seem to be insufficient for analyzing pandemic randomness, and these models are difficult to generalize, as noted by Ceylan (2020) . He claims that the COVID-19 prevalence in several European countries may be described using variants of an autoregressive integrated moving average ("ARIMA") model, a time-series-type model. Wu et al. (Wu et al., 2020) simulated the expansion of COVID-9 across the most populated Chinese cities connected by airlines, using what they called 'a metapopulation model' with the SEIR variables (Susceptible, Exposed, Infectious, Recovered). The basic calculations applied Markov chain Monte Carlo methods. Models of this type are called 'agent-based' models because they focus on the movement of/and contact between individuals. They require laborious calculations that provide a geographical aspect to pandemic spread. Hunter et al. made a detailed comparison of equation-based models versus agent-based models (Hunter et al., 2018) . Recently Tsori and Granek (Tsori & Granek, 2020) , among others, commented that most of the deterministic models are of the SEIR-type; they also stressed the fact that a 'perfect mixing', that is 'total homogeneity' in each of the pools, is always assumed. They pursue mitigation of this limitation by formulating "a continuous spatial model based on nearest-neighbor infection kinetics," which leads to a reactiondiffusion-type description of pandemic spread. This enables the description of the spatial spread of a pandemic; their results show the two-dimensional spread across an actual geographical map. Nevertheless, their model maintains the same mathematical format as the SEIR-type models by describing the infected (I) pool as a mixed compartment, in the sense that an individual classified as 'infected' may leave this compartment independently of his/her 'external age', i.e., the time spent in it (Danckwerts, 1953) ; here, 'age' is defined, therefore, as the average time an infected individual spends in the I pool. (The significance of this point is explained below.) The Manenti et al. model describes the entire population of the world as a perfectly mixed batch reactor (Manenti et al., 2020) and reaches formulations that are equivalent to the classic SEIR or SIR models, as may be expected. Cao et al. proposed an improvement in the 6-compartment SEIR-type model that adds a pool of quarantined patients (Cao et al., 2020) . They used a time-series analysis exponential smoothing method and the ARIMAX model, often used in statistical modeling to analyze changes that occur over time. A considerable improvement in the SEIR-type models was done by Ivorra et al. (Ivorra et al., 2020) . They called it the θ-SEIHRD model, and it was based on their previously published Bi-CoDis model (Ivorra et al., 2015) . Here, they added seroprevalence as a measured . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted May 4, 2021. ; https://doi.org/10.1101 https://doi.org/10. /2021 variable, which is a very important addition to a system with a scant number of measured variables. In this work, we focus on the mechanisms of pandemic spread and present a deterministic model with a novel approach. Our model differs conceptually in the manner of description of the transmission dynamics of the disease, the key being the 'external age distribution' of the individuals exiting the I pool (Danckwerts, 1953) . Here, we try to focus on one basic concept implied in the SEIR formulation and propose an alternative. The SEIR-inspired models, in all their variants, collide with the basic observation that the duration of this illness is an almost constant number of days (Manenti et al., 2020) . The well-known SEIR models describe distinct pools of the susceptible (S), exposed (E), infectious (I), and recovered (R), and sometimes additional pools--all of them defined as completely mixed. In a completely mixed continuous system, the systemic response to a pulse disturbance will always have a bell shape (in the case of an ideal instantaneous pulse, to a descending exponential), as described in basic textbooks (Levenspiel, 1972) . This bell shape represents the 'age distribution' within the compartment; consequently, the 'age' of the individuals leaving the I pool will show a wide distribution. There would be individuals who stay in that pool a very short time, near zero, and others who stay in the pool a very long time, near infinite. This blatantly contradicts our knowledge about the behaviour of COVID-19 and other Coronaviruses that produce sicknesses with quite defined durations. Therefore, SEIR-type models fail to properly predict this viral infection mechanism, especially for shorter time periods. The I pool, as it will be defined here, has an inlet of individuals from the pool of the susceptible, S, and outlets to other pools, but has a completely different behaviour. In the terminology of process engineering, the exhibited behaviour is called 'a plug flow system', resembling a conveyor belt, transporting infected individuals. The main characteristic defining such a system is its population dynamics, i.e., the homogeneity of the 'age' of the individuals leaving the compartment. The 'age' of the elements leaving the system cluster around a certain mean 'residence time', tr, with relatively small variance. In practice, it is well known that a COVID-19-infected individual stays as such for a finite and quite defined period of time only, as estimated in the literature on the basis of experimental findings ( Bar-On et al., 2020) . Though there may be some individual variations in the case of COVID-19, this period of time seems to be consistently around 2 weeks (Liu et al., 2020) . The state of the patient changes throughout this period and may lead either to recovery or to a more severe state and then, either to a full recovery or to death. The actual events during a normal I period take place along one clear timeline in an orderly manner. This is a basic characteristic of the illness, and the SEIR models fail to describe it. Such models may fit the dynamics of infection in the case of a population pool over long periods of time but cannot describe the short-term dynamics. Here, we present an alternative approach that overcomes this weakness. A 'plug flow model' is a diametrically opposed alternative to the totally mixed compartments that characterize the SEIR-type models. In a plug flow system, all the elements that enter a compartment will leave it after residing in it a finite time, tr, which corresponds to the term 'serial interval', commonly used in epidemiology (Last, 2001) . In terms of our model, all the infected individuals will remain in this condition approximately tr days. In an ideal plug flow situation, any change in the input at t=0 will produce the same (or similar) signal at the outlet, at t=tr. In other words, the 'age' of each infected individual . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 4, 2021. ; https://doi.org/10.1101/2021.05.01.21256433 doi: medRxiv preprint will increase steadily from 0 to tr, which represents the period during which he/she was a member of the I pool. In practice, there will obviously be a certain distribution around the mean value but, in this version of the model, we disregard it. For convenience in formulating this model, a dimensionless 'residence time ' (τ=t/tr) in the I pool has been defined, as follows. The value of τ varies between 0 (input) and 1 (output), running parallel to time t. It is assumed that all the infected individuals contained within the I pool at any time (t>tr) behave similarly after being infected. The period of ttr, within the Ind pool: Equation (5) shows that, for the calculation at a time t, we must refer back to past values of Ind(t) (before t), following Eq. (4). (The precise numerical calculation procedure is detailed later on in this paper.) Note that, for t