key: cord-0844461-mx2tzqig authors: Gomez, Jonatan; Prieto, Jeisson; Leon, Elizabeth; Rodríguez, Arles title: INFEKTA—An agent-based model for transmission of infectious diseases: The COVID-19 case in Bogotá, Colombia date: 2021-02-19 journal: PLoS One DOI: 10.1371/journal.pone.0245787 sha: 2a13baeb6f8b7dc75970ced9266704fea905f4c8 doc_id: 844461 cord_uid: mx2tzqig The transmission dynamics of the coronavirus—COVID-19—have challenged humankind at almost every level. Currently, research groups around the globe are trying to figure out such transmission dynamics under special conditions such as separation policies enforced by governments. Mathematical and computational models, like the compartmental model or the agent-based model, are being used for this purpose. This paper proposes an agent-based model, called INFEKTA, for simulating the transmission of infectious diseases, not only the COVID-19, under social distancing policies. INFEKTA combines the transmission dynamic of a specific disease, (according to parameters found in the literature) with demographic information (population density, age, and genre of individuals) of geopolitical regions of the real town or city under study. Agents (virtual persons) can move, according to its mobility routines and the enforced social distancing policy, on a complex network of accessible places defined over an Euclidean space representing the town or city. The transmission dynamics of the COVID-19 under different social distancing policies in Bogotá city, the capital of Colombia, is simulated using INFEKTA with one million virtual persons. A sensitivity analysis of the impact of social distancing policies indicates that it is possible to establish a ‘medium’ (i.e., close 40% of the places) social distancing policy to achieve a significant reduction in the disease transmission. Infectious diseases have a substantial impact on public health, health care, macroeconomics, and society. The availability of options to control and prevent the emergence, expansion, or resurgence of pathogens warrants continuous evaluation using different methods. Mathematical models allow characterizing both the behavior and the emergent properties of biological systems, such as the transmission of infectious disease. [1] [2] [3] . Many biological systems have been modeled in terms of complex systems since their collective behavior cannot be simply inferred from the understanding of their components [4, 5] . Computer-based algorithms are used to model properties and dynamic interactions between agents (e.g. persons, cells) or groups of agents within, and across levels of influence in complex systems [6, 7] . In general, agent-based modeling (ABM) can be used for testing theories about underlying interaction mechanics among the system's components and their resulting dynamics. It can be done by relaxing assumptions and/or altering the interaction mechanisms at the individual agent level. ABMs can increase our understanding of the mechanisms of complex dynamic systems, and the results of the simulations may be used for estimating future scenarios [8] . In the past, ABMs have been employed to address various infectious diseases such as, a bioterrorist introduction of smallpox [9] , control of tuberculosis [10] , implementation of distancing measures and antiviral prophylaxis to control H5N1 influenza A (bird flu) [11] , design of vaccination strategies for influenza [12] , devise evacuation strategies in the event of airborne contamination [13] , and curtail transmission of measles through contact tracing and quarantine [14] . In our literature review, some other novel works that include heterogeneous agents and social distancing were proposed to model COVID-19 [15, 16] . The proposed approach, called INFEKTA (Esperanto word for infectious), mainly differs from existing works in that it aims to generate individuals and a complex network of places based on the population density of a determined city including individual interaction in public transportation means. INFEKTA models the disease transition at the person level and takes into consideration individual infection disease incubation periods and evolution, medical preconditions, age, daily routines (movements from house to destination places and back, including transportation medium if required), and enforced of Non-Pharmaceutical Interventions such as social distancing policies may flatten the curve. The complex system model approach considers a system as a large number of entities (equally complex systems that have autonomous strategies and behaviors) that interact with each other in local and non-trivial ways [17] [18] [19] . This approach provides a conceptual structure (a multilevel complex network [20] ) that allows characterizing the interrelation and interaction between elements of a system and between the system and its environment [21] . In this way, a system is composed of sub-systems of second order, which in turn may be composed of subsystems of the third-order [22] . Transmission dynamics of infectious diseases are not traditionally modeled at the individual level, but at the population-level with a compartmental model. However, some recent research use agent-based modeling for doing that [23] . A compartmental model tracks changes in compartments without specifying which individuals are involved [24] and typically reflects health states relevant for transmission (e.g., susceptible, exposed, infectious, and recovered). Basically, these kinds of models represent epidemics of communicable diseases using a population-based, non-spatial approach. The conceptual framework for this approach is rooted in the general population model which divides a population into different population compartments [25] . Compartmentalization typically reflects health states relevant for transmission (e.g., susceptible, exposed, infectious, and recovered, in short SEIR), though more partitioning is possible according to age and/or other relevant host characteristics. Heterogeneous and temporal behavior is modeled through the incorporation of relevant time-dependent social mixing, community structures, and seasonality, relevant for infectious disease dynamics [26, 27] . Process dynamics are captured in transition rates, representing the rate by which an average individual transitions between compartments. Agent-based models (ABMs) are a type of computer simulation for the creation, disappearance, and movement of a finite collection of interacting individuals or agents with unique attributes regarding spatial location, physiological traits, and/or social behavior [23, 28, 29] . ABMs work bottom-up, with population-level behavior emerging from the interactions between autonomous individuals and their environment [23, 30] . They allow the history of every individual to be tracked and network structures to be explicitly represented. In general, ABMs allow: i) To introduce local interaction rules at the individual level, which closely coincide with physical and social interaction rules; (ii) To include behaviors that may be randomized at the observational level, but can be deterministic from a mathematical point of view; (iii) To incorporate a modular structure and to add information through new types of individuals or by modifying current rules; and (iv) To observe systems dynamic that could not be inferred from the examination of the rules of particular individuals [8] . When ABM is used for epidemic modeling, infectious disease transmission dynamics is expected to emerge from the interaction between local interactions between the individuals. Each individual is modeled as an agent with an internal "SEIR" state that represents its infectious disease state (severity and time in it) at any instant of time. Individuals interact between them, i.e., can infect or get infected, when they move at some instant to the same place, their SEIR states, and the infectious disease transmission rates, and how close they are (if they are in crowded places). Notice, the concept of crowded places is natural in INFEKTA and emerges from the agents' interactions (eg. if more individuals move using the same transportation routes) and individual characteristics (e.g. children go to schools.). Transmission rates are usually approximated from the rates obtained by a compartmental model but are used at the individual level, i.e., when individuals interact with each other. Our agent-based model of infectious disease propagation, called INFEKTA, consists of fivelayer components: The virtual space (for a city or town being studied) is an Euclidean complex network [20] : Nodes are places (located in some position of the 2D Euclidean space) where individuals can be at some simulation time and edges are routes (straight lines) connecting two neighbor places. • Place (Node)-A place may be of three kinds: home (where individuals live), public transportation station (PTS), and interest place (IP) i.e., school, workplace, market, and transportation terminal. IPs and PTSs are defined in terms of capacity (maximum number of individuals that can be at some simulation step time). IPs and PTSs may be restricted, during some period, to some or all individuals. Place restriction is established according to the social distancing rule that is enforced during such a period. • Neighbor (Edge)-A PTS is a neighbor to another according to the public transportation system of the city or town being studied. Homes and IPs are considered neighbors to its closest PTS in the 2D Euclidean space. No home is neighbor to any other home neither an IP is neighbor of any other IP. Finally, a home and an IP are considered neighbors if they are neighbors of the same PTS. Each individual has a Home and an IP. The closest distances are computed between each Home and IP and between these places with the closest PTS using their longitude and latitude. If the distance between a Home and IP is shorter than the distance to a PTS this Home will be connected directly to an IP instead of their closest PTS. Otherwise, the closest PTS is connected to each Home and IP respectively. Detailed information can be found in Fig 2 of Section Virtual Space Setup. Virtual time is defined in INFEKTA at two resolution levels: days for modeling the transmission dynamics of the infectious disease, and hours for modeling the moving and interaction of individuals. Therefore, if an individual gets infected more than once during the same day, INFEKTA considers all of them as a single infection event. Any individual movement is carried on the same one hour, it was started, regardless of the traveled Euclidean distance neither the length of the path (number of edges in the complex network). A virtual individual in INFEKTA is defined in terms of his/her demographic, mobility, and infectious disease state information. • • Mobility-The ability of an individual to move through space (we use the graph defining the space for determining the route as proposed in [31, 32] ). Each individual has a mobility plan for every day, plan that is carried on according to the enforced social distancing policy and her/his infectious disease state. The mobility plan is modeled in INFEKTA as a collection of simple movement plans to have i) Policy: social distancing policy required for carrying on the mobility plan; ii) Type: may be mandatory, i.e., must go to the defined interest place) or optional, i.e., any place according to individual's preferences; iii) Day: day of the week the plan is carried on, maybe every, week, weekend, Monday, . . ., Sunday; iv) Going Hour time an individual moves from Home to an IP; v) Duration in hours for coming back to home, and vi) Place: if plan type is mandatory, it is a specific place, otherwise it is an IP selected by the individual according to his/her IP preferences. As can be noticed, we just adapt the terminology from the compartmental models in epidemiology-namely, from the SEIR (Susceptible-Exposed-Infectious-Recovered) model. In INFEKTA, the infectious state of the SEIR model is divided into asymptomatic-infected, seriously-infected, and critically-infected in order to capture how age, gender, IP preferences, medical preconditions (co-morbidity), and social distancing policies can impact the evolution of the infectious disease in an individual. INFEKTA introduces both the M state since some individuals are naturally immune to or can become immune to (after recovering) to certain infectious diseases and the Dead (D) state to distinguish between recovered and dead individuals. Since rates are defined at the individual level, these rates can be defined by taking into consideration, for example, rates at the population level (obtained from a compartmental model), age, gender, and co-morbidity presented in the individual. Remember that those rates are not defined at some time scale (as in compartmental models) but define the rule determining changes in health states of individuals being close enough for interacting at the infectious disease transmission level or after some period of time being in some state. In this way, an individual can change with probability α from S state to state E if close enough to an asymptomatic-Infected (I A ) individual and will change from state I A to state I S if has been at state I A with probability θ if has been on state I A a period of time T I A . • α: is the transmission rate and incorporates the encounter rate between susceptible and infectious individuals together with the probability of transmission. • β: is the rate at which individuals move from the exposed (E) to the Asymptomatic-Infected state (I S ). It's complement (1 − β) is the rate of individuals with symptomatic cases. • γ: is the rate at which individuals move from the exposed (I A ) to the Seriously-Infected state (I S ). • θ: is the rate at which individuals move from the Seriously-Infected (I S ) to the Critically-Infected state (I C ). • ϕ: is the death rate. • ω: is the immune rate that incorporates the probability of becoming immune. • T E : Time an individual will be at the Exposed (E) state before changing to the Asymptomatic-Infected (I A ) or Seriously-Infected (I S ) states. • T I A : Time an individual will be at the Asymptomatic-Infected (I A ) state before changing to the Seriously-Infected (I C ) or Recovered (R) states. • T I S : Time an individual will be at the Seriously-Infected state (I S ) before changing to the Critically-Infected (I C ) or Recovered (R) states. • T I C : Time an individual will be at the Critically-Infected (I C ) before changing to the Dead (D) or Recovered (R) states. • T R : Time an individual will be at the Recovered state (R) before changing to the Immune (M) or Susceptible (S) states. INFEKTA can consider that two individuals were close enough for interacting at the transmission of the infectious disease if they were at the same place (home, interest place, or public transportation station) at the same time. In order to simplify this checking process, it is possible to consider that an individual just visited its home, final interest place, and both the initial and final PTSs when using the public transportation system. The social distancing policy is described in INFEKTA as a finite sequence of rules, each rule having i) Start Time: an initial day for applying the social distancing policy rule; ii) End Time: final day for ending the social distancing policy rule; iii) Level: indicates the kind of restriction applied to the mobility of persons and accesses to places, and iv) Enforce: defines the specific mobility and access restrictions of the social distancing policy. INFEKTA is used for modeling the Transmission dynamic of the COVID-19 in Bogotá city, the largest and crowded city in Colombia. Bogotá is the capital city of Colombia, its urban perimeter population is 7.412.566, is composed by 112 Zonal Planning Units (UPZ, for its acronym in spanish). Each UPZ belongs to one of the 19 urban districts in Bogotá. Also, Bogotá massive public transportation system is called Transmilenio (TM). TM is a bus-based system, which has 143 stations and moves near to 2.500.000 citizens every day. Data used in this research is third party data (public available data) and can be obtained from the referenced public sources [33, 34] . We did not have any special access or privileges to such data. Geographical information of Bogotá is used as the Euclidean space where the moving and interaction complex network is defined. Each one of the TM stations is located and added to the complex network according to the real TM system [33] . Also, the airport and the regional bus terminal are located and connected to the nearest TM station. Demographic information from 112 UPZ is used for generating in the Euclidean space interest places (Workplaces (W), markets (M), and schools (S)), homes (H), and people(P). Places are generated, in each one of the districts, following a 2D multivariate normal distribution N * (μ, S) (μ is the geographic center of the UPZ and S is the co-variance matrix defined by the points determining the perimeter of the district). The number of places in each UPZ is generated based on the population density of each UPZ according to the data available in 2017 [34] . Table 1 shows the amount of data generated for each type of place and for people, also the number of TM stations (Bus), and terminal transportation that we use in the simulation, and Table 2 shows detailed information of the number of interest places generated by UPZ. [35] ; also, the figure shows the associated complex network of connected places (nodes are places and edges are routed between places), the graph was drawn with Gephi [36] . An heterogeneous (varying gender, age, district and home) group of almost one million of individuals (998213) is generated using a stratified sampling based on the demographic information of the city for each district according to the projections to 2030 [34] . An Table 3 shows the total demographic information of virtual people. Also, a sequence of activities was assigned randomly to each individual to define a diary routine. This was done according to the person's age and the hour of the day. For example, some agents Adolescence go to school, and some agents Adult go to work. Time to start routine -going from Home to IP and return-is randomly selected in the interval from 4h and 7h returning between the 17h and 20h. Some agents may move using the PTI system and some others while going directly to its destination place. The route an individual takes is defined according to the complex network. Fig 3 shows three examples of different routines (paths over the graph) for the individuals. The explicit impact level of medical preconditions on the state of the COVID-19 dynamic is not included in this preliminary modeling. We wrapped them in the transition rates and allow modelers to change and play with different rates. Therefore, we set the initial values of these rates as shown in Table 4 . The level attribute of the social distancing rule for the COVID-19 in the virtual Bogotá city is defined as follows: • None: No restrictions to the mobility neither to access to places. • Medium: Many places and few stations are restricted (depending on the type, capacity, etc). Some type of individuals is restricted to stay at home (except those with the required mobility level). i.e., close 40% of the places. • Extreme: Few places are accessible to persons while few stations are restricted. Almost every individual is restricted to stay at home (except those with the required mobility level). i.e., close 80% of the places. The methodology (Data preprocessing; places, population, and routes assignation; network creation) is available in a Github repository, see S1 File. We run a total of 20 experiments and the results (COVID-19 dynamics, sensitive analysis, and social distance policies) show below are the mean of those experiments. Fig 4 shows the state of the COVID-19 dynamics (propagation of the virus) after 100 days when at the beginning of the simulation, just fifty (50) individuals are considered at state Asymptomatic-Infected (I A ) . Fig 4 (left) shows the full dynamics over time. Notice that, using the parameters reported in the literature, the dynamics of the COVID-19 pandemic shows one wave. As expected, the number of asymptomatic, seriously, and critically infected cases grows exponentially when a social distancing policy is not enforced . Fig 4 (right) shows the COVID-19 dynamics at specific ticks (i.e., after 34 days-February 03 and after 67 days-March 07). For each tick, a map of Bogotá with the dynamics is presented. The map shows the percentage of individuals at each state per UPZ. After 34 days, the majority of individuals have been Exposed, a bunch of individuals are in the infected states (asymptomatic, seriously, and critically) and few of them become recovered; After 67 days, all individuals have been exposed to the virus and stay in the recovered state. Clearly, the transmission dynamics of the infectious disease (in this case COVID-19) emerge from the local interaction of the individuals. We analyze the sensitivity to the infection rate (α) parameter to check the robustness of the model. Sensitivity analysis is shown in Fig 5. Notice that by increasing or decreasing the infection disease rate (Fig 5 (left) ), the peak of the transmission dynamics is reached sooner or later on time. When low infection disease rates, the number of cases is also low, reducing the impact on the economy. On the other hand, for high infection disease rates (Fig 5 (right) ) the peak is reached in an early stage, and around half of the population is on one of the infected states (Asymptomatic, Seriously, Critically). Also, we try different scenarios where each one of the social distancing policies is enforced just after 15 simulation days, see Fig 6. As can be noticed, it's evident how social distancing rules help to mitigate the exponential growth on the transmission disease dynamics (COVID- 19) , reducing the number of infectious cases (Asymptomatic, Seriously, and Critically). Interestingly, when the extreme social distancing rule (access to approximately 80% of interest places is restricted) the transmission disease dynamic displays a big second wave with more cases than the first wave. Although our intention was not to predict geographic spread for the city, we observed similarities between the total Seriously-Infected cases in INFEKTA (we assume that individuals in Seriously-Infected state are cases tested in Bogotá) and the current concentration of COVID-19 cases confirmed in Bogotá [44] , see Fig 7. The results show how UPZ with more cases found with INFEKTA matches with geographic areas with more COVID-19 cases. Then, how population distribution is generated from real density data of Bogotá, INFEKTA can be useful to explore policies by Zonal Planning Units (UPZ) or territorial divisions of a selected place providing to recommend actions for before, during, and after pandemic i.e., in planning and coordination efforts through leadership and coordination across sectors. Modeling the Transmission dynamic of an infectious disease such as the COVID-19 is not an easy task due to its highly complex nature. When using an agent-based model, several different characteristics can be modeled, for example, the demographic information of the population being studied, the set of places and the mobility of agents in the city or town under consideration, social distancing rules that may be enforced, and the special characteristics of the infectious disease being modeled. INFEKTA is an agent-based model that allows researchers to combine and study all of those characteristics. Our preliminary results modeling the transmission dynamics of the coronavirus COVID-19 in Bogotá city, the largest and crowded city in Colombia, indicate that INFEKTA may be a valuable asset for researchers and public health decision-makers for exploring future scenarios when applying different social distancing policy rules and controlling the expansion of an infectious disease. Although we are doing a rough and no so real approximation of the transmission dynamics of the COVID19, we are able to obtain similar behaviors, in our preliminary experiments, to those reported for the COVID19 in the real world. Therefore, INFEKTA may be able to provide more accurate results if its parameters are set to real ones: disease transmission rates, virus, incubation periods, comorbidity, houses, interest places, routines, population size (close to nine millions of virtual individuals for the Bogotá city). Despite the usefulness of the INFEKTA, there are some limitations. Since there are fewer people in the model (respect to real people in Bogotá), Transmilenio stations would be less crowded than expected and the model could underestimate the transmission of the disease. Further, the model does not cover individual walking from homes to PTS because contact points correspond to a complex network presented in Fig 2. Finally, the routines that each individual has in the simulation are equal day-to-day (i.e., Monday to Sunday with the same routine), this could influence the stochasticity nature of the model. Our future work will concentrate on studying the transmission of COVID-19 in Bogotá city by considering different scenarios of social distancing rules and by using more realistic information about: i) Relation between personal information and propagation rates of the COVID-19, ii) Places and routes, iii) Population size, and iv) Age and Medical preconditions. Seven challengesfor model-driven data collection in experimental and observational studies Artificial Life andTherapeutic Vaccines Against Cancers that Originate in Viruses Agent-based models in translationalsystems biology Encyclopedia of Complexity and Systems Science Complex Systems Theory andEvolution Causal thinking and complex system approachesin epidemiology. International journal of epidemiology What is a complex system Artificial Life Models in Software Containing bioterrorist smallpox Determinants of cluster distribution in the molecular epidemiology oftuberculosis. Proceedings of the National Academy of Sciences Strategies for containing an emerging influenza pandemic in Southeast Asia Protectinghealth care workers: a pandemic simulation based on Allegheny County.Influenza and other respiratory viruses Combining computational fluiddynamics and agent-based modeling: A new approach to evacuation planning Theeffect of contact investigations and public health interventions in the control andprevention of measles transmission: a simulation study Impact of nonpharmaceutical interventions (NPIs) to reduceCOVID19 mortality and healthcare demand Theeffect of travel restrictions on the spread of the 2019 novel coronavirus(COVID-19) outbreak Systems science methods in public health: dynamics,networks, and agents. Annual review of public health Topological Characterization of ComplexSystems: Using Persistent Entropy Introduction to the modeling and analysis of complex systems Random Graphs and Complex Networks Vol. I. vol. I.Cambridge Series in Statistical and Probabilistic Mathematics An introduction to multi-agent systems Hierarchical structure of biologicalsystems Lessons from a decade ofindividual-based models for infectious disease transmission: a systematic review Mathematical modelsin the evaluation of health programmes. The Lancet Containing papersof a mathematical and physical character Socialcontacts and mixing patterns relevant to the spread of infectious diseases Dynamics and control of diseases in networks withcommunity structure Modelling the epidemiology of infectious diseases for decisionanalysis. Pharmacoeconomics Modelling thetransmission of healthcare associated infections: a systematic review. BMCinfectious diseases Artificial intelligence a modern approach Towards Failure-Resistant MobileDistributed Systems Inspired by Swarm Intelligence and Trophallaxis Lecture Notes in Computer Science (including subseries Visor de Proyecciones de Población SDP Població n UPZ Bogot á An Open Source Software forExploring and Manipulating Networks An agent-based epidemic model REINA for COVID-19 to identify destructivepolicies. medRxiv Covid-19: four fifths of cases are asymptomatic, China figures indicate Estimates of the severity of COVID-19 disease Estimating the number of infections and the impact of non-pharmaceuticalinterventions on COVID-19 in 11 European countries Modellingtransmission and control of the COVID-19 pandemic in Australia Clinical characteristics anddurations of hospitalized patients with COVID-19 in Beijing: a retrospectivecohort study Baseline characteristics and outcomes of 1591 patients infected with SARS-CoV-2admitted to ICUs of the Lombardy Region Casos confirmados Bogotá D.C; 2020 Author Contributions