key: cord-0971141-ix41128f authors: Srinivasa Rao, Arni S.R.; Krantz, Steven G. title: Mathematical Analysis and Topology of SARS-CoV-2, Bonding with Cells and Unbonding date: 2021-05-09 journal: bioRxiv DOI: 10.1101/2021.05.07.443177 sha: 7b33d4477c46fad6767cae5ceac3e616eb7a5e37 doc_id: 971141 cord_uid: ix41128f We consider the structure of the novel coronavirus (SARS-Cov-2) in terms of the number of spikes that are critical in bonding with the cells in the host. Bonding formation is considered for selection criteria with and without any treatments. Functional mappings from the discrete space of spikes and cells and their analysis are performed. We found that careful mathematical constructions help in understanding the treatment impacts, and the role of vaccines within a host. Smale’s famous 2-D horseshoe examples inspired us to create 3-D visualizations and understand the topological diffusion of spikes from one human organ to another organ. The pharma industry will benefit from such an analysis for designing efficient treatment and vaccine strategies. The structure of the virus and spikes of the novel coronavirus (SARS-CoV-2 or COVID-19) that caused the suffering during 2020-2021 is understood in this article through topological constructions. We showed how such careful visualizations help to understand the virus-cell bonding through the distribution of spikes of the SARS-CoV-2. In general, the number of spikes and distribution of the spikes across various virus particles is found to be key in the spread of SARS-CoV-2 [1, 2] , bonding of the spikes [3, 4, 5] , and in understanding the entry of the virus into key organs like the lungs [7, 8, 9, 10] . We found that such a detailed mathematical analysis will eventually assist in the careful design of vaccines and medicines. Several studies analyze the situations of inactivity of the SARS-CoV-2 and the role played by the spikes [11, 12, 13, 14, 15, 16] . Many experimental results on the spikes and their activation, bonding, and inactivation assisted in vaccine development [17, 18, 19] . The pharmaceutical and the vaccine industry would benefit from such detailed visualizations of internal structures and bonding, rate of unbonding, and the role of interventions [20, 21, 22, 23] . In spite of experimental success in identifying a set of vaccine candidates for SARS-CoV-2 and the activity of the spikes, there exist several uncertainties in measuring successful vaccine impact. Theoretically, if a spike is completely bonded by an infected cell and this bonding is executed perfectly then that should lead to a new virus. At the same time, preventing a successful bonding and breaking of the spike would leave the virus incapable of spreading. Experiments leading to the identification of spike structures and their activities need to be more accurate and our present theoretical analysis promises to be useful for assisting in the experiments. Pharmaceutical and vaccination industries need to conduct highly accurate laboratory experiments. These experiments would need to carefully understand the role played by the spikes in SARS-CoV-2. Vaccines are designed to destroy the bonding capacity of these spikes or even destroy the spikes. Mathematical mapping, identification, and analysis of the spikes responsible for virus production within a host that are analyzed in this article are highly insightful for such experiments. Our article will assist in improved design of vaccine experiments and better treatment designs that can take care of all the spikes at the time of entry into a host. Topological analysis is a rich tool and proper usage of it can help in avoiding uncertainties. In this article, we have considered the following four assumptions in our topological constructions of the SARS-CoV-2: (i) Not all virus particles in the host are participating in infecting cells; (ii) Not all the spikes in a single virus may be bonded with cells; (iii) Each spike within a virus will bond with one and only cell; (iv) An empty spike (uninfected spike) of a given virus particle can bond with another cell. This work provides original applications of topology which is one of the powerful tools of mathematical analysis [24, 25, 26, 27] . We cite here general references for the basic ideas of point-set topology. But our discrete constructions in this article are not explained in those sources. In the next section, we have described the basic topological space that we define using the number of spikes per virus particle within a host. We have provided novel usage of the mathematical analysis principles and topological constructions. A fraction of the spikes within a host in the space is allowed to get bonding with the uninfected cells. The entire structure of the space is mapped so that we can better understand the role of bonded and unbonded spikes within a host. Section 3 studies the role of treatment and vaccines in prohibiting the bonding and eliminating the infected host. Overall structure and topology of the virus and bonding/unbonding by cells within the host are described through Figure 2 .2. Let c j i (t 0 ) be the i th novel coronavirus (ç) particle within a host at time t 0 that has j number of spikes. We choose i = 1, 2, ..., n and j = 1, 2, ..., j i . Each of the spikes within the host is uniquely identified by this structure because {c 1 1 (t 0 ), c 2 1 (t 0 ), ..., c j 1 1 (t 0 )} is the distinct set of spikes of the first virus particle and so on. In general, } is the distinct set of spikes of the i th -virus particle for i = 1, 2, ..., n.. Such a construction also allows us to write the expression: The quantity Σ n i=1 Σ j i j=1 c j i (t 0 ) in (2.1) represents the total spikes in the host which are ready to bond with cells within the host. The spikes in the expression (2.1) are not yet bonded. Let S(t 0 ) be the collection of all the spikes which were not yet bonded at time t 0 . Then Let p i be the fraction of the spikes in the i th -virus particle which are bonded with uninfected cells at time t 1 . The quantity p i = 1 indicates that the i th virus particle is fully bonded with uninfected cells, and each of the j i spikes is occupied in bonding. The quantity p i < 1 indicates that some of the spikes out of j i spikes in the virus particle are unoccupied (or empty). We write where ϕ(t 1 ) is the total number of spikes in S(t 1 ) which are bonded with uninfected cells at time t 1 . The cardinality of the set S(t 1 ) is When p i (t 1 ) = 1 for all i, then ϕ(t 1 ) = |S(t 1 )| and when p i (t 1 ) < 1 for at least one i, then ϕ(t 1 ) < |S(t 1 )| . Suppose that f 1 : S(t 0 ) → S(t 1 ), where S(t 1 ) consists of the set of all spikes both bonded and unbonded. The number of spikes that were bonded during [t 0 , t 1 ] is p i (t 1 )j i and j i − p i (t 1 )j i is the number of spikes at t 1 which are not bonded with uninfected cells for i = 1, 2, ..., n. We assume an occupied spike with an uninfected cell will not be available for further bonding. So the bonded spikes at t 1 , i.e., p i (t 1 )j i , have completed their virus bonding capacity by time t 1 , and the remaining spikes available at time t 1 are j i − p i (t 1 )j i . These unbonded spikes will be available for bonding during (t 1 , t 2 ]. Let d k i be the k th bonded spike at time t 1 out of j i spikes at time t 0 for k = 1, 2, ..., p i (t 1 )j i such that The number of occupied spikes among all the virus particles during [t 0 , t 1 ] which will not be available for further bonding are If p i (t 1 ) = 1 for any i at time t 1 then that virus particle has completed the bonding role in the system. It is assumed that p i (t 1 )j i number of spikes for each i (when p i (t 1 ) < 1) would generate p i (t 1 )j i number of new virus particles available for bonding during [t 0 , t 1 ], else (when p i (t 1 ) = 1) it would generate j i number of new virus particles during the same period. See Figure 2 .1. Hence the newer virus particles produced during [t 0 , t 1 ] are n i=1 p i (t 1 )j i , so the available virus particles for bonding at time t 1 will be Not all of the new viruses at time t 1 in (2.4) may be available for bonding during (t 1 , t 2 ] if one or more of the virus particles (out of n) might have all its spikes bonded at time t 1 . Suppose that p i (t 1 )j i = j i for i = 1, 2, ..., m (m < n) and p i * (t 1 )j i * < j i * for i * = 1, 2, ..., n − m, such that Based on (2.6), the number of virus particles available at time t 1 after removing completely bonded virus particles during [t 0 , t 1 ] (i.e., those virus particles for which all of its spikes were bonded with uninfected cells, and adding new virus particles created by the n−m virus particles with available spikes for bonding during [t 0 , t 1 ]) are The total number of bonded spikes due to n−m i * =1 j i * and m i =1 j i are and, from (2.3), The total bonded during [t 0 , t 2 ] is responsible for giving birth to new viruses as described previously. The total number of remaining spikes, those |S(t 1 )| which are available at time t 1 , are the sum of (i) the number of spikes unbonded during [t 0 , t 1 ], and (ii) the number of spikes that are created due to the birth of new virus particles. The listing of the set S(t 1 ) of spikes helps in constructing the function f 2 : S(t 1 ) → S(t 2 ). Here S(t 2 ) is the set of spikes created by S(t 1 ) during (t 1 , t 2 ]. Let us list the elements of the set S(t 1 ) below: The list of unbounded spikes J 1 during [t 0 , t 1 ] is obtained as remaining spikes from the first term of (2.9) as The list of spikes A 1 available at S(t 1 ) because of the first term of the R.H.S. of (2.10) is where the a j i in (2.12) represent the j th spike of the i th virus resulting from n−m i * =1 a i * in (2.10). That is, as per the set in (2.12), there are a 1 number of spikes for the first virus, a 2 numbr of spikes for the second virus, and so on a n−m spikes for the (n − m) th virus. The list of spikes B 1 available at S(t 1 ) due to the resultant of the second term of the R.H.S. of (2.10) is where the b j i in (2.13) represent the j th spike of i th virus resulting out of (2.14) The collections S(t 0 ) and S(t 1 ) constructed above can be treated as two spaces and S(t 1 ) in (2.14) is now seen as a disconnected space. Proof. We have f 1 : S(t 0 ) → S(t 1 ), where S(t 0 ) and S(t 1 ) are the sets of all the distinct spikes at time t 0 and time t 1 . When bonding occurs during [t 0 , t 1 ], the set of all spikes at time t 1 will be S(t 1 ) as seen in (2.14) . This implies that Because of the inequality (2.15), f 1 cannot be 1-1. When the bonding does not occur then |S(t 0 )| = |S(t 1 )| and the elements of S(t 0 ) and S(t 1 ) are not different. Since we can consider c j i and the elements of s(t 0 ) are distinct, we treat here S(t 0 ) as a discrete topological space with |S(t 1 )| = n i=1 j i elements in the space S(t 0 ). Let S X (t 0 ) and S Y (t 1 ) be two subsets of S(t 0 ) such that S X (t 0 ) represents bonded spikes and S Y (t 0 ) represents unbonded spikes during [t 0 , t 1 ]. Then, by the construction of S(t 0 ), the two subsets S X (t 0 ) and S Y (t 0 ) form two disjoint subspaces of S(t 0 ). The space S(t 1 ) as well is a discrete topological space and three subsets of it J 1 , A 1 and B 1 , form three disjoint topological discrete subspaces of S(t 1 ). We define here the topological diffusion D s (t 0 ) of the space created due to newer spikes during [t 0 , t 1 ], as . Proof. We have The set D s (t 0 ) indicates the newer elements created in the combined space (S(t 0 ) ∪ S(t 1 )). The collection of elements of S(t 0 ) and S(t 1 ) in (2.17) are further expressed using (2.14) as follows: Every singleton set within S(t 0 ) is an open subset. That means that each spike in S(t 0 ) is considered as a singleton set and S X (t 0 ) and S Y (t 0 ) form two open subsets of S(t 0 ). In fact, according to discrete topology S X (t 0 ) and S Y (t 0 ) can also be treated as closed subsets (so the space is disconnected). The transformations of the space S(t 0 ) during [t 0 , t 1 ] would lead to newer spaces due to bonding (also argued as in the proof of Lemma 1). Such a creation of new topological spaces and their cardinality can be influenced with a treatment intervention at some time t for t ∈ [t 0 , t 1 ]. Treatment works in reducing the value of p i or the death rates of the virus particles c j i or both. We assume a treatment to kill the virus population (viral load within a host) would increase the mortality rate of the virus population and reduce the bonding of the uninfected cell population with SARS-CoV-2. At time t 0 the viral load would be lower and treatment during [t 0 , t 1 ] would have a higher impact on reducing the viral load than if the treatment was introduced during [t 1 , t 2 ]. We assume a longer time to introduce a treatment after time t 0 according to the longer time the virus population is restricting the virus growth. Since |S(t 0 )| < |S(t 1 )| in the absence of treatment, S(t 0 ) = S(t 1 ) can be achieved (Corollary 2) when treatments are introduced during [t 1 , t 2 ]. We assume the host will be dominated by the virus when virus growth is not controlled and the virus will be eliminated either naturally or due to treatment impact. Let p i be defined as earlier and let q i (t 1 ) be the fraction of the spikes in the i th virus which are bonded during [t 0 , t 1 ] and treatment was introduced at some time s for s ∈ (t 0 , t 1 ]. Here, 0 ≤ q i (t 1 ) < p i (t 1 ). The quantity q i (s) = 0 means there are no bonded spikes at s. The quantity q i (s) would never reach p i (t 1 ). Since the treatment would also increase the mortality rate of the c j i population, we assume that q i (t 1 ) = 0 if i th virus dies at s for s ∈ (t 0 , t 1 ] or q i (t 1 ) = 0 if no bonding with the spikes of the i th virus occurs. When q i (t 1 ) = 0, then all the spikes of the i th virus at t 0 will be available for bonding during (t 0 , t 1 ]. Proof. Given f n : S(t n−1 ) → S(t n ) for all n. When the host is vaccinated prior to t 0 the system will prohibit the spikes to get bonded with cells. Unbounded spikes and virus particles dying over time lead to the decreasing sequence Similar to the argument of the proof of the Theorem 6, we have Given Theorem 7, this sequence of inequalities will emerge: Here κ m i for m = 1, 2, ... is the remaining number of spikes unbounded during [t i−1 , t i ] for i = 1, 2, ..., n. We can create the elements similar to (2.11) to (2.13) for the periods {[t 0 , t 1 ], (t 1 , t 2 ], ..., }. Let J τ , A τ , B τ be the sets defined on the intervals {[t 0 , t 1 ], (t 1 , t 2 ], ..., } similar to J 1 ,A 1 ,B 1 which were defined from (2.11) to (2.13) for the emenets defined on the interval [t 0 , t 1 ]. The topological diffusion created until the treatment initiated at t m is Hence the diffusion of the elements created will start declining with the initiation of the treatment. The smaller the value of t m , the lesser the quantity m−1 τ =1 A τ ∪ B τ . Theorem 9. Topological structures of the virus populations and spikes will be different under vaccination and treatment of hosts even though |S(t n )| − |S(t n+1 )| → 0 as n → ∞. Proof. Suppose the treatment is initiated within a host at t m for m ≥ 1. The topological structure of spikes would first have the increasing property in (3.1), and then will start decreasing as in (3.2) . This leads to |S(t n )| − |S(t n+1 )| → 0 as n → ∞. Under a vaccinated host, as soon as the SARS-CoV-2 virus enters at t 0 , the topological structure of the spike population spread within the host obeys (3.3) , and that leads to |S(t n )| − |S(t n+1 )| → 0 as n → ∞. Hence, two topological structure described above will be different although the limiting number of spikes diminishes. 3.1. Horseshoe mapping. Inspired by Stephen Smale's original famous horseshoe example [28, 30, 31] , we have visualized a discretized version of the same idea with plastic beads in a container. Let us consider a hollow cube and fill it with plastic beads. Suppose all the beads in this cube are transferred into a horseshoe-shaped pipe. See Figure 3 .1. Note that the original horseshoe mapping is continuous and is a diffeomorphism between a square and horseshoe-shaped space. Let us imagine the size of the S(t 1 ) SARS-CoV-2 spikes are located in the throat area of a human host. Suppose during the interval (t 1 , t 2 ] these spikes are spread into the lung area. Assume that the treatment to control the virus is initiated at t 2 such that the throat area spikes are eliminated during (t 2 , t 3 ] and the number of spikes at t 3 , is the set S(t 3 ). This leads to Transformation of the number of spikes of S(t 1 ) in the throat area into number of spikes of lungs area is demonstrated in Figure 3 .2. Suppose the size of the spikes at t 1 are located in the throat area of a host. We have f 1 : S(t 0 ) → S(t 1 ). Under the no treatment assumption during (t 1 , t 2 ], we have f 2 : The topological diffusion in (3.5) gives us, Suppose the treatment for SARS-CoV-2 is introduced at t 2 such that is attained. There are now three possibilities that will arise due to (3.6): Above possibility (ii) we associate with that of Smale's horseshoe type of example and is also demonstrated in Figure 3 .2. During (t 2 , t 3 ] the number of spikes killed in the throat due to the treatment initiated at t 2 and due to the creation of the topological diffusion A 2 ∪ B 2 reaches the size |S(t 1 )| at t 3 . Then this kind of discrete topological transformation of the number of spikes at t 1 into the number of spikes at t 3 in a different location of a host is topologically visualized as an horseshoe type of example. Of course, we are aware in a true sense Smale's horseshoe is a diffeomorphism between two open spaces (a square and a a horseshoe). The current analysis of transformations of the number of spikes located at t 1 in the throat area and the number of spikes at t 3 in the lungs within a host handles the points (elements) of the space discretely. The horseshoe example transforms the points of an open square into an equivalent area horseshoe using continuous mapping. The implications of the horseshoe are plenty-for example, the squeezing and stretching of a square to a horseshoe-shaped space in the creation of hyperbolic dynamics and chaos. In our analogy, the spikes in the throat within a human host do not get transferred to the lungs because virologically spikes do not travel within the host but they grow over time under a no-treatment scenario. Only after treatment is initiated are the spikes in the throat killed and an equivalent number of newer spikes born to remain active for some time in the lungs. We imagine this phenomenon as described through Figure 3 .2 as the transformation of spikes of the throat to that of the lungs. The geometry of the horseshoe is especially meaningful for us because spikes from the throat area due to the initial infected virus population are all located in one place. Then, due to the spread of the virus over the intervals {[t 0 , t 1 ], (t 1 , t 2 ], ..., }, they diffuse into different organs which have geometrically a different shape than the throat. With the example of beads (Figure 3 .1), and assuming no scope for adding a new bead in the cube, the corresponding pipe would take the 2D horseshoe to a 3D similar-shaped pipe through discrete topology. Our spikes analogy is that the horseshoe example was built on 2D and diffusion of spikes within the human organs is imagined in 3D. Our study provides the most accurate mathematical structuring of the space of the SARS-CoV-2 virus and its spikes within the host. See the expression S(t 0 ) = j i j=1 c j i (t 0 ) for 1 ≤ i ≤ n, 1 ≤ j ≤ j i in section 2 and the clustering of spikes into bonded and unbonded spikes. The advantage of such a structure is to provide a detailed scope for mapping a spike that is available for bonding with an uninfected cell. Not a single unbonded spike will be left out in this process. The procedure also helps in tracking a bonded virus in such a way that the birth of newer virus particles through bonded spikes is monitored. See the expression S(t 1 ) = J 1 ∪ A 1 ∪ B 1 , where J 1 , A 1 , and B 1 form three disjoint topological discrete subspaces mapped out from S(t 0 ). The set J 1 emerges out of unbonded spikes in a previous time point and A 1 ∪ B 1 is the collection of spikes generated due to bonding spikes with cells at a previous time point. Our clear-cut visualization of the theoretical constructions helps in understanding the structure of the spike-cells within the space. Laboratory experiments on the virus particles and bonding are usually done on a group of viruses. Our procedure provides deeper insight for better design in conducting experiments on isolated individual viruses. Such a method will help in aggregating the virus population along with their number of spikes and measuring bonded and unbonded spikes for each virus particle. Lemma 1 provides the growth of spikes and their mapping of initial spikes that can create newer spaces within a time interval. One of the central features of our construction is the development of a new measure that we call "topological diffusion." In general topology, no such measure exists. Using this measure, one can study the growth of spikes over time. The topological diffusion introduced in this article not only identifies the new spikes that emerge but how many of those were due to virus particles that had partial bonding of the spikes. These novel ideas make our work more practically implementable in pharmaceutical and vaccine industrial experiments. We have theoretically established this value within a small interval A 1 ∪ B 1 and also over a large interval. The descriptions of A 1 and B 1 are recorded in previous paragraphs and also in section 2. Figure 2. 3 provides an example of measuring the topological diffusion. Theorem 6 provides the impact of a treatment in eliminating virus particles and Theorem 7 provides the impact of the vaccine on eliminating viruses after entry into a host. We also generalize our results over multiple time intervals and the timing of initiation of therapy. Topological diffusion constructed in the article is also associated with Stephen Smale's famous horseshoe type of example. The original example by Smale was constructed as a diffeomorphism of two open spaces, namely, a square and corresponding sized area of a horseshoe. However, the current article considered a transformation of a discrete collection of spikes in one organ of a human host into the equivalent number of spikes in a different organ. Moreover, the demonstration we provided was between two 3-D shaped organs within a human host. Such visualization of the horseshoe example is new in the literature. The study presented in this paper is original and incisive. It uses powerful mathematical techniques-most notably ideas from topology-to analyze the bonding of corona virus cells. Our emphasis on discrete topology is somewhat novel. As a result we obtain insights that will be useful in the production of new and more effective vaccines. We believe that the use of mathematical analysis in a medical context is a new and effective technique for epidemiology that will become recognized and solidly established in future work. How SARS-CoV-2 first adapted in humans SARS-CoV-2 cell entry depends on ACE2 and TMPRSS2 and is blocked by a clinically proven protease inhibitor Structural impact on SARS-CoV-2 spike protein by D614G substitution The SARS-CoV-2 spike protein: balancing stability and infectivity Structural variations in human ACE2 may influence its binding with SARS-CoV-2 spike protein Complete Mapping of Mutations to the SARS-CoV-2 Spike Receptor-Binding Domain that Escape Antibody Recognition Structural analysis of full-length SARS-CoV-2 spike protein from an advanced vaccine candidate Structure, function, and antigenicity of the SARS-CoV-2 spike glycoprotein Distinct conformational states of SARS-CoV-2 spike protein Histopathologic Changes and SARS-CoV-2 Immunostaining in the Lung of a Patient With COVID-19 The Architecture of Inactivated SARS-CoV-2 with Postfusion Spikes Revealed by Cryo-EM and Cryo-ET SARS-CoV-2 spike protein binds to bacterial lipopolysaccharide and boosts proinflammatory activity SARS-CoV-2 spike protein promotes IL-6 trans-signaling by activation of angiotensin II receptor signaling in epithelial cells Characterization of the SARS-CoV-2 S Protein: Biophysical, Biochemical, Structural, and Antigenic Analysis Human SARS CoV-2 spike protein mutations Computationally validated SARS-CoV-2 CTL and HTL Multi-Patch vaccines, designed by reverse epitomics approach, show potential to cover large ethnically distributed human population worldwide Native-like SARS-CoV-2 Spike Glycoprotein Expressed by ChAdOx1 nCoV-19/AZD1222 Vaccine Persistence and evolution of SARS-CoV-2 in an immunocompromised host SARS-CoV-2 vaccines in development Immunoinformatics characterization of SARS-CoV-2 spike glycoprotein for prioritization of epitope based multivalent peptide vaccine A candidate multi-epitope vaccine against SARS-CoV-2 Pharmacologic treatments for coronavirus disease 2019 (COVID-19): a review COVID-19, an emerging coronavirus infection: advances and prospects in designing and developing vaccines, immunotherapeutics, and therapeutics /e) Essentials of topology with applications Introduction to topology Geometric aspects of general topology Differentiable dynamical systems Finding a horseshoe on the beaches of Rio What is a Horseshoe? What is