key: cord-1033185-babzwcvc authors: Goh, Gerard Kian-Meng; Dunker, A. Keith; Foster, James A.; Uversky, Vladimir N. title: Shell Disorder Analysis Suggests That Pangolins Offered a Window for a Silent Spread of an Attenuated SARS-CoV-2 Precursor among Humans date: 2020-08-13 journal: J Proteome Res DOI: 10.1021/acs.jproteome.0c00460 sha: d88dbdf3671c1740fe263d30dbbb6d3ef8de7e6d doc_id: 1033185 cord_uid: babzwcvc [Image: see text] A model to predict the relative levels of respiratory and fecal–oral transmission potentials of coronaviruses (CoVs) by measuring the percentage of protein intrinsic disorder (PID) of the M (Membrane) and N (Nucleoprotein) proteins in their outer and inner shells, respectively, was built before the MERS-CoV outbreak. With M(PID) = 8.6% and N(PID) = 50.2%, the 2003 SARS-CoV falls into group B, which consists of CoVs with intermediate levels of both fecal–oral and respiratory transmission potentials. Further validation of the model came with MERS-CoV (M(PID) = 9%, N(PID) = 44%) and SARS-CoV-2 (M(PID) = 5.5%, N(PID) = 48%) falling into the groups C and B, respectively. Group C contains CoVs with higher fecal–oral but lower respiratory transmission potentials. Unlike SARS-CoV, SARS-CoV-2 with M(PID) = 5.5% has one of the hardest outer shells among CoVs. Because the hard shell is able to resist the antimicrobial enzymes in body fluids, the infected person is able to shed large quantities of viral particles via saliva and mucus, which could account for the higher contagiousness of SARS-COV-2. Further searches have found that high rigidity of the outer shell is characteristic for the CoVs of burrowing animals, such as rabbits (M(PID) = 5.6%) and pangolins (M(PID) = 5–6%), which are in contact with the buried feces. A closer inspection of pangolin-CoVs from 2017 to 2019 reveals that pangolins provided a unique window of opportunity for the entry of an attenuated SARS-CoV-2 precursor into the human population in 2017 or earlier, with the subsequent slow and silent spread as a mild cold that followed by its mutations into the current more virulent form. Evidence of this lies in both the genetic proximity of the pangolin-CoVs to SARS-CoV-2 (∼90%) and differences in N disorder. A 2017 pangolin-CoV strain shows evidence of higher levels of attenuation and higher fecal–oral transmission associated with lower human infectivity via having lower N(PID) (44.8%). Our shell disorder model predicts this to be a SARS-CoV-2 vaccine strain, as lower inner shell disorder is associated with the lesser virulence in a variety of viruses. In December 2019, physicians in Wuhan, China, began to notice a large number of patients with a severe pneumonia-like illness that did not respond to antibiotics. A novel coronavirus that is closely related to the 2003 severe acute respiratory syndrome coronavirus (SARS-CoV) was quickly identified to be responsible for the illness. The virus and its disease were named SARS-CoV-2 and coronavirus disease 2019 (COVID- 19) , respectively. 1 Just as viruses closely related to SARS-CoV can be found among animals such as horseshoe bats and civet cats, 2,3 close relatives of SARS-CoV-2 were found in bats (RATG13) and pangolins. 4,5,7,10−12 In line with the debates on the actual identity of the animal intermediary of SARS-CoV-2, 8, 9, 11, 12 not only are we presenting here more evidence of the greater likelihood that pangolins served as an intermediary host, but we are also able to detect the existence of attenuated strains of CoV closely related to SARS-CoV-2. In 2011−2012, before the outbreak of the Middle Eastern respiratory syndrome (MERS)-CoV, a model that studied the protein intrinsic disorder of the CoV shells was built. 3, 13 The model measured the level of intrinsic disorder in proteins comprising the outer and inner shells of CoVs, M (membrane protein found in the outer shell) and N (nucleoprotein constituting the inner shell. 13 Upon doing so, the CoVs easily clustered into three groups based mainly on the N PID (percentage of intrinsic disorder (PID) in N protein). The model predicted that SARS-CoV (M PID = 8.6%; N PID = 50.2%) would belong to Group B and have intermediate levels of respiratory and fecal−oral transmission potentials, whereas other CoVs, such as porcine transmissible epidemic gastroenteritis virus (TGEV: M PID = 14%; N PID = 43%), were expected to be in Group C, which includes CoVs with lower respiratory but higher fecal−oral transmission potentials. 3, 13 Then the MERS-CoV came in 2012−2013, 14 which presented a great opportunity to test the validity of the disorder-based viral transmission model. The model placed MERS-CoV to the group C, 15 and, indeed, the MERS-CoV reservoir was later found to be among farm animals including camels, which are highly associated with fecal−oral transmission. 14, 16 Furthermore, MERS-CoV was found to be not easily transmissible among humans. 14, 15 Later, another opportunity to test the model came, and this time it came in the form of the COVID-19 outbreak. Models suggested that the SARS-CoV-2 has to be in group B; i.e., a group with intermediate respiratory transmission, given its N PID of 48%. 17−19 Furthermore, something else strange and puzzling was seen in this virus: i.e., it has one of the hardest outer shells in the sizable sample of a wide variety of CoVs that we had analyzed. 18 This could account for the high levels of the SARS-CoV-2 contagiousness, as a hard outer shell is likely to make the virus more resistant to the antimicrobial enzymes found in body fluids, such as saliva and mucus. 20, 21 This should be manifested by higher viral load in the saliva and mucus. In agreement with these expectations, viral loads in the mucus and saliva for SAR-CoV-2 have been observed in clinical studies to be much higher than those of SAR-CoV. 22 Contagiousness, Viral Load, and Virulence: An Enigma An interesting "competing" finding is the fact that SARS-CoV-2 Spike protein (S) binds to the host angiotensin converting enzyme-2 (ACE2) receptor more tightly than the SARS-CoV S protein by astonishing 20−30 times. 10, 23 While both "competing" findings are likely to be true, it is difficult to link the efficient binding of S to ACE2 with the high level of viral shedding as has been observed without necessitating a much higher viral load in the lungs. Why is then SARS-CoV-2 (case-fatality rate (CFR): 2−6%) not more virulent than SARS-CoV (CFR: 9− 10%) 1,3,6,19 if we assume the former's much higher viral load in the lungs, given that these two CoV are genetically close and therefore are likely to produce similar proteins. A more plausible answer would be that there could be a discrepancy between viral loads in the lungs and body fluids, as suggested by the shell disorder model. Nevertheless, the fact that SARS-CoV-2 S binds 20−30 times stronger to ACE2 than the SARS-CoV S protein 10, 23 is an important piece of the puzzle, that we will be addressing in this paper, since it is saying that SARS-CoV-2 is highly adapted to humans. Furthermore, a recent study has shown that part of the sequence pertaining to S may have come from an unrelated human enzyme. 24 If so, how did it evolve with humans? How and when did the virus enter humans? Thus far, these important questions have remained unresolved. We shall see that the data arising from the shell disorder model that combines empirical proteomic analysis with artificial intelligence (AI) have detected a unique window of opportunity, in which an attenuated precursor of SARS-CoV-2 could have entered the human population years ago and, thus, initiated a slow and silent spread before mutating to become the virulent and more contagious form as currently seen. An important concept that will be used constantly throughout this paper is protein intrinsic disorder. Protein intrinsic disorder refers to the lack of structures in parts or a whole functional protein. 25 Disorder plays roles in the molecular recognition and protein−protein/DNA/RNA/polysaccharide binding. 26, 27 There are alternative names, such as natively/naturally unfolded/unstructured/flexible proteins. 26 Multiple disorder predictors have been developed, and one of the earliest such predictors is PONDR VLXT (www.pondr. com). 28−30 This is a neural network (AI: artificial intelligence) that has been trained on the sequences of known ordered and disordered proteins. Because PONDR VLXT is highly sensitive to the local sequence peculiarities and protein−protein/DNA/ RNA interactions, 31,32 it has been highly successfully used to analyze viral proteins, especially shell proteins of a large variety of viruses, including human immunodeficiency virus (HIV), herpes simplex virus (HSV), hepatitis C virus (HCV), Nipah virus (NiV), Ebola virus (EBOV), 1918 HIN1 influenza A virus, CoVs, dengue virus (DENV), and flaviviruses, e.g., yellow fever (YFV), Zika (ZIKV). 33−42 Upon the reading of a protein sequence, PONDR VLXT will provide intrinsic disorder predisposition scores between 0 and 1 for each amino acid residue. Residues of 0.5 or above are those predicted to be disordered. 28−30 An important ratio that will be repeatedly used in this study is PID (percentage of intrinsic disorder), which is defined as the number of residues predicted to be disordered divided by the total number of residues in the protein and multiplied by 100%. The sequences were downloaded from UniProt (https://www. uniProt.org) or GenBank-NCBI (https://www.ncbi.nlm.nih. gov/protein). The sequences were used as inputs to the PONDR VLXT server (https://www.pondr.com), and both the results and sequences were downloaded into a mySQL server using a program written in JAVA. 33 The aforementioned categorization of CoVs based on disorder status of their outer and inner shells and their transmission modes is summarized in Table 1 . While the categorization is done mainly based on N PID , statistical analysis picked up a small increase in correlation between PID and levels of respiratory transmission potential when M PID (r 2 = 0.80) was added as an independent variable in addition to the already used N PID (r 2 = 0.77). 18 This basically means that the statistics is able to detect a Lower levels of respiratory transmission higher levels of fecal−oral transmission UniProt (U): https://www.uniProt.org); GenBank-NCBI (G): (https://www.ncbi.nlm.nih.gov/protein). b More details on the bat samples can be found in Table 2 . Three out of four bat-CoVs are in group B. Note: Large standard deviation can be seen for N PID as denoted by "±". c MHV sits at the borderline and is placed in group C for convenience. d More details on the pangolin samples can be found in Table 2 . Three out of four bat-CoVs are in group C. Standard deviation is denoted by "±". Journal of Proteome Research pubs.acs.org/jpr Article small contribution of M PID to the determination of the categorization of CoV as seen in Table 1 . As reiterated, SARS-CoV-2 and SARS-CoV are in the group B, which includes CoVs that have both intermediate levels of fecal−oral and respiratory transmission potentials. However, SARS-CoV-2 was also observed to have an exceptionally hard outer shell. In a previous publication, the only other CoV detected to have a harder shell than SARS-CoV-2 was HCoV-HKU1. Since then, a search has been made to uncover other CoVs with similarly low M PID values. One of these uncovered CoVs is rabbit-CoV (HKU14, see Tables 1−2) , which has M PID and N PID of 5.4% and 52.2%, respectively. The other CoVs that were uncovered to be closely related to SARS-CoV-2 are bat-RATG13 and Pangolin-CoVs. With the exception of SARS-CoV and MERS-CoV that were added for comparative purposes, Figure 1 highlights the CoVs that have the lowest M PID values in our new CoV sample. The next CoVs that have higher M PID values compared to SARS-CoV-2 are canine (Resp)-CoV (M PID = 7%) and bovine-CoV (M PID = 7.8%). An interesting note is that while pangolins and bats are generalized as to be in group C and B respectively, further details of their virus categorization can be found in Table 2 , which tells us that 3 of the 4 pangolin CoVs and only 1 of 4 bat CoV samples fall into group C, with the rest being placed to the group B. We are beginning to see an emerging pattern of PIDs that is related to the evolutionary pressures faced by the various CoVs arising A mystery immediately arises: How did SARS-CoV-2 acquire its hard outer shell? What is the evolutionary significance of a hard outer shell? An inspection of Tables 1−2 and Figure 1 suggests that the clue lies in rabbits, pangolins, and, perhaps, bats, since they have M PID values of 5.7%, 4.5−6.3%, and 4.1% (bat-RATG13 only), respectively. A more careful study, however, tells us that bat-CoVs have a wide range of M PID values (4.1− 17%), whereas all pangolin-CoVs in this sample have outer shells as hard as SARS-CoV-2. Our data are therefore telling us that the hard outer shell is likely the result of evolutionary pressures found in pangolin-CoV and rabbit-CoV but not necessarily in bat-CoV. The question is then: Why specifically rabbits and pangolins? A hint to the answer can be found in the behaviors of these animals. Pangolins are ant-eaters that dig into the ground for ant and termite meals. 44 Both rabbits and pangolins dig burrows in the ground to build their nests. As a result, they are likely to come into contact with feces that have been buried in the ground. Figure 1A ,B. Furthermore, a glance at the CoVs in group C makes us realize that pangolin-CoVs are different from CoVs of farm animals, such as the wellstudied porcine TGEV, which typically moves very rapidly among pigs via fecal−oral routes. 3, 13, 45 Because TGEV does not have to remain in the environment for a long time, it has a soft outer shell (M PID = 14%), but it clearly has greater fecal−oral potentials (N PID = 43%). In contrast, both pangolin-CoVs and rabbit-CoV are likely to remain in the soil with feces for a long time, which necessitates a harder outer shell (pangolin M PID = 4.5−6.3%; rabbit M PID = 5.7%). The hard outer shells for rabbit and pangolin CoVs are not coincidental, as they are both burrowing animals exposed to the environments, in which the virus can remain active and buried along with the feces for a long time. There are, however, obvious differences between the behaviors of pangolins and rabbits despite the fact that both live in burrows, which could increase the chances of contact with feces that had been buried in the ground. Pangolins eat ants and termites by the use of their sticky tongues, which could accidentally touch fecal matter in the ground. Rabbits, in contrast, eat leaves that are found above the ground. This is likely the reason that the model is detecting greater fecal− respiratory transmission for rabbits and greater fecal−oral transmission potential for pangolins, as seen by their N PID values of 44−48% and 52.2%, respectively ( Figure 1B) . Table 2 . Journal of Proteome Research pubs.acs.org/jpr Article evolutionary pressures. The mentioned case of pangolins and rabbits is an excellent example of this. We can see in Figure 2 that rabbits-CoV and pangolin-CoVs are genetically different but yet have similar M PID because of evolutionary pressures. Another interesting example is given by bats. Table 2 shows that bat-CoVs are very genetically diverse, especially when one compares their sequence similarities to SARS-CoV-2, with the corresponding sequence similarities of pangolin-CoVs. Only RATG13, which is closely related to SARS-CoV-2, has a very hard outer shell (M PID = 4.5%) as we can see in our bat samples (additional note: bat-RATG13 has 96% genomic similarity to SARS-CoV-2). 11, 12 Both bat M sequence similarities and M PID values (M PID = 4.5−17%) show a wide range of variability, whereas the levels of intrinsic disorder in N protein tell us a different story. Bat-CoVs are genetically diverse, as seen in Table 2 (compare N and M sequence similarities) and Figure 2B , but the N PID values are normally in the range of 46−49%. This characteristic is likely the result of the evolutionary pressure arising from the minimal levels of respiratory transmission potential necessary for the optimal infectivity among bats regardless of the CoV type. With a better understanding of differences in the evolutionary pressure arising from our knowledge of M PID and N PID values of the various CoVs, possible pathways of SARS-CoV and SARS-CoV-2 can be envisaged as seen in Figure 3 . While the phylogenetic tree suggests pathways, the M and N disorder levels provide the approximate timeline necessary for the mutations to take place based on the differences in N PID or M PID values as indicators of the evolutionary pressure. Figure 3 lists out some of the possible paths that SARS-CoV, SARS-CoV-2, and their precursors could have taken before reaching humans. We are able to observe that genetically dissimilar CoVs could have similar N PID or M PID because they evolved under comparative conditions, as we have seen in the case of M PID values of pangolin-CoVs and rabbit-CoV. A previous study has dismissed the idea that the latest pangolin strain of CoV is a direct intermediary of the current SARS-CoV-2 on the basis of a genome-wide phylogenetic study. 8 We believe that that study was incomplete for two reasons. First, only one pangolin-CoV strain was used in the analysis. Second, tools that could take into account the peculiarities of evolutionary pressure, such as the shell disorder model, were not incorporated into the study. Tables 1−2 and Figure 1 show that the various pangolin-CoVs reveal signs of fecal−oral transmission traits despite having one strain (2019) that shows close similarities to SARS-CoV-2. With their M PID values remaining low, most pangolin-CoV strains tend to have low N PID values as well, which are a hallmark of the greater fecal−oral transmission potentials. This is consistent with the mentioned behaviors of pangolins, which dig the ground for ants or termites and use their sticky tongues to trap them as food. 44 It is not hard to see that fecal materials can easily enter their meals. For this reason, the two scenarios related to pangolins as SARS-CoV-2 intermediaries are put forth with the evolutionary pressures in mind. Figure 3 illustrates path B as the most direct route, where a precursor virus enters the pangolins with N PID values between 46 and 48% and low M PID . In path B, the virus stays a short while among pangolins before moving to humans, as its current form (M PID = 5.6% and N PID = 48%). In this case, the virus cannot stay long among pangolin because doing so would force the virus to have decreased N PID values, as fecal−oral evolutionary pressures set in. Path C presents a more interesting scenario, where the precursor remains among pangolins for a long time, and thus faces evolutionary pressures toward fecal−oral transmission with the lowering of N PID. In this scenario, however, an entry into the human population necessitates a quiet and slow spread because the low N PID provides for low viral loads in both vital organs and body fluids. The shell disorder model therefore suggests that there is a good likelihood that the virus entered the human population as an attenuated strain of SARS-CoV-2. Being an attenuated strain, Journal of Proteome Research pubs.acs.org/jpr Article it could have easily spread among human without the notice of the medical community, and the virus was thus given a unique window of opportunity to evolve quietly with humans for at least a few years. Figure 2 and Table 2 tell us that while pangolin-CoV 2019 is very closely related to SARS-CoV-2 and is not likely to be an intermediary, there is a greater likelihood that an older pangolin-CoV strain, particularly a 2017 one, could have entered the human population as seen in Figure 2B . Why is then the 2019 strain and Bat-RaTG13 that closely related to SARS-CoV-1 both genetically and in disorder of M and N? It is plausible that human SARS-CoV-2 re-entered the bat and pangolin populations as suggested in Figure 3C . This is not unimaginable as both animals could have encountered feces of infected humans in form of trash with fruit leftovers and ants. This opportunity is not seen in the civet cat in the case of SARS-CoV as shown in path A in Figure 3 . There are no tell-tale signs of fecal−oral transmission potentials in the civet cat CoV that we saw in pangolins. Nor should we expect this possibility based on the behaviors of the civet cats. 3 Therefore, as seen in Figure 3A , the civet SARS-CoV must have entered the civet population to evolve within it for a moderate timespan to acquire a little more respiratory transmission potential, i.e., slight increase in N PID before entering into the human population. Apparently, SARS-CoV did not have a chance to evolve and adapt to humans before being detected by the medical community. There are also other possible intermediaries for SARS-CoV-2, such as rabbits and cows ( Figure 3D,E) . For such animal intermediaries, specific evolutionary pressures will be encountered, but no inherent fecal−oral pressures are seen in these animals, based on the high N PID values of their respective CoVs, even though rabbits have a hard outer shell comparable to that of SARS-CoV-2. In any case, SARS-CoV-2 is so genetically different from their respective CoVs that it is difficult to envisage that SARS-CoV-2 did not pick up any of CoV genetic materials along the way, if any of these animals had served as an intermediary, even if temporarily for a short time. The shell disorder model is a spinoff from its parent research that involved the study of HIV and the lack of an effective anti-HIV vaccine. It was carefully contrived in 2011−2012 from what is known about the behavior of farm animals and their CoVs, especially those of porcine. 3, 13, 45 Before the 2003 SARS outbreak, CoVs were not of medical interest, but veterinary data for animal coronaviruses were plentiful, because of their threat to the farming community. The reason that there are correlations between N disorder and mode of transmission has to do with the necessity of having a minimal viral load in the body fluids, i.e., mucus and saliva, before respiratory transmission becomes viable. 18 While our shell model provides a link between N disorder and mode of transmission, other studies have provided correlations between N disorder and viral virulence. The studies included a wide variety of related and nonrelated viruses, such as NiV, flaviviruses, DENV, and EBOV. 3,19,39−42 There are also hints of links between N disorder and virulence in SARS-CoV and SARS-CoV-2. While there is still much we need to know about SARS-CoV and SARS-CoV-2, there is much stronger evidence of correlations between virulence and inner shell disorder in a fairly large number of viruses as aforementioned. This has paved a way for a novel strategy in the development of COVID-19 vaccine using shell disorder as described in our previous paper. 19 "Trojan Horse" Immune Evasion: Quick Replication before Immune Detection We have described the correlations between inner shell disorder and virulence as a "Trojan horse" immune evasion strategy, because it involves rapid replication of the virus before the host immune system is able to detect its presence. This, however, often backfires on the virus by leading to the death of the host, since large viral loads in vital organs such as the lungs can, of course, kill the host. 3,19,39−42 The reason that inner shell disorder provides important means for the rapid replication is that inner shell plays vital role in the replication process and disorder is important for greater efficiency of protein−protein/ RNA/DNA binding. 27, 46 Inner Shell Disorder Defines More Efficient Protein−Protein/DNA/RNA Interactions and Therefore More Efficient Viral Replication The inner shells of many viruses play very similar roles. In the case of CoV N protein, it assists in the transportation of other viral proteins to regions near the host Golgi apparatus and ER (endoplasmic reticulum), where N helps with the packaging of the viral particles. 47 The packaging process could involve N condensating with the viral RNA and M. 48 Similarly in the case of EBOV, its NP (nucleoprotein) builds a structure that is involved in the transportation of viral proteins to the ER. 49 All this requires protein−protein/DNA/RNA interactions. The greater disorder provides important means for greater binding efficiency especially to host proteins/DNA/RNA. 18, 19, 50 Links among Inner Shell Disorder, Virulence, and Respiratory Transmission: Viral Loads in Body Fluids vs Vital Organs Virulence is associated with viral load in vital organs, such as the lungs, whereas respiratory transmission viability is highly dependent on the viral load in the mucus and saliva. While the viral load in vital organs is likely to correlate with the viral load in the body fluids, a heavy viral load in vital organs does not necessitate a heavy viral load in body fluids, as the latter contains antimicrobial and antiviral enzymes. 20, 21 The ability of the virus to resist such enzymes is dependent on the hardness of the outer shell. The observed greater hardness of M (5.9%) in the case of SARS-CoV-2 is likely to confer greater resistance to such antimicrobial and antiviral enzymes. This itself could account for the observed large viral load in body fluids. This is why SARS-CoV-2 is much more contagious than the 2003 SARS-CoV, which has a higher M PID of 8.6%. The question is then: Why is not the immense viral load in body fluids of COVID-19 patients not translated into the higher virulence, given that the CFR of SARS-CoV-2 is 2−6% and that of SARS-CoV being 9−10%? The answer has to do with the predicted discrepancy of viral load in the body fluids and vital organs. The shell disorder model tells us that this discrepancy comes not just from the differences in M PID values but also from the slightly lower SARS-CoV-2 N PID With all this in mind we are now ready to revisit the roles of pangolins. We have mentioned that pangolins provided a unique window of opportunity, not seen in other animals, for a SARS-CoV-2 precursor to enter the human population quietly. Upon the inspection of the various pangolin-CoV strains dating back to 2017, the shell disorder model that we have just described detected strains that resemble attenuated versions of SARS-CoV-2, given reductions in pangolin-CoV N PID values as seen in Table 2 and Figures 2C−3B . Even more interestingly, while two previous strains have N PID around 46%, which sits at the borderline of group B and C (Figure 1) , one of the 2017 strains has an N PID of 44.8% (see Table 2 and Figures 2C−3B ) that places this strain squarely into the group of CoVs with higher fecal−oral transmission, i.e., the group C. Clinical data from both veterinary and medical communities have shown that while CoVs in group C can move rapidly via fecal−oral routes in farm animals, such as pigs or camels, 3, 13, 45 it is likely to spread slowly within human populations, as it will not have sufficient viral loads to spread rapidly via respiratory modes. We have seen this in the case of MERS-CoV spread among humans. 3, 19 Attenuated SARS-CoV-2 Strains Found in 2017−2018 Pangolin Samples The shell model not only makes the case for a strain that spreads slowly within the human population, but also points at the strain that had moved into the human population as an attenuated form of virus. Again, the strain in the focus here is the 2017 strain of pangolin-CoV with the N PID of 44.8% (Table 2) . We know that this is an attenuated strain for a number of reasons. First, as seen in Table 2 , all pangolin-CoV strains are genetically close to SARS-CoV-2 (pangolin-CoV 2019, SARS-CoV, and MERS-CoV have about 90%, 80%, and 50% genetic similarity to SARS-CoV-2, respectively). Second, while SARS-CoV-2 and SARS-CoV are consistent with the inner shell attenuation theory based on the difference in their N PID values (48% vs. 50%) and CFRs (2−6% vs. 9−10%) ( Figure 1C ), the wealth of our knowledge comes from other viruses and their established correlations between their degree of virulence and inner shell disorder. The fairly large variety of viruses includes flaviviruses, DENV, NiV, and EBOV. 39−42 Summarily, we should be reminded that respiratory transmission requires a maximal viral load in mucus and saliva, whereas greater virulence comes with the higher viral load in vital organs. Attenuated viruses mutating to a virulent strain have been seen before. An excellent example of this is given by the case of polio vaccines. There are two main polio vaccine types. The first type is the Salk vaccine that is made up of inactivated viruses (proteins) that confers only short-term immunity upon several booster shots. 2,3 A second type is the Sabin vaccine, which is an attenuated virus that provides lifetime immunity to those receiving it. 2, 3 This, however, comes with a price. It has been long known that the inactivated vaccine has mutated to the virulent types. 3, 51, 52 It should therefore not be difficult to envisage an attenuated SARS-CoV-2 that had entered into the human population a few years ago before mutating into its current virulent form by acquiring greater disorder at the N protein. ■ CONCLUSIONS There is a general consensus among scientists that SARS-CoV-2 is somehow very highly adapted to humans, so much so that it is suggested that the virus must have entered the human population a long time ago. 11, 24 The mystery deepens when we ponder the question of how the virus could have evolved with humans without the notice of the medical community. One plausible answer, as suggested by the shell disorder model, involves a precursor virus entering the human population around 2017 or before. This virus was an attenuated form of SARS-CoV-2 that spread slowly, perhaps, within localized and limited communities as a mild cold before mutating to its current virulent form a few years later when the virus had sufficient time to evolve with humans. Pangolins are uniquely suited to facilitate such pathway through their behaviors that support fecal−oral transmission of viruses. This also adds to the overwhelming evidence that the SARS-CoV-2 is not man-made. As mentioned, the shell disorder model has detected attenuated SARS-CoV-2 strains based on their close genetic proximity to the current virus and their lower levels of N disorder. Of particular interest is a 2017 pangolin-CoV strain seen Table 2 with N PID of 44.8%. While the shell disorder analysis has detected at least one vaccine strain of SARS-CoV-2 among pangolins, it is needless to say that extra precaution needs to be taken, when such information is used in vaccine development. First, as already mentioned, attenuated vaccines have the tendency to mutate to its virulent types. This is an inherent risk for all attenuated vaccines. Perhaps additional mutations are needed to force the N PID to go to even lower values, so as to lessen the chances of the mentioned converting mutations. Second, we do not know if even with the N PID of 44.8% the virus is sufficiently attenuated as a vaccine strain. Currently, we do not have adequate information to address these questions, and subsequent careful animal and clinical studies are required. Fundamentals of Molecular Virology Viral Shapeshifters: Strange Behaviors of HIV and Other Viruses Evolutionary history, potential intermediate animal host, and crossspecies analyses of SARS-CoV-2 A henumonia outbreak associated with a bew coronavirus of probable bat origin Repurposing of clincially approved drugs for treatment of coronavirus 2019 in a 2019-novel coronavirus (2019 nCoV) related virus model Cross species transmission of newly identified coroanvirus 2019-nCoV Are pangolins the intermediate host of the 2019 novel coronavirus (SARS-CoV-2)? Probable Pangolin Origin of SARS-CoV-2 Associated with the COVID-19 Outbreak The proximal origin of SARS-CoV-2 Stucture and sequence reanalysis of 2019-nCoV genome refutes snakes as its intermediate host and the unique smilarity of spike protiens insertion and HIV-1 Genotype and phenotype of COVID-19: Their roles in pathogenesis Understanding viral transmission behavior via protein intrinsic disorder prediction: Coronaviruses WHO, Middle Eastern respiratory syndrome coronavirus Prediction of Intrinsic Disorder in MERS CoV/HCoV-EMC Supports a High Oral-Fecal Transmission D Identification of MERS-CoV in dromedary camels Rigidity of the outer shell predicted by a protein disorder model sheds light on COVID-19 (Wuhan-2019-nCoV) infectivity Shell disorder analysis predicts greater resilience of SARS-CoV-2 (COVID-19) outside the body and in body fluids A Novel Strategy for the Development of Vaccines for SARS-CoV-2 (COVID-19) and Other Viruses Using AI and Viral Shell Disorder. Preprints Antiviral activities in saliva Innate antimicrobial activity of nasa secretions Virological asscessmant of hospitalized patients with COVID-19 Structural basis of receptor recognition by SARS-CoV-2 SARS-CoV-2 strategically mimics proteolytic activation of human EnaC Intrinsically unstructured proteins Why are "natively unfolded" proteins unstructured under physiologic conditions? Intrinsically unstructured proteins: Re-assessing the protein structure-function paradigm Predicting protein disorder for N-, C-, and internal regions Predicting Binding Regions within Disordered Proteins Sequence complexity of disordered protein Mining alpha-helix-forming molecular recognition features with cross species sequence alignments Coupled folding and binding with alpha-helixforming molecular recognition elements Protein intrinsic disorder toolbox for comparative analysis of viral proteins Viral disorder or disordered viruses: Do viral proteins possess unique features? Protein intrinsic disorder and influenza virulence: The 1918 H1N1 and H5N1 viruses Shell disorder, immune evasion and transmission behaviors among human and animal retroviruses Correlating Flavivirus virulence and levels of intrinsic disorder in shell proteins: Protective roles vs. immune evasion Zika and flavivirus shell disorder: Virulence and fetal morbidity. Biomolecules Nipah shell disorder: Mode of tramsmission and virulence Detection of links between Ebola nucleocapsidand virulence using disorder analysis R: A Language and Environment for Statistical Computing Foraging behaviour and ecology of the Cape pangolin (Manix temminckii) in north-west Zimbabwe Antigenic relationships among procine epidemic diarrhea virus and transmissible gastroenteritis virus strains The balancing act of intrinsically disordeeered proteins enabling functions while minimizing promiscuity The coronaviruse nucleocapsid is a multifuctional protein The SARS-CoV-2 Nucleocapsid phosphoprotein forms mutually exclusive condensate with RNA and th membrane-associated M protein. bioRxiv Functional mapping of the nucleoprotein of the ebola virus Structural disorder within paramyxovirus nucleoproteins and phosphoproteins The double-edged sword: How evolution can make or break a live-attenuated virus vaccine Genotype and phenotype of COVID-19: Their roles in pathogenesis Complete contact information is available at: https://pubs.acs.org/10.1021/acs.jproteome.0c00460Author Contributions GKMG conceived the idea, collected and analyzed the data, and wrote the first draft. VNU helped with the collection and analysis of literature data and reviewed and revised the draft. AKD and JAF reviewed the manuscript and provided the resources necessary for the research. The authors declare the following competing financial interest(s): GKMG is an independent researcher and the owner of Goh's BioComputing, Singapore. GKMG has also written a book (Viral Shapeshifters: Strange Behaviors of HIV and Other Viruses) on a related subject. The authors have no other potential conflict of interests.