key: cord-0916931-t58tjnrs authors: Perez-Romero, Carmina A.; Tonda, Alberto; Mendoza-Maldonado, Lucero; MacSharry, John; Szafran, Joanna; Claassen, Eric; Garssen, Johan; Kraneveld, Aletta D.; Lopez-Rincon, Alejandro title: SARS-CoV-2 Omicron Variant AI-based Primers date: 2022-01-26 journal: bioRxiv DOI: 10.1101/2022.01.21.475953 sha: 1ca41e1c4e92e26b8f581c57d63117d580d3462f doc_id: 916931 cord_uid: t58tjnrs As the COVID-19 pandemic continues to affect the world, a new variant of concern, B.1.1.529 (Omicron), has been recently identified by the World Health Organization. At the time of writing, there are still no available primer sets specific to the Omicron variant, and its identification is only possible by using multiple targets, checking for specific failures, amplifying the suspect samples, and sequencing the results. This procedure is considerably time-consuming, in a situation where time might be of the essence. In this paper we use an Artificial Intelligence (AI) technique to identify a candidate primer set for the Omicron variant. The technique, based on Evolutionary Algorithms (EAs), has been already exploited in the recent past to develop primers for the B.1.1.7/Alpha variant, that have later been successfully tested in the lab. Starting from available virus samples, the technique explores the space of all possible subsequences of viral RNA, evaluating them as candidate primers. The criteria used to establish the suitability of a sequence as primer includes its frequency of appearance in samples labeled as Omicron, its absence from samples labeled as other variants, a specific range of melting temperature, and its CG content. The resulting primer set has been validated in silico and proves successful in preliminary laboratory tests. Thus, these results prove further that our technique could be established as a working template for a quick response to the appearance of new SARS-CoV-2 variants. On November 26th, 2021, the World Health Organization (WHO) declared a fifth variant of concern (VOC) 1 , the Omicron variant, belonging to the PANGO lineage as B.1.1.529 2 , and clade GR/484A in the Global Initiative on Sharing Avian Influenza Data (GISAID) 3 . The Omicron variant is characterized by the mutations 4, 5 presented in Table 1 . B.1.1.529 was first discovered in Botswana on November 11th, 2021. It was then quickly identified in South Africa three days later and identified in two cases in Hong Kong 4 . B.1.1.529 is characterized by at least 30 amino acid substitutions, three small deletions and one small insertion in the viral RNA encoding for the spike protein (S), making it the most divergent variant detected to date 4 . Importantly, half of the amino acid substitutions in B.1.1.529 are present on its receptor-binding domain (RBD), playing a key role in ACE2 binding and antibody recognition. Several mutations in this domain have been characterized in other variants, including VOCs 6, 7 and modelled associating them with increased binding affinity to ACE2 and higher rates of transmission (N501Y, Q498R, H655Y, N679K, P681H) [8] [9] [10] [11] , increase in the ability of immune escape from neutralising antibodies (K417N, N501Y, N440K, S477N, T478K, G339D) [11] [12] [13] [14] , reduction in vaccine effectiveness and increased risk of reinfections (E484, K417N, N501Y) [13] [14] [15] [16] [17] [24] [25] [26] , and early evidence has linked it to an increased risk of reinfection [15] [16] [17] 27 . Due to its highly mutated S protein, B.1.1.529 has been shown to escape the majority of SARS-CoV-2 neutralizing antibodies and retrovirals 12, 13, 28 (except for ensovibep 29 , remdesivir, molnupiravir and nirmatrelvir 30, 31 ). Furthermore, current vaccines technologies tested against B.1.1.529 seem to show lower efficacy 13, 23, 26, 28, 32, 33 , however vaccine boosters (including heterologous booster) have been shown to help reducing immune escape 31, [34] [35] [36] [37] , and universal vaccines to help fight this variant have been proposed 38, 39 . It is important to note that early data point to a reduced risk of hospital admission among Omicron infected individuals 23, 27 , and a reduced risk of severe outcomes among Omicron re-infected individuals earlier infected by the Delta variant 25 . Interestingly, a recent study found an increase in Delta variant neutralization from individuals infected with Omicron 40 , which could result in a decreased ability of reinfection with future and current variants. Due to the high concern of this variant, one of the best strategies to contain its spread is by properly measuring it. The current identification of variants is usually performed through PCR tests. As we recently showed in 41 , on barriers influencing vaccine development timelines in Covid-19 Vaccine R&D, fast and correct identification is of highest impact on Covid-19 vaccine development. In other words, the steepest barrier to innovation is "lack of knowledge concerning the pathogen target". Consequently, AI fast prediction of suitable primers relates directly to outbreak responses, as also recently discussed in 41, 42 . Differentiating between Omicron and the other variants, however, is a difficult task. In practice, the main way of detecting the Omicron variant is by running three targets, and then checking the so-called S gene dropout or S gene target failure 1 : a failure of one specific test that targets the heavily mutated S gene. The virus is then amplified and sequenced, using a procedure similar to the one followed for identifying the B.1.1.7/Alpha variant 43 . Depending on the technology adopted and the laboratory conditions in each country, the whole process can take days. Considering the amount of the people that need to be tested in order to properly assess the spread of the variant, this creates the necessity for a faster and reliable test. In addition, if 69-70 deletion is targeted in the S gene, the BA.2 sublineage does not have this mutation. Here we present the design of a primer set for the specific detection of the Omicron variant using the fast and completely automated pipeline built around deep learning and Evolutionary Algorithms (EA) techniques 44 . Using the methods presented in 44 , we find 10 different sequences from the 10 runs of the EA technique. Next, we run an analysis to find if any of the sequences presents more than one mutation. The resulting sequence GACCCACTTATGGT-GTTGGTC presents 3 characteristics mutations to Omicron variant 1 : Q498R (A23055G), N501Y (A23063T) and Y505H (T23075C), position 23,054 to 23,075 in the reference accession NC_045512.2 45 . Then, we simulated sequence GACC-CACTTATGGTGTTGGTC as a forward primer in Primer3Plus 46 using the accession EPI_ISL_6590782, resulting in a warning for High end self complementarity. To solve the issue, we increased the size of the primer by adding a bp at the end (GACCCACTTATGGTGTTGGTCA), which resulted in an acceptable primer candidate with a T m of 62.0°C. We then generated the internal probe CACCAGCAACTGTTTGTGGA and reverse primer CTGCCAAATTGTTGGAAAGG with a T m of 60.8°C and 60.5°C respectively with a product size of 208 bp. The frequency of appearance of the selected sequences is shown in Table 2 . Out of 123 available Omicron sequences, 112 presented the candidate forward primer produced by our technique. From the 11 that do not present the sequence GACCCACTTATGGTGTTGGTCA, 9 had sequencing errors and 2 (1.63%) do not possess mutation Y505H. Further validation was performed on the different Omicron sub-lineages (BA.1, BA.2, BA.3) to verify that the primers work, Table 3 . An analysis using 659 sequences, marked as complete and low coverage excluded in GISAID from BA.1 sub-lineage show 2/6 88.01% success of the forward primer and one nucleotide change in 2.28% of the sequences, while the rest have sequencing errors in the target. Laboratory testing of the generated primers, as part of the UniCoV study proves to be successful (see Table 4 ) in identifying the Omicron variant. Thus, these results prove further that the use of AI to generate specific diagnostic tests could be used as a fast measure to tackle the on-going SARS-CoV-2 pandemic and the appearance of new variants. To create the primer set, we use a technique based on Evolutionary Algorithms (EAs), that is able to find suitable sequences to identify the Omicron variant 44 . In summary, this method optimizes two integer values p, k that jointly identify the best candidate forward primer of length 21 at position p in sample k of the training set, see Figure 1 . Then gives a cost based on its frequency of appearance marked with label '1', its absence from samples with label '0' (in other words, its specificity to the 3/6 target variant), its CG content, and melting temperature, as described in more detail in 47 , where the same technique was used to generate primers for B.1.1.7 (Alpha) variant successfully. As EAs are stochastic algorithms, and can thus potentially return different candidates at each run, we run the algorithm 10 times to obtain a variety of candidate sequences. After the 10 repetitions, we check the obtained sequences for the number of mutations. Once a sequence containing more than one mutation is identified, we simulate it in Primer3Plus 46 using the accession EPI_ISL_6590782 as a reference for Omicron variant. Raw Saliva samples, approx. 1-2 ml, were collected in 5 ml screwcap containers from volunteers in the UniCoV study 48 . Following collections samples were heat inactivated at 95°C for 5 min, cooled, vortexed and then 20 ml of Saliva was added to 20 ml of Saliva Ready™ Solution in a 0.2 ml 96 well plate. The plate was vortexed and centrifuged and then heated at 62°C for 5 min, 92°C for 5 min and then cooled at 4°C. Saliva was then screened for SARS-CoV-2 (ORF1a, ORF1b and N gene) and Human RNase P using the TaqMan™ 1-Step Mutiplex SARS-CoV-2 Fast PCR Kit 2.0 on the Applied Biosystems™ QuantStudio™ 5 Real Time PCR Instrument, 96 well, 0.2-mL block (Thermo Fisher Scientific) according to the manufacturer's instructions including a Positive control. Reverse transcription 53°C for 5 min, 1 Preincubation 85°C for 5 min., Activation at 95°C 2 minutes with 40 cycles of Denaturation 95°C for 1 second and Anneal / extension 62°C for 30 seconds. Positive SARS-CoV-2 samples were subsequently screened for the presence of Omicron variant using our specific Omicron Primer Set. Briefly cDNA was synthesised using saliva from the Saliva Ready™ step above with LunaScript RT-Supermix (NEB), briefly for primer annealing for 2 min at 25°C , cDNA synthesis for 10 minutes 55°C 2 minutes and denaturation 95°C for 1 minute. qPCR using followed by second and Anneal / extension 62°C for 30 seconds. qPCR was performed using cDNA from above with the Luna Universal Probe qPCR mix (NEB), with Omicron specific primers and N1 (2019-nCoV RUO) primers/probes (Integrated DNA Technologies (IDT)) at a concentration 500 nM with probes at 250 nM(FAM-labelled). PCR conditions were denaturation at 95°C fro 1 minutes and 40 cycles of Denaturation 95°C for 15 second and Anneal / extension 60°C for 30 seconds on the Applied Biosystems™ QuantStudio™ 5 Real Time PCR Instrument, 96 well, 0.2-mL block. All PCR reactions were performed in duplicate with a technical replicate performed following initial analysis. Classification of Omicron (B.1.1.529): SARS-CoV-2 Variant of Concern A Dynamic Nomenclature Proposal for SARS-CoV-2 Lineages to Global Initiative on Sharing All Influenza Data-From Vision to Reality European Centre for Disease Prevention and Control. Implications of Emergence of the Emergence and Spread of the SARS-CoV-2 B.1.1.529 Variant of Concern (Omicron) for the EU/EEA SARS-CoV-1 variants of Concern and Variants under Investigation in England The biological and clinical significance of emerging sars-cov-2 variants Sars-cov-2 variants, spike mutations and immune escape The omicron variant increases the interactions of sars-cov-2 spike glycoprotein with ace2 Omicron and delta variant of sars-cov-2: A comparative computational study of spike protein Insertions in the sars-cov-2 spike n-terminal domain may aid covid-19 transmission Sars-cov-2 omicron spike glycoprotein receptor binding domain exhibits super-binder ability with ace2 but not convalescent monoclonal antibody B. 1.1. 529 escapes the majority of sars-cov-2 neutralizing antibodies of diverse epitopes An opendata portal to share covid-19 drug repurposing data in real time Predictions of the sars-cov-2 omicron variant (b. 1.1. 529) spike protein receptor-binding domain structure and neutralizing antibody interactions Increased risk of sars-cov-2 reinfection associated with emergence of the omicron variant in south africa Science brief: Omicron (b.1.1.529) variant Comparative analysis of antibody responses from covid-19 convalescents receiving various vaccines reveals consistent high neutralizing activity for sars-cov-2 variant of concern omicron SARS-CoV-2 variants of concern and variants under investigation in England: Technical briefing COVID-19 Variant of Concern Omicron (B.1.1.529): Risk Assessment Sars-cov-2 b. 1.1. 529 (omicron) variant-united states HKUMed finds Omicron SARS-CoV-2 can infect faster and better than Delta in human bronchus but with less severe infection in lung South Africa's largest private health insurance administrator, releases at-scale, real-world analysis of Omicron outbreak based on 211 000 COVID-19 test results in South Africa SARS-CoV-2 sequencing update Early assessment of the clinical severity of the sars-cov-2 omicron variant in south africa Activity of convalescent and vaccine serum against a b. 1.1. 529 variant sars-cov-2 isolate SARS-CoV-2 variants of concern and variants under investigation in England: Technical briefing Striking antibody evasion manifested by the omicron variant of sars-cov-2. bioRxiv Ensovibep, a novel trispecific darpin candidate that protects against sars-cov-2 variants Remdesivir, molnupiravir and nirmatrelvir remain active against sars-cov-2 omicron and other variants of concern Sars-cov-2 omicron neutralization by therapeutic antibodies, convalescent sera, and post-mrna vaccine booster Effectiveness of covid-19 vaccines against the omicron (b. 1.1. 529) variant of concern. medRxiv mrna-1273 and bnt162b2 mrna vaccines have reduced neutralizing activity against the sars-cov-2 omicron variant Homologous or heterologous booster of inactivated vaccine reduces sars-cov-2 omicron variant escape from neutralizing antibodies Divergent sars cov-2 omicron-specific t-and b-cell responses in covid-19 vaccine recipients mrna-based covid-19 vaccine boosters induce neutralizing immunity against sars-cov-2 omicron variant Sputnik light booster after sputnik v vaccination induces robust neutralizing antibody response to b. 1.1. 529 (omicron) sars-cov-2 variant Artificial intelligence predicts the immunogenic landscape of sars-cov-2 leading to universal blueprints for vaccine designs Immunoinformatics and molecular modeling approach to design universal multi-epitope vaccine for sars-cov-2 Omicron infection enhances neutralizing immunity against the delta variant Barriers influencing vaccine development timelines, identification, causal analysis, and prioritization of key barriers by kols in general and covid-19 vaccine r&d Blockchain-facilitated sharing to advance outbreak r&d Investigation of novel SARS-COV-2 variant: Variant of Concern Design of Specific Primer Sets for the Detection of B.1.1.7, B.1.351, P. 1, B. 1.617. 2 and B. 1.1. 519 Variants of SARS-CoV-2 using The Establishment of Reference Sequence for SARS-CoV-2 and Variation Analysis Primer3Plus, an Enhanced Web Interface to Primer3 Design of Specific Primer Sets for SARS-CoV-2 Variants Using Evolutionary Algorithms ALR and AT made the programming, data collection, and experiments in silico. EC, ADK and JG made the experiment and study design. JMS and JS performed the wet lab validation on these primer sequences (from samples obtained as art of the UnCoV University community screening study in UCC The used data for this manuscript, and the resulting frequency of appearance for each sequence is available in : https://github.com/steppenwolf0/omicronVariant.git