key: cord-1027769-p8fmk21l authors: Alam, Aftab; Khan, Arbaaz; Imam, Nikhat; Siddiqui, Mohd Faizan; Waseem, Mohd; Malik, Md Zubbair; Ishrat, Romana title: Design of an epitope-based peptide vaccine against the SARS-CoV-2: a vaccine-informatics approach date: 2020-12-08 journal: Brief Bioinform DOI: 10.1093/bib/bbaa340 sha: ae7977331a4a6f4ee62b6054e0d5d01f1057482c doc_id: 1027769 cord_uid: p8fmk21l The recurrent and recent global outbreak of the severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) has turned into a global concern which has infected more than 42 million people all over the globe, and this number is increasing in hours. Unfortunately, no vaccine or specific treatment is available, which makes it more deadly. A vaccine-informatics approach has shown significant breakthrough in peptide-based epitope mapping and opens the new horizon in vaccine development. In this study, we have identified a total of 15 antigenic peptides [including thymus cells (T-cells) and bone marrow or bursa-derived cells] in the surface glycoprotein (SG) of SARS-CoV-2 which is nontoxic and nonallergenic in nature, nonallergenic, highly antigenic and non-mutated in other SARS-CoV-2 virus strains. The population coverage analysis has found that cluster of differentiation 4 (CD4(+)) T-cell peptides showed higher cumulative population coverage over cluster of differentiation 8 (CD8(+)) peptides in the 16 different geographical regions of the world. We identified 12 peptides ((LTDEMIAQY, WTAGAAAYY, WMESEFRVY, IRASANLAA, FGAISSVLN, VKQLSSNFG, FAMQMAYRF, FGAGAALQI, YGFQPTNGVGYQ, LPDPSKPSKR, QTQTNSPRRARS and VITPGTNTSN) that are [Formula: see text] identical with experimentally determined epitopes of SARS-CoV, and this will likely be beneficial for a quick progression of the vaccine design. Moreover, docking analysis suggested that the identified peptides are tightly bound in the groove of human leukocyte antigen molecules which can induce the T-cell response. Overall, this study allows us to determine potent peptide antigen targets in the SG on intuitive grounds, which opens up a new horizon in the coronavirus disease (COVID-19) research. However, this study needs experimental validation by in vitro and in vivo. History is witness to the fact that whenever a major disease comes, it brings tremendous destruction with itself, and these destructions are physical and economical. Previously, cholera, smallpox, bubonic plague and influenza are some of the most brutal killers in human history and killed millions of people all over the world. Nowadays, the world is scared of the coronavirus disease 2019 (COVID- 19) , and its outbreak continues to spread from China to all over the world, and we do not yet know when it will stop. It is a contagious disease caused by a SARS family virus named 'severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)', which has not been previously identified in humans. The initial symptoms of COVID-19 are dry cough, throat infection, fever, breath shortness, loss of taste or smell, rashes in skin, conjunctivitis, tiredness, muscles pain and, diarrhea; most of the individuals (about 80%) recover without any special treatment, but older people and a person with preexisting medical conditions or comorbidities (cardiovascular disease, diabetes, lungs disease and cancer) are more likely to develop a serious illness. The incubation period for the infection with SARS-CoV-2 ranges from 2 to 14 days after its exposure [1] . The transmissibility of the virus is shown by its reproductive number (R0), and it varies from area to area. The World Health Organization (WHO) has estimated R0 between 1.4 and 2.5, but some other studies have estimated it as 2.24-3.58 for COVID-19 [2, 3] . In just 9 months, the COVID-19 that originally emerged from the Wuhan province of China is posing major public health and governance challenges. The cases have now spread in 213 countries and territories around the globe. Till 24 October 2020, more than 42.5 million infected individuals, with over 1 134 940 deaths around the globe, have been reported to WHO (WHO COVID-19 Dashboard), and these numbers are rapidly increasing in hours. At present, unfortunately, no vaccine or specific treatment is available. However, the WHO listed (as per 19 Moreover, there are no chemotherapeutic agents available to curb this menace; however, few agents are being used, including natural compounds [4] [5] [6] , western medicines [7, 8] and traditional Chinese medicines (TCN) [9, 10] , which may have potential efficacy against the SARS-CoV-2. Moreover, other drugs like interferon-α (IFN-α), lopinavir/ritonavir, chloroquine phosphate, ribavirin, favipiravir, disulfiram, arbidol and hydroxychloroquine are recommended as the tentative treatments for COVID-19 [11, 12] . Currently, there is no specific treatment/medicine or vaccine to cure COVID-19, so there is an urgent need to develop new vaccines or drugs against this deadly disease. In this way, the integration of computational techniques provides a novel approach to integrating the vaccine-informatics approach for the development of vaccines. These methods had earlier been used in the development of vaccines against several diseases, including dengue [13] , malaria [14] , influenza [15] , multiple sclerosis [16] and tumor [17] . However, this approach generally works through the identification of major histocompatibility complex (MHC)-1 and II molecules and thymus cells (T-cell) epitopes (CD8 + and CD4 + ) [18] , which particularize the selection of the potential vaccine agents related to the transporter of antigen presentation (TAP) molecules [19, 20] . The beginning of 2020 has seen the emergence of the deadly COVID-19 pandemic, and currently, we are drowning with an enormous amount of articles, with their probable epitope-based peptide vaccine for COVID-19 in which the bone marrow or bursa-derived cells (B-cells) and T-cell epitopes have been analyzed, which have anticipated the possibility of antigenic epitopes which can be used to design a novel vaccine candidate against the SARS-CoV-2 [21] [22] [23] . In the current study, we have also predicted epitope-based vaccine candidates against the SARS-CoV-2 using the systematic vaccine-informatics approach. We considered surface glycoprotein (SG) of SARS-CoV-2 due to its specific functions; SARS-CoV-2 uses surface spike protein to mediate entry into the host cells. To fulfill its purpose, the SARS-CoV-2 spike binds to the receptor 'hACE2' through its receptorbinding domain (RBD) and is proteolytically triggered by human proteases [24, 25] . Notably, we not only prognosticate the most potential vaccine candidate but have also cross-checked the resemblance, congruity and the compatibility of these selected epitopes with humans to circumvent any possible risk of autoimmunity. Additionally, we had checked the resemblance of our epitopes with those which are already experimentally verified epitopes of different organisms, including SARS-CoV, which not only makes our study more precise and noteworthy but also expands our views for the vaccine-informatics approach in planning for the next global pandemic. Our work can save the time needed to screen a large number of possible epitopes compared with the experimental techniques and also guide the experimental work with high confidence of finding the desired results in vaccine development. The SG sequence (ID: QHO62112.1) was obtained from the NCBI gene bank database (https://www.ncbi.nlm.nih.gov/gene/) in the FASTA format. Additionally, we checked the sequence similarity of peptide sequences with other SG proteins of other SARS-CoV-19 isolates from different geographical regions using the Clustal Omega tool [26] to analyze the variation in epitopes sequences, which can determine whether the epitopes are conserved or have altered peptide ligands. The NETCTL_1.2 server [27] was used to identify the CD8 + Tcell peptides at a set threshold value of 0.95 to sustain the sensitivity and specificity of 0.90 and 0.95, respectively. We used all the expanded 12 MHC class-I supertypes (including A1, A2, A3, A24, A26, B7, B8, B27, B39, B44, B58 and B62) and incorporated the peptide prediction of MHC class-I binding molecules and proteasomal C-terminal cleavage with TAP transport efficiency. These results were accomplished by the weighted TAP transport efficiency matrix. Then MHC class-I binding and proteasomal cleavage efficiency were used to obtain the total scores and translate it into the sensitivity and specificity. We selected peptides as the epitope candidate on the basis of the overall score. Further, we checked the peptides binding to MHC class-I molecules by using the MHC class-I binding predictions tool [28] . The predicted output was given in units of IC 50 nM (IC 50 values < 50 nM = high affinity, IC 50 < 500 nM moderate affinity and IC 50 < 5000 nM low affinity) [29] . The MHC-NP (Naturally Processed) tool used for assesses the probability of naturally processing and binding of peptides to the given MHC molecules. This tool predicts naturally processed epitopes based on the physiochemical properties and comparison of residual position with experimentally verified epitopes [30] . Further, we identified CD4 + T-cell peptides with IC50 ≤ 100 ((IC 50 values < 50 nM = high affinity, IC 50 < 500 nM moderate affinity and IC 50 < 5000 nM low affinity) by using MHC class-II binding predictions tool [31] . The identification of B-cell peptides (epitopes) in the SG of SARS-CoV-2 was accomplished to find the potent antigen, which gives an assurance of humoral immunity. Here, we used Antibody Epitope Prediction tool [32] [37] and Parker Hydrophilicity Prediction [38] . In this study, we selected that region in the protein sequence, which was crossreferenced, and common findings were considered as the B-cell antigenic region based on the above six classical propensity scale methods. The epitope conservation defined the degree of a resemblance betwixt the peptide and query sequences. Hence, we used the Epitope Conservancy Analysis (ECA) tool [39] to find the conservancy of our peptides among the other SARS coronaviruses, bat SARS-like coronaviruses and bat coronaviruses. Additionally, the immunogenicity prediction can reveal the degree of efficiency for individual peptides to produce an immunogenic response. In our study, we used the Class I Immunogenicity [40] tool to predict the immunogenicity of the MHC class-I binding peptides. The structure of CD + 8 and CD + 4 T-cell peptides were modeled by a biased modeling method using an online server, PEP-FOLD 3.5 server, at the RPBS MOBYL portal [41] . The crystal structure of the SARS-CoV-2 spike glycoprotein (6VSB-A) was used as the reference model together with a mask representing the structure fragments for which the local conformation of this structure has been imposed. Additionally, The structure of human leukocyte antigen (HLA) molecules, including HLA-B * 53:01 (PDB ID: 1A1M) [42] , HLA-B * 44:03 (PDB ID: 4JQX) [43] and HLA-DRB1 * 01:01 (1AQD) [44] were retrieved from the Protein Data Bank (PDB) [45] . The structure of peptides and HLA molecules are depicted in Figure 1 . The population coverage analysis (PCA) tool [46] gives the idea about the probable response of each peptide in different countries of the world based on peptide-HLA data genotypic frequency. Our CD8 + T-cell (three peptides) and CD4 + T-cell (six peptides) peptides and their respective HLA alleles were used for PCA. We selected 16 geographical areas (115 countries and 21 different ethnicities grouped), which were East Asia, Northeast Asia, South Asia, Southeast Asia, Southwest Asia, Europe, East Africa, West Africa, Central Africa, North Africa, South Africa, West Indies, North America, Central America, South America and Oceania. These 16 geographical regions cover the HLA allele frequencies and associated data for different individual populations from the most popular countries. Predicted peptides versus epitope database and peptide screening for autoimmune, allergic and toxic response In this section, we used predicted peptides (including T-and B-cell peptides) to search for homologous epitopes at 80-90% identity in the Epitope database [47, 48] . The current limited accessible data is not sufficient to recognize the antigenic region in the spike proteins of SARS-CoV-2 by human immune responses. Herein, the homologous epitopes (derived from other pathogens) would expedite the evaluation of vaccine candidate immunogenicity, as well as observing the possible outcomes of mutational events and epitope escape as the virus is transmitted through human populations [49] . Based on initially full-length genomic phylogenetic analysis, the precursory studies suggested that SARS-CoV-2 is quite similar to SARS-CoV and the putatively same cell entry mechanism and human cell receptor usage. And due to conspicuous resemblance amid these viruses, the previous study that gave us a basic understanding of protective immune responses against SARS-CoV, which may potentially be leveraged to aid vaccine development against the SARS-CoV-2 [50] . This study also helps to check the peptides identity with human proteome because there is a chance of an autoimmune response due to any kind of molecular mimicry between the predicted peptides (epitopes) and the human proteome. Moreover, we checked the allergic and toxic nature of the predicted peptides using AlgPred [51] and ToxinPred [52] tools, respectively. In this study, molecular docking studies produced important information regarding the orientation pattern of the peptides in the binding groove of the HLA molecules as well as which residues are actively involved in the binding. In this study, we selected three CD8 + T-cell peptides (LTDEMIAQY, WTAGAAAYY and WMESEFRVY) and five CD4 + T-cell peptides (IRASANLAA, FGAISSVLN, VKQLSSNFG, FAMQMAYRF and FGAGAALQ) for molecular docking against the HLA molecule, including HLA-B * 53:01 (PDB ID: 1A1M), HLA-B * 44:03 (PDB ID: 4JQX) and HLA-DRB1 * 01:01 (PDB ID: 1AQD). We used the glide module [53] [54] [55] of Schrödinger suite for peptides-HLA molecules docking. All peptides were prepared using the LigPrep module of the Schrodinger suite and docked in the binding site of protein using the SP-peptide mode of Glide. Receptor grid was generated using the receptor grid generation in the Glide application by specifying the binding (active) site residues, which was identified by the SiteMap tool [56] . The docked conformers were evaluated using Glide (G) Score, and the best docked pose with lowest Glide score value was recorded for each peptide. Sequence retrieval and structure prediction Viral glycoproteins have a major role in its pathogenesis. The main goal of viral infection is to recognize and bind a receptor on the cell surface of the host. It has been considered that SG plays an important role in immunity and infection, So we have retrieved the envelope SG of SARS-CoV-19 from the NCBI gene bank database (ID: QHO62112.1). Moreover, the sequence similarity of query proteins and peptides was done with the other SG proteins of other SARS-CoV-19 isolates from the various regions of the world (including China, Columbia, Japan, Malaysia, Israel, Iran, India, Sri Lanka, Vietnam, South Korea, Pakistan, United States, Hong Kong, Taiwan, Spain, South Africa, Serbia, Greece, Nederland, France and the Czech Republic), and it was found that all the predicted peptides are conserved in all of the isolates (As per 10 August 2020) (Supplementary table S1 available online at https://academic.oup.com/bib). The NETCTL server predicted several peptides in the SARS-CoV-19 SG, but only nine most potent peptides were chosen, which have a high combinatorial score. We considered only those alleles of MHC class-I for which the peptides showed higher binding affinity (IC50 ≤ 400 nm . Proteasomes played a key role in cleaving the peptide bonds and converting the protein into a peptide. The total score of each peptide-HLA interaction was taken into consideration, and a greater score meant greater processing Table 1 . Additionally, we identified 162 CD4 + T-cell peptides (epitopes) with IC50 ≤ 100; however, only six peptides (FVSNGTHWF, IRASAN-LAA, FGAISSVLN, VKQLSSNFG, FAMQMAYRF, and FGAGAALQ) were found to interact with most of the HLA-DRB-1 molecules, and the details are given in Table 2 . The B-cell epitopes comprise of peptides which can easily be used to take the place of antigens for immunizations and antibody production. In this study, we used an amino acid scale-based method in the B-cell antibody epitope prediction tool in which we predict linear epitopes from the protein sequence. We found six B-cell linear epitopes (including LTPGDSSSGWTAG, YQAGSTPCNGV, YGFQPTNGVGYQ, VITPGT-NTSN, QTQTNSPRRARS and LPDPSKPSKR) in the surface glycoprotein of SARS-CoV-19, which may be capable of inducing the desired immune response as B-cell epitopes (Figure 2 ). The population coverage analysis calculates the expected response of the predicted peptides (including B-and T-cells) in populations from different geographical areas. In this study, PCA for 16 different geographical regions was carried out for the predicted peptides by considering all the MHC class-I and II molecules. These results suggested that the expected response of these peptides is varying for populations residing in different geographical regions, as shown in Figure 3 . The tool predicted average population coverage of 21.50% and 51.09% for MHC class-I and II binding peptide fragments, respectively. The PCA of MHC class-II binding peptide fragments of SARS-CoV-19 SG revealed maximum population coverage, for example, 75.57%, 73.61% and 72.91% for North America, Europe and East Asia populations (the details are given in Figure 4A and B). In the past, many cases have been reported for the development of autoimmune diseases, like systemic lupus anemia, mysthemia gravis (hepatitis B), multiple sclerosis (swine flu), diabetes mellitus (mumps) and Gullian Barr syndrome (influenza and oral polio vaccine) [57, 58] . To avoid autoimmunity, it becomes important to discard those peptide agents which are similar to the human proteome. So, we have mapped all the predicted peptides sequences against the human and other viruses, including SARS-CoV, which has a maximum sequence identity to SARS-CoV-2 and is the best-characterized coronavirus in respect to epitopic responses. We have identified most of the peptide sequences (including B-and T-cell peptides) that were found to be similar with the experimentally determined epitopes of SARS-CoV virus, except for two peptides (FVSNGTHWF and LTPGDSSSGWTAG) which resemble with Mus musculus and human's proteome; these peptides were eliminated from further study due to autoimmunity risk. The details of the identified peptides and their resemblance with another organism's proteome are given in Table 3 . Next, we have screened all the peptides to check their allergic and toxic nature, and all these peptides were found to be non-allergen as well as nontoxic in nature (Supplementary table S2 available online at https://academic.oup.com/bib). Figures 5 and 6 , and the docking details are given in Table 4 . The development of vaccines refers to one of the most effective and cost-effective medical and public health achievements of all time. It is a very lengthy, complex and costly process and requires the collaborative involvement of public and private sectors. In each year, vaccination programs save over 3 million lives globally. The peptide, as a choice of vaccine agent, has made the astonishing move toward vaccine design against the viruses, bacteria and cancer. The peptide vaccine is often synthetic and mimics naturally occurring proteins from pathogens, and these peptidebased vaccines have shown promising successful results in the past for diseases like malaria, dengue, multiple sclerosis and influenza. Besides these diseases, the peptide-based vaccines have also been developed against several types of cancer like colorectal cancer, myeloid leukemia and gastric cancer [59] [60] [61] . The identification and design of immunogenic peptides (epitopes) are expensive as well as time consuming process. So the vaccine-informatics approach has made it easy to identify potent peptides. In the present study, we have identified CD8 + Tcell (three peptides), CD4 + T-cell (six peptides) and B-cell linear peptides (six peptides) from the SARS-CoV-19 SG; however, we were more emphasized to study T-cell peptides because vaccines against the T-cell epitopes are more promising as they evoke a long-lasting immune response, with antigenic drift, and an antigen can easily escape the memory response of antibody [62, 63] . For the MHC class-I binding peptides, the immune responses for the top five different geographical regions (highest population coverage range 30-34%) were: South Africa: 33 and Tobago). Thus, our results suggested that the MHC class-II binding peptides may be potent vaccine agents for different populations of the globe. We know that vaccination is the utmost prevention of epidemiologic infectious diseases, but it has a low incidence of serious systemic adverse effects (like autoimmune diseases). We have predicted a total of 15 peptides (B-and T-cells),and among those, 12 peptides are very important, including LTDEMIAQY, WTAGAAAYY, WMESEFRVY, IRASANLAA, FGAISSVLN, VKQLSS-NFG, FAMQMAYRF, FGAGAALQI, YGFQPTNGVGYQ, LPDPSKPSKR, QTQTNSPRRARS and VITPGTNTSN because their peptide fragments are matching with the experimentally identified epitopes Table 3 . List of peptides which are resembled with other organisms' proteome (homologous epitopes at 80-90% identity), including SARS-CoV, human herpesvirus 4, Leishmania infantum, Bordetella pertussis, Homo sapiens and Mus musculus. The similar fragments of peptides are shown in underline and bold letter along with their source epitopes (epitope ID), antigen and organism and Mus musculus proteomes, were eliminated from the study to avoid autoimmunity risk. Besides, we did not find any resemblance of one peptide (YQAGSTPCNG) with any organism's proteome. Hence, this unique peptide can also be proposed as a candidate for further studies in the area of vaccines development, therapeutic antibodies and diagnostic tools against the SARS-CoV-2. As we know that HLA alleles are polymorphic in nature; so peptides can interact with them. In our study, MHC class-I binding peptides (8-9 residues long) are bound in the wide groove The docking studies suggested that these peptides are preferably bound into the groove of respective alleles and can induce the CD + 8 and CD + 4 T-cell immunity. In our study, integrated computational approaches have identified a total of 15 peptides (T-and B-cells) from SARS-CoV-19 SG, of which 12 peptides have resemblance with experimentally identified epitopes of SARS-CoV and other pathogens. There is no vaccine or specific treatment currently available for COVID-19, and vaccine development is a long way from being translated into practical applications. So at this crunch situation, we have suggested that the predicted peptides (epitopes) would be capable to prompt an effective immune response as a peptide vaccine against the SARS-CoV-2. Consequently, these peptides may be used for synthetic vaccine design to combat the emerging COVID-19. However, in vivo and in vitro, experimental validation is needed for their efficient use as a vaccine candidate. • As we aware, the COVID-19 infection is the current global health epidemic, with very high infection and spreading rate, which is changing the deadly global figure, hour by hour. Unluckily, no vaccines or specific treatment is available, which makes it more deadly. • This study provides knowledge about the involvement of SARS-CoV-2 spike glycoprotein in the immune pathogenesis of the virus as well as induced the protective immune response. • In this study, we have identified 15 antigenic peptides (epitopes) in SARS-CoV-2 spike glycoprotein. These epitopes are capable of evoking the T-cell immune response via interacting with the MHC class-I and II molecules, and few epitopes are also capable of inducing the B-cell immune response. • Notably, we found 12 peptides (epitopes) that have 80-90% identity with experimentally identified epitopes of SARS-CoV, and this will likely be beneficial for a quick progression of the vaccine design. • All peptides are nontoxic and nonallergenic in nature, highly antigenic and non-mutated in other SARS-CoV-2 virus strains. • Our study provides the knowledge and boosts the ongoing and struggling scientific society for designing an effective vaccine to stop the current global health emergency. The origin, transmission and clinical therapies on coronavirus disease 2019 (COVID-19) outbreak-an update on the status Preliminary estimation of the basic reproduction number of novel coronavirus 2019 to 2020: a data-driven analysis in the early phase of the outbreak Covid-19: current knowledge, disease potential, prevention and clinical advances Natural products and their derivatives against coronavirus: a review of the nonclinical and pre-clinical data Potential Natural Compounds for Preventing SARS-CoV-2 (2019-nCoV) Infection Data sharing for novel coronavirus (COVID-19) Is traditional Chinese medicine useful in the treatment of COVID-19? Traditional Chinese herbal medicine for treating novel coronavirus (COVID-19) pneumonia: protocol for a systematic review and meta-analysis. Syst Pay attention to situation of SARS-CoV-2 and TCM advantages in treatment of novel coronavirus infection Traditional Chinese medicine in the treatment of patients infected with 2019-new coronavirus (SARS-CoV-2): a review and perspective Discovering drugs to treat coronavirus disease 2019 (COVID-19) COVID-19: current knowledge, disease potential, prevention and clinical advances n silico analysis of envelope Dengue Virus-2 and envelope Dengue Virus-3 protein as the backbone of Dengue Virus tetravalent vaccine by using homology modeling method A synthetic malaria vaccine elicits a potent CD8(+) and CD4(+) T lymphocyte immune response in humans. Implications for vaccination strategies Design of a heterosubtypic epitope-based peptide vaccine fused with hemokinin-1 against influenza viruses A highly immunogenic trivalent T cell receptor peptide vaccine for multiple sclerosis Immunization with a HER-2/neu helper peptide vaccine generates HER-2/neu CD8 T-cell immunity in cancer patients Computational immunology: the coming of age Computational methods for prediction of T-cell epitopes-a framework for modelling, testing, and applications The role of the proteasome in generating cytotoxic T-cell epitopes: insights obtained from improved predictions of proteasomal cleavage Development of epitope-based peptide vaccine against novel coronavirus 2019 (SARS-COV-2): immunoinformatics approach Epitope based vaccine prediction for SARS-COV-2 by deploying immunoinformatics approach Design of a multiepitope-based peptide vaccine against the E protein of human COVID-19: an immunoinformatics approach Cell entry mechanisms of SARS-CoV-2 Characterization of spike glycoprotein of SARS-CoV-2 on virus entry and its immune crossreactivity with SARS-CoV Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega Large-scale validation of methods for cytotoxic T-lymphocyte epitope prediction Accurate approximation method for prediction of class I MHC affinities for peptides of length 8, 10 and 11 using prediction tools trained on 9mers The PSIPRED protein analysis workbench: 20 years on MHC-NP: predicting peptides naturally processed by the MHC Peptide binding predictions for HLA DR, DP and DQ molecules Prediction of the secondary structure of proteins from their amino acid sequence No title found Empirical predictions of protein conformation Induction of hepatitis a virus-neutralizing antibody by a virus-specific synthetic peptide Prediction of chain flexibility in proteins: a tool for the selection of peptide antigens A semi-empirical method for prediction of antigenic determinants on protein antigens New hydrophilicity scale derived from high-performance liquid chromatography peptide retention data: correlation of predicted surface residues with antigenicity and x-ray-derived accessible sites Development of an epitope conservancy analysis tool to facilitate the design of epitope-based diagnostics and vaccines Properties of MHC class I presented peptides that enhance immunogenicity PEP-FOLD: an updated de novo structure prediction server for both linear and disulfide bonded cyclic peptides Bound water structure and polymorphic amino acids act together to allow the binding of different peptides to MHC class I HLA-B53 HLA peptide length preferences control CD8 + T cell responses The class II MHC protein HLA-DR1 in complex with an endogenous peptide: implications for the structural basis of the specificity of peptide binding The protein data bank Predicting population coverage of T-cell epitope-based diagnostics and vaccines The immune epitope database: how data are entered and retrieved IEDB-AR: immune epitope database-analysis resource in 2019 A sequence homology and bioinformatic approach can predict candidate targets for immune responses to SARS-CoV-2 Preliminary identification of potential vaccine targets for the COVID-19 coronavirus (SARS-CoV-2) based on SARS-CoV immunological studies AlgPred: prediction of allergenic proteins and mapping of IgE epitopes In silico approach for predicting toxicity of peptides and proteins Extra precision glide: docking and scoring incorporating a model of hydrophobic enclosure for protein−ligand complexes Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy Glide: a new approach for rapid, accurate docking and scoring. 2. Enrichment factors in database screening Identifying and characterizing binding sites and assessing druggability Vaccine-induced autoimmunity Computational analysis in designing T cell epitopes enriched peptides of Ebola glycoprotein exhibiting strong binding interaction with HLA molecules Phase 2 trial of a multivalent WT1 peptide vaccine (galinpepimut-S) in acute myeloid leukemia A phase I/Ib study of OTSGC-A24 combined peptide vaccine in advanced gastric cancer Immunotherapy of established (pre)malignant disease by synthetic long peptide vaccines Mutation analysis of the cross-reactive epitopes of Japanese encephalitis virus envelope glycoprotein From ZikV genome to vaccine: in silico approach for the epitope-based peptide vaccine against Zika virus envelope glycoprotein Identification of immunodominant sites on the spike protein of severe acute respiratory syndrome (SARS) coronavirus: implication for developing SARS diagnostics and vaccines Searching immunodominant epitopes prior to epidemic: HLA class II-restricted SARS-CoV spike protein epitopes in unexposed individuals HLA DRB1 * 01:01 binding capacity of selected SARS-derived peptides Identification of an HLA-A * 0201-restricted CD8+ T-cell epitope SSp-1 of SARS-CoV spike protein SARS corona virus peptides recognized by antibodies in the sera of convalescent cases This work did not receive any specific grant from funding agencies in the public, commercial or not-for-profit sectors. Data is with the authors and will be provided on request through corresponding author. The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conf lict of interest. Supplementary data are available online at https://academic. oup.com/bib.