key: cord-0681894-8zg8ing0 authors: Wang, Pingping; Xu, Zhaochun; Zhou, Wenyang; Jin, Xiyun; Xu, Chang; Luo, Meng; Ma, Kexin; Cao, Huimin; Huang, Yan; Lin, Xiaoyu; Pang, Fenglan; Li, Yiqun; Jiang, Qinghua title: Identification of potential vaccine targets for COVID‐19 by combining single‐cell and bulk TCR sequencing date: 2021-05-21 journal: Clin Transl Med DOI: 10.1002/ctm2.430 sha: 909e0d35ad5d254b5754ac2a11747e9497893787 doc_id: 681894 cord_uid: 8zg8ing0 nan Of all the T cell types identified in this study, the proportion of Transitional CD8, Effector CD8 and CD4 CTL in COVID-19 patients were significantly higher than those in healthy controls (P = 1.91 × 10 −4 , 2.52 × 10 −2 , and 2.98 × 10 −3 , respectively), while the proportion of Naïve CD4 was significantly lower (P = 1.24 × 10 −2 ) ( Figure 1C ). Single cell repertoire analysis demonstrates that larger clonotypes exhibited a non-uniform distribution of cell types with an enrichment for cytotoxic T cells, such as transitional CD8, effector CD8 and CD4 CTL ( Figure 2 ). Interestingly, effector CD8 and CD4 CTL in our study also jointly expressed resident memory marker ZNF683 2 and tissue exit marker S1RP5 3 ( Figure 1D ). According to Ref. (4) , these cells might be cytotoxic T cells recently egressed from tissues (such as lung tissue) and reentered circulation, an observation that waits for further experimental validation. To investigate whether the TCR repertoires of the recovered patients with COVID-19 differentiate from healthy individuals, we compared the deep TCR-seq data between patients with COVID-19 and controls. At the repertoire level, T cell diversity in COVID-19 patients was significantly lower than that in controls (P < 0.0001, Figure 3A), consistent with the clonal expansion upon antigen exposure. Clustering of TCRs with similar CDR3s is an effective approach to identify antigen-specific T cells, [5] [6] [7] as TCRs sharing similar motifs from distinct individuals may also share antigen-specificity. Through TCR clustering, we detected 29 409 TCR groups (Table S4) . Interestingly, COVID-19 patients shared more TCR groups than healthy controls ( Figure 3B ). To obtain patient-specific TCRs, we searched for CDR3 groups significantly enriched in the COVID-19 cases, and identified a total of 916 groups (FDR < 0.05, Table S5 ), which were referred as 'COVID-19 TCR groups' in the downstream analysis. These groups of T cells are enriched for activated T cells, specifically, the transitional CD8, effector CD8 and CD4 CTL subtypes The identification of COVID-19 TCR groups also allowed us to uncover the candidate antigenic epitopes from the virus genome. In total, 866 9-mer peptides from 11 SARV-CoV-2 proteins were computationally predicted to bind patient HLA alleles profiled in our study (Table S6) . We examined the peptides and CDR3 groups found in multiple individuals, and identified 1602 cooccurring TCR-antigen pairs that were significantly shared by the same patients, covering 31 CDR3 groups and 114 peptides (FDR < 0.05, Table S7 ). Of these, we identified two pairs, each with a single TCR group and a single antigen (FDR < 0.001, Figure Those peptides presented by MHC-I with high affinity, significantly cooccurring with a COVID-19 TCR group are colored as red to multiple epitopes ( Figure 4A ), which demonstrated similar motifs in the TCR contact regions. 8 We next located all the 114 peptides in the virus genome, and found more than 91% were distributed in proteins ORF1ab, S, N, and ORF3a ( Figure 4B) , where ORF3a showed significant epitope enrichment (P = 0.0056, Binomial test). In summary, our analysis revealed a number of candidate peptides as promising targets for COVID-19 vaccine development. This study provides an effective solution for identifying potential antigenic peptides based on large-scale TCR repertoire and HLA typing. The combined use of singlecell and deep TCR sequencing provided us with singlecell resolution, and also enabled us to obtain millions of immune receptors. 9 However, highly abundant T cell clones may not be disease-specific. 10 With this strategy, we grouped similar TCRs to search for evidence of convergent selection in patients. Mapping these receptors to single cell data identified novel T cell phenotypes specific to recovery COVID-19 patients. In addition, with HLA genotyping, we were able to provide individualized TCR epitopes, which allowed us to investigate their associations with recurrent TCR groups across different individuals. This method led to statistically confident antigen targets and provided guidance for efficient mRNA vaccine design. We hope that our findings and immune receptor datasets will inform the development of next-generation vaccines that can better activate natural T cell immunity for COVID-19. This work was supported by finding from the National Natural Science Foundation of China (Nos. 61822108 and 62032007 to Q.J.). The authors declare no conflict of interest. Single cell transcriptome sequencing data were deposited on Zenodo (https://doi.org/10.5281/zenodo.3747336). Custom scripts in this study are available upon request to the corresponding author. Pingping Wang, Zhaochun Xu, Wenyang Zhou contributed equally as first authors. Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage Human tissue-resident memory t cells are defined by core transcriptional and functional signatures in lymphoid and mucosal sites Exit Strategies: S1p signaling and t cell migration Human CD4(+)CD103(+) cutaneous resident memory T cells are found in the circulation of healthy individuals Quantifiable predictive features define epitope-specific T cell receptor repertoires Identifying specificity groups in the T cell receptor repertoire Investigation of antigen-specific T-cell receptor clusters in human cancers How the T cell receptor sees antigen-a structural view Single cell RNA and immune repertoire profiling of COVID-19 patients reveal novel neutralizing antibody T cell receptor repertoires of mice and humans are clustered in similarity networks around conserved public CDR3 sequences Additional supporting information may be found online in the Supporting Information section at the end of the article.