key: cord-1036874-nnamuph8 authors: Wong, R.C.W.; Lee, M.K.P.; Siu, G.K.H.; Lee, L.K.; Leung, J.S.L.; Leung, E.C.M.; Ho, Y.I.I.; Lai, R.W.M. title: Healthcare workers acquired COVID-19 disease from patients? An investigation by phylogenomics date: 2021-07-20 journal: J Hosp Infect DOI: 10.1016/j.jhin.2021.05.017 sha: 7dac0204e366cffb20fb2e8374a3f885e9a445e5 doc_id: 1036874 cord_uid: nnamuph8 The increasing number of coronavirus disease 2019 (COVID-19) cases in the community has posed a significant epidemic pressure on healthcare settings. When healthcare workers (HCWs) acquire COVID-19, contact tracing and epidemiological investigation might not be adequate for determining the source of transmission. Here, we report a phylogenetic investigation involving two infected HCWs and nine patients to determine whether patient-to-HCW transmission had occurred in a hospital without a previous COVID-19 outbreak. This is the first study to apply phylogenomics to investigate suspected nosocomial transmission in a region with low prevalence of COVID-19. Our results do not support the occurrence of direct patient-to-HCW transmission. The current wave of community outbreak of coronavirus disease 2019 in Hong Kong at the time of writing commenced in November 2020 and the total number of symptomatic and asymptomatic infected cases was greater than in previous waves. When there are ongoing outbreaks of COVID-19, healthcare workers (HCWs) may acquire the disease from communities or through nosocomial transmission such as patient-to-HCW and HCW-to-HCW [1e3] . To differentiate between community-acquired infection from hospital-acquired infection in HCWs, traditional epidemiological study alone may be inadequate [2] . Few studies have applied phylogenomics to investigate whether COVID-19-infected HCWs have acquired the disease in hospital [3] . Recently, two HCWs working in one of our cluster hospitals were tested positive for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) RNA. Both HCWs had direct contact with COVID-19 patients. They wore full personal protective equipment (PPE) and complied with the hospital infection control guidelines. No nosocomial COVID-19 outbreaks had been reported in this hospital since the pandemic began. A follow-up investigation to differentiate between hospital-acquired and communityacquired infection in these two HCWs was therefore important. Whole-genome sequencing (WGS) had been applied to study the genomic epidemiology of SARS-CoV-2 in hospitals with known outbreaks; infected patients/HCWs with epidemiological links and identical viral genomes would indicate that intra-hospital transmission had occurred [2e6] . Hong Kong is a densely populated city with >7.5 million residents. As of January 18 th , 2021, the number of COVID-19 cases reported by the Centre for Health Protection was 9665 and the estimated COVID-19 prevalence rate was 0.13% since the pandemic. Here we report a phylogenetic investigation on two suspected cases of hospital-acquired COVID-19 among HCWs in a regional hospital. In this study, we aimed to apply WGS to supplement epidemiological investigation to determine whether COVID-19 in these HCWs was acquired from the infected patients. To our knowledge, this is the first report to apply phylogenomic analysis in a region with low prevalence of COVID-19 to investigate whether nosocomial acquisition of COVID-19 among HCWs had occurred in a hospital which did not have an ongoing outbreak of COVID-19. In mid-January 2021, two independent HCWs had developed upper respiratory tract symptoms and were tested positive for SARS-CoV-2 RNA. These were a doctor working in an intensive care unit (ICU) and a nurse working in an isolation ward for COVID-19 patients. Both HCWs had a recent history of close contact with the same deteriorating COVID-19 patient who required ICU care. Contact tracing by the hospital infection control team had further identified a group of eight COVID-19 patients who had been managed by these two HCWs in the period between January 4 th and 14 th , 2021. From a few days to three weeks after the onset of symptoms, blood was collected from patients who were ready for discharge and clinically stable with non-detectable SARS-CoV-2 RNA or high cycle threshold (C T ) value in respiratory samples. Blood samples were tested for IgG antibodies to nucleocapsid protein with the use of Alinity i SARS-CoV-2 IgG qualitative assay (Abbott, Sligo, Ireland). Eleven respiratory specimens, including eight posterior oropharyngeal saliva (also known as deep throat saliva) and one sputum, were collected from nine patients. One posterior oropharyngeal saliva and one pooled nasopharyngeal and throat swabs were collected from the two infected HCWs. Using the automated QIAcube system (Qiagen, Hilden, Germany), RNA was extracted from these samples with the QIAamp MinElute Virus Spin Kit (Qiagen) according to the manufacturer's recommendation and tested with the TIB-Molbiol LightMix Ò SarbecoV E-gene assay (TIB-MolBiol, Berlin, Germany) as described previously [7] . For WGS of SARS-CoV-2, human DNA was removed from the RNA elutes with the use of Turbo DNA-free Kit (ThermoFisher Scientific, Waltham, MA, USA) [8] . cDNA was synthesized by using LunaScript RT SuperMix Kit (New England BioLabs Inc. (NEB), Hitchin, UK) and was then amplified by multiplex PCR with Q5 Hot Start High-Fidelity 2X Master Mix Kit (NEB) and ARTIC nCoV-2019 V3 primer sets (Integrated DNA Technologies, Coralville, IA, USA). WGS was performed using Nanopore Min-ION MK1B with flow cell R9.4.1 (FLO-MIN106D) (Oxford Nanopore Technologies, Oxford, UK) according to the ARTIC nCoV-2019 sequencing protocol v3. The consensus sequences of the samples in this study were submitted to the NCBI GenBank under the accession numbers MW768952 to MW768962. Bioinformatics, phylogenetic analysis and estimation of evolutionary rate Sequencing data were analysed by using the pipelines modified from the ARTIC network nCoV-2019 novel coronavirus bioinformatics protocol [8, 9] . In brief, filtered reads with length between 400 and 700 bp were mapped to the reference genome Wuhan-Hu-1 (GenBank no.: MN908947.3) through minimap2 version 2.17 (https://anaconda.org/bioconda/ minimap2). Variants were called by using Medaka (https:// github.com/nanoporetech/medaka) and Clair (https:// github.com/HKU-BAL/Clair) pipelines. Nucleotide substitution was numbered with reference to the Wuhan-Hu-1 genome. Novel single nucleotide polymorphism (SNP) (also known as sample-unique variants) was defined as specific nucleotide substitution in viral genome present in the corresponding sample only. A similar approach has been used to determine intra-hospital transmission of SARS-CoV-2 [2, 5] . Multiple sequence alignment for consensus genomes was performed by Clustal Omega 1.2.0 (http://www.clustal.org/omega/). PhyML 3.0 (https://github.com/stephaneguindon/phyml) was used to construct maximum-likelihood phylogenetic tree with smart model selection of GTR substitution model. 1000Â bootstrap replications were performed and Mega X version 10.0 (https:// megasoftware.net/) was used to plot phylogenetic tree and rooted on the reference genome Wuhan-Hu-1. Bayesian molecular clock analysis with Markov Chain Monte Carlo (MCMC) framework provided by BEAST version 2.6.2 was used to estimate the viral evolutionary rate in substitutions per site per year as published in our previous study [8] . (Table I) . On the other hand, sample from HCW-N harboured three other novel SNPs, namely 307 C/A, 4579T/A, and 22419C/T that were not found in the patients and HCW-D (Table I) . Patient 1 (ICU patient) shared identical viral genome with patients 2 and 3 and did not have novel SNP. Other patients (4e9) carried one or two novel SNPs that were not present in the HCWs (Table I) . The hierarchy of the phylogenetic tree suggested that patients 1e3 could be the most recent common ancestor of both HCWs and other patients (Figure 1 ). The novel mutations identified in the HCWs might have emerged over time within the new hosts [2] . However, the evolutionary rate of SARS-CoV-2 lineage B.1.36.27 was 7.74Â10 À4 substitutions per site per year (95% BCI: 6.74Â10 À4 to 8.89Â10 À4 substitutions per site per year), which is equivalent to 1.93 nucleotide changes per month. This estimation was supported by the observation that the viral genome of patient 1 remained identical to those of the locally acquired cases reported in October 2020. A similar evolutionary rate was also observed by the other studies [2, 5, 6] . As the interval of symptom onset between patients 1e3 and the HCWs was within one month (Table I) , it is unlikely that the SARS-CoV-2 variant with this evolutionary rate accumulated three mutations in such short period. The genetic distance (three to five novel SNP differences) did not support direct transmission of patient-to-HCW, though the possibility of nosocomial acquisition of COVID-19 cannot be completely excluded. The mutually exclusive novel SNPs in HCW-D and HCW-N provided phylogenetic evidence that there was no HCWto-HCW transmission as they were infected by two different variants of SARS-CoV-2. SARS-CoV-2 has relatively low genetic diversity and therefore in-depth genomic study with single nucleotide resolution is necessary in order to resolve suspected hospital-acquired transmission. The application of SNP analysis in viral genome is a stringent approach to supplement the routine contact tracing. By using the same typing method, a recent study revealed that a SARS-CoV-2 variant that harboured T470N amino acid substitution (22971C/A) in the spike (S) gene was disseminated in the catchment area of another hospital in Hong Kong in mid-December 2020 and eventually led to a nosocomial outbreak [10] . It should be noted that the genomic investigation must be integrated with the clinical and epidemiological information. Patients/HCWs having epidemiological links with no SNP difference in viral genome strongly indicated the existence of nosocomial transmission of COVID-19 [3] . However, as spontaneous mutations might emerge over time within a newly infected host, the presence of one or two nucleotide differences might not be sufficient to rule out the transmission linkage among patients/HCWs. The evolutionary rate of the circulating SARS-CoV-2 lineage could be considered as a basis to determine whether the virus is likely to accumulate a certain number of mutations over the transmission period. Large numbers of SNP difference among the cases suggested that the infections would more likely have been acquired from different sources. In our study period (January 4 th to 14 th , 2021), the total number of infected cases in this city was 450. Despite the low prevalence, it was concerning that nosocomial infection might have occurred as two HCWs became infected. Both cases would have been identified as patient-to-HCWs transmission if based on routine epidemiological investigation alone. WGS, however, provided high-resolution information to rule out this route of transmission. Our data also reassured that the current guidance for infection control measures and use of PPE protected the HCWs from contracting infection from patients and from other HCWs. WGS is therefore an indispensable technique in the investigation of suspected nosocomial transmission. The major limitation of this study was that we did not include all the community-acquired cases over the study period (14 days before date of symptom onset of HCW cases) for phylogenetic analysis. However, all the patients were epidemiologically linked with known local community cases. In order to determine whether the B.1.36.27 lineage is the prevalent lineage in our locality under the same time-period, a total of nine samples collected from known communityacquired COVID-19 patients (epidemiologically unrelated to our study) had been sequenced and were found to be B.1.36.27 lineage. Therefore, our samples may represent the local circulating variants of SARS-CoV-2 in the community with low prevalence of COVID-19. Moreover, we only provided evidence to rule out direct patient-to-HCW transmission. The definite source of their infection, however, cannot be identified. It was likely that both HCWs acquired the disease in the community. Management of a hospital-wide COVID-19 outbreak affecting patients and healthcare workers Investigation of intra-hospital SARS-CoV-2 transmission using nanopore whole genome sequencing Rapid implementation of SARS-CoV-2 sequencing to investigate cases of health-care associated COVID-19: a prospective genomic surveillance study Transmission of SARS-CoV-2 among healthcare workers and patients in a teaching hospital in the Netherlands confirmed by whole-genome sequencing Clinical utility of SARS-CoV-2 whole genome sequencing in deciphering source of infection Analysis of SARS-CoV-2 genomic epidemiology reveals disease transmission coupled to variant emergence and allelic variation Evaluation on testing of deep throat saliva and lower respiratory tract specimens with Xpert Xpress SARS-CoV-2 assay Territorywide study of early coronavirus disease outbreak nCoV-2019 novel coronavirus bioinformatics environment setup Nosocomial outbreak of COVID-19 by possible airborne transmission leading to a superspreading event The authors would like to thank the laboratory staff from the Department of Microbiology, PWH, Infection Control Team of NTEC, and Mr. K.L. Chan for their contributions and valuable support on this study.