key: cord-0840958-qmla48z2 authors: XIANG, Y.; WONG, K. C.-Y.; SO, H.-C. title: Exploring drugs and vaccines associated with altered risks and severity of COVID-19: a UK Biobank cohort study of all ATC level-4 drug categories date: 2020-12-07 journal: nan DOI: 10.1101/2020.12.05.20244426 sha: bc1394a447d8c2d194ac0aa45190d040baab68e0 doc_id: 840958 cord_uid: qmla48z2 Background: COVID-19 is a major public health concern, yet its risk factors are not well-understood and effective therapies are lacking. It remains unclear how different drugs may increase or decrease the risks of infection and severity of disease. Methods: We studied associations of prior use of all level-4 ATC drug categories (including vaccines) with COVID-19 diagnosis and outcome, based on a prospective cohort of UK Biobank (UKBB). Drug history was based on general practitioner(GP) records. Effects of prescribed medications/vaccinations on the risk of infection, severity of disease and mortality were investigated separately. Hospitalized and fatal cases were categorized as severe infection. We also considered different study designs and conducted analyses within infected patients, tested subjects and the whole population respectively, and for 5 different time-windows of prescriptions. Missing data were accounted for by multiple imputation and inverse probability weighting was employed to reduce testing bias. Multivariable logistic regression was conducted which controls for main confounders. Results: We placed a greater focus on protective associations here, as (residual) confounding by indication and comorbidities tends to bias towards harmful effects. Across all categories, statins showed the strongest and most consistent protective associations. Significant protective effects against severe infection were seen among infected subjects (OR for prescriptions within a 12-month window, same below: 0.50, 95% CI:0.42-0.60), tested subjects (OR=0.63, 0.54-0.73) or in the general population (OR=0.49, 0.42-0.57). A number of top-listed drugs with protective effects were also cardiovascular medications, such as angiotensin converting enzyme inhibitors, angiotensin receptor blockers, calcium channel blocker and beta-blockers. Some other drugs showing protective associations included biguanides (metformin), estrogens, thyroid hormones and proton pump inhibitors, among others. Interestingly, we also observed protective associations by numerous vaccines. The most consistent association was observed for influenza vaccines, which showed reduced odds of infection (OR= 0.73 for vaccination in past year, CI 0.65-0.83) when compared cases to general population controls or test-negative controls (OR=0.60, 0.53-0.68). Protective associations were also observed when severe or fatal infection was considered as the outcome. Pneumococcal, tetanus, typhoid and combined bacterial and viral vaccines (ATC code J07CA) were also associated with lower odds of infection/severity. Conclusions: A number of drugs, including many for cardiometabolic disorders, may be associated with lower odds of infection/severity of infection. Several existing vaccines, especially flu vaccines, may be beneficial against COVID-19 as well. However, causal relationship cannot be established due to risk of confounding. While further studies are required to validate the findings, this work provides a useful reference for future meta-analyses, clinical trials or experimental studies. angiotensin receptor blockers, calcium channel blocker and beta-blockers. Some other drugs showing protective associations included biguanides (metformin), estrogens, thyroid hormones and proton pump inhibitors, among others. Interestingly, we also observed protective associations by numerous vaccines. The most consistent association was observed for influenza vaccines, which showed reduced odds of infection (OR= 0.73 for vaccination in past year, CI 0.65-0.83) when compared cases to general population controls or test-negative controls (OR=0.60, 0.53-0.68). Protective associations were also observed when severe or fatal infection was considered as the outcome. Pneumococcal, tetanus, typhoid and combined bacterial and viral vaccines (ATC code J07CA) were also associated with lower odds of infection/severity. Conclusions: A number of drugs, including many for cardiometabolic disorders, may be associated with lower odds of infection/severity of infection. Several existing vaccines, especially flu vaccines, may be beneficial against COVID-19 as well. However, causal relationship cannot be established due to risk of confounding. While further studies are required to validate the findings, this work provides a useful reference for future meta-analyses, clinical trials or experimental studies. Coronavirus Disease 2019 has resulted in a pandemic affecting more than a hundred countries worldwide [1] [2] [3] . More than 65 million confirmed infections and 1.5 million fatalities have been reported worldwide as at 20 th Nov 2020 (https://coronavirus.jhu.edu/map.html). It is of urgent public interest to gain deeper understanding into the disease, including identifying risk factors (RFs) for infection and severe disease, and uncovering new treatment strategies. A number of clinical risk factors (e.g. age, obesity, cardiometabolic disorders, renal diseases, multi-comorbidities) [4] [5] [6] [7] [8] have been suggested to increase the risk to infection or lead to greater risks of complications. However, it is less well-known how different drugs may increase or reduce the risks of COVID-19 or its severity. Drugs with protective effects may be potentially repurposed for the prevention or treatment of the disease. Development of a new drug is often extremely lengthy and costly, while existing drugs with known safety profiles can be brought into practice in a much shorter time-frame. Here we performed a comprehensive study on all ATC (Anatomical Therapeutic Chemical Classification System) level-4 drug categories (N=819) and assessed their associations with susceptibility to and severity of COVID-19 infection in the UK Biobank (UKBB), controlling for possible confounding factors. Vaccines were also included. To our knowledge, this is the most comprehensive analysis of drug associations with COVID-19 to date. While pharmaco-epidemiology studies are typically focused on one or a few drugs, COVID-19 is a new disease and we still have very limited understanding of its pathophysiology and treatment. As a result, a hypothesis-driven approach may have important limitations of missing potential drug associations. In the field of genetic epidemiology, it has been observed that hypothesis-driven candidate gene studies are not as reliable as genome-wide association studies (GWAS) 9 which is relatively unbiased, indicating merits of the latter approach. In the same vein, here we adopted a 'drug-wide' association study approach, which provides a systematic and unbiased assessment of drug associations. In the present work, we performed rigorous analyses on the impact of medications/vaccinations on the risk of infection, disease severity and mortality. Analyses were also conducted within infected patients, tested subjects and the whole population respectively, and for five different time-windows of prescriptions. The UK Biobank is a large-scale prospective cohort comprising over 500,000 subjects aged 40-69 years who were recruited in 2006-2010 10 . In this study, subjects with recorded mortality before 31 Jan 2020 (N = 28,930) were excluded, since it was the date for the first recorded case in UK. This study was conducted under project 28732. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 7, 2020. ; https://doi.org/10.1101/2020.12.05.20244426 doi: medRxiv preprint COVID-19 phenotypes COVID-19 outcome data were downloaded from UKBB data portal. Information regarding COVID-19 data in the UKBB was given at http://biobank.ndph.ox.ac.uk/showcase/exinfo.cgi?src=COVID19. Briefly, the latest COVID test results were downloaded on 6 Nov 2020 (last update 3 Nov 2020). We consider inpatient (hospitalization) status at testing as a proxy for severity. Data on date and cause of mortality were also extracted (latest update on 21 Oct 2020). Cases indicated by U07.1 were considered to be (laboratory-confirmed) COVID-19-related fatalities. A case was considered as having 'severe COVID-19' if the subject was hospitalized and/or if the cause of mortality was U07.1. We required both test result and origin to be 1 (positive test and inpatient origin) to be considered as a hospitalized case. For a small number of subjects with initial outpatient origin and positive test result, but changed to inpatient origin and negative result within 2 weeks, we still considered these subjects inpatient cases (i.e. assume the hospitalization was related to the infection). For a minority of subjects (N=19) whose mortality cause was U07.1 but test result(s) was negative within one week, to be conservative, they were excluded from subsequent analyses. Medication data was obtained from the Primary Care data for COVID-19 research in UKBB (details available at https://biobank.ndph.ox.ac.uk/showcase/showcase/docs/gp4covid19.pdf). In the UK, patients seeking medical advice usually visit a general practitioner (GP) first. Many illnesses are managed under a primary care setting, while most secondary care medical encounters are also reported back to the GP and recorded in their electronic records. We made use of the latest release of GP records released by UKBB, which contains prescription data from two EHR systems (TPP or EMIS) for ~397,000 UKBB participants. The drug code and issue date of each drug are available. Since the GP records cover up to ~50 years' of prescriptions, we set time windows to restrict prescriptions with a certain time period as the 'exposure'. The 'index date' was defined as (1) the date of the first positive COVID-19 test for infected subjects (for U07.1 cases, the mortality date was regarded as the index date if no test record was found); (2) the date of last test for those tested negative; (3) 3 Nov 2020 for those who were untested. The issue date of each prescription was available but the duration was not. Time windows were determined by whether the drug was issued within a specified period before the index date. The following windows were considered for medications: 6 months, 1 year, 2 years and 5 years. Narrower time windows (<6 months) may not be desirable and may lead to many prescriptions being missed as the latest issue date was 25 July 2020, but the latest index date was 3 Nov 2020. As for vaccines, unlike many medications, vaccines are not prescribed regularly and most vaccines only need to be given once or less than a few times; hence a narrow time window is not optimal due to sparsity of . CC-BY-NC-ND 4.0 International license It is made available under a perpetuity. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 7, 2020. ; https://doi.org/10.1101/2020.12.05.20244426 doi: medRxiv preprint data. For seasonal vaccines, namely flu vaccines, they are usually given in autumn (Sep to Nov) or early winter in the UK. A time-window of 6 months will lead to missing most of the flu vaccines given. On the other hand, it is also reasonable to consider a longer time window (e.g. 10 years) as vaccine effects can be more long-lasting 11 . In view of the above, we considered time windows of 1, 2, 5 and 10 years for vaccinations. For flu vaccines, we defined 'past 1 year' as prescriptions from 1 st Sep 2019 onwards (and similarly for past k years) to account for seasonal nature of vaccination. All the medications were mapped to ATC Classification (https://www.genome.jp/kegg-bin/get_htext?br08303). Drug categories were defined by the 4 th level of ATC classification. We performed multivariable regression analysis with adjustment for potential confounders including basic demographic variables (age, sex, ethnic group), comorbidities (coronary artery disease, diabetes, hypertension, asthma, COPD, depression, dementia, history of cancer), blood measurement (e.g. blood urea and creatinine reflecting renal function), indicators of general health (number of medications taken, number of non-cancer illnesses), anthropometric measures (body mass index [BMI]), socioeconomic status (Townsend Deprivation index) and lifestyle risk factor (smoking status). For disease traits, we included information from ICD-10 diagnoses (code 41270), self-reported illnesses (code 20002) and incorporated data from all waves of follow-ups. Subjects with no records of the relevant disease from either self-report or ICD-10 were regarded as having no history of the disease. We performed a total of 8 sets of analysis ( Table 1 ). The impact of prescribed medication/vaccination on the risk of infection (Model E and F), severity of infection (Model A, C and G) and risk of mortality (Model B, D and H) from COVID-19 were investigated separately. Both hospitalized and fatal cases were grouped under the 'severe' category. We also considered different study designs and conducted our analyses with different comparison samples. Models A and B are restricted to the infected subjects, while models C, D and E involves comparison of severe, fatal and general infected cases to the general population (with no known diagnosis of COVID-19). On the other hand, models F, G and H compared infected, severe and fatal cases respectively against subjects who were tested negative for SARS-CoV-2. There were 397,000 subjects in the UKBB with available GP prescription records. Among them, 30, 835 subjects have received at least one COVID-19 test, and 3858 were tested positive. There were 1318 cases classified as 'severe' (hospitalized or mortality from COVID-19) and 170 fatal cases. In total 393,142 UKBB is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 7, 2020. ; https://doi.org/10.1101/2020.12.05.20244426 doi: medRxiv preprint participants did not have a known diagnosis of COVID-19. The detailed count of participants for each model is listed in Table 2 . Logistic regression (using the R package speedglm) was used to examine the impact of medication on different outcomes in the eight sets of analysis. All statistical analyses were conducted using R. The false discovery rate (FDR) approach by Benjamini & Hochberg 12 was performed to control for multiple testing. This approach controls the expected proportion of false positives among the rejected null hypotheses. Missing values of remaining features were imputed with the R package missRanger. The program is based on missForest, which is an iterative imputation approach based on random forest (RF). It has been widely used and has been shown to produce low imputation errors and good performance in predictive models 13 . The program missRanger is largely based on the algorithm of missForest, but uses the R package 'ranger 14 ' to build RFs, leading to a large improvement in speed. (We found that other packages such as MICE and missForest are computationally too slow to produce results for the large-scale analyses here). Predictive mean matching (pmm) was also employed to avoid imputation with values not present in the original data and increase variance to more realistic levels for multiple imputation (MI). We followed the default settings with pmm.k = 5 and num.trees = 100. We performed the analyses on multiply imputed datasets (imputed for 10 times) and combined the results by Rubin's rules 15 using the mi.meld function under the R package amelia. Another advantage of missRanger is that out-of-bag errors (in terms of classification errors or normalized root-mean-squared error) could be computed which provides an estimate of imputation accuracy. Bias due to non-random testing has been discussed previously in other works 16, 17 . As a person has to be tested to be diagnosed of COVID-19, factors leading to increased risk of being tested will also lead to an apparent increase in the risk of infection 17 . In addition, it has been raised that collider bias can occur when conditioned on the tested group and results in spurious associations, for example between a risk factor and COVID-19 severity if both increases the Pr(tested). One way to reduce this kind of bias is to employ inverse probability weighting (IPW) of Pr(tested). Essentially, we wish to create a pseudo-population or mimic a scenario under which testing is more random instead of selected for certain subgroups. The IPW approach unweighs those who are less likely tested and downweighs those who have a high chance of being tested. This may create more unbiased estimates of the effects of drugs. We took reference to the approach described in 16 to analyze the data with IPW. Following our recent work 18 which aims to predict COVID-19 severity with machine learning (ML), here we also employed an ML model (XGboost) to predict Pr(tested) based on a range of factors. An advantage of using ML models is that non-linear and complex interactions can be considered, which may improve predictive performance over logistic models. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 7, 2020. ; https://doi.org/10.1101/2020.12.05.20244426 doi: medRxiv preprint We employed the same set of predictors as our previous work, and followed the same analysis strategy of hyper-parameter tuning and cross-validation to obtain predicted probabilities (please refer to 18 for details). Beta-calibration 19 was performed and the resulting average AUC was 0.622. The predicted probabilities [i.e. Pr(tested)] were used to construct weights for IPW. Stabilized weights 20 were used. Due to the large number of models and drugs being studied, we shall highlight the main results and findings from different sensitivity analysis. Confounding by indication and other comorbidities is unavoidable and in particular, drugs showing harmful effects may possibly be explained by such confounding. On the other hand, as it is expected that most diseases tend to increase the risk/severity of infection, drugs showing protective effects are much less likely to be affected by confounding, and such associations may be relatively more reliable. We therefore place a greater focus on protective drugs in the sections below, although main drugs with harmful effects will also be briefly discussed. A summary of the demographic and covariate data of the original UKBB dataset is shown in Table S1 . The missing rates and out-of-bag (OOB) errors for different variables from multiple imputations are shown in Table S2 . Full results of all drug categories across all time windows (including 6, 12, 24, 60 and 120 months; the last time-window only for vaccines) are shown in Tables S6 to S10. All protective associations (with at least nominal significance i.e. p<0.05) are shown in Table S3 , while all association results with vaccines are presented in Table S4 . For drugs associated with increased odds of infection/severity, we also summarized the top 10 drugs (ranked by p-value) from each model and time window, and put them together in Table S5 . Across all categories, statins showed the strongest and most consistent protective associations. Highly significant protective effects were seen across infected subjects, tested or the whole population. The most consistent evidence is for models A, C, D and G, which suggests its effect in reducing the severity or mortality of infection. Albeit with smaller effect sizes, we also observed that statins might be linked to lower susceptibility to infection (model E). Interestingly, a number of top-listed drugs are also cardiovascular medications, such as angiotensin converting enzyme inhibitors (ACEI), angiotensin receptor blockers (ARB), calcium channel blocker (CCB) and beta-blockers. For simplicity, odds ratios (OR) are presented for a time horizon of 1 year if not further specified. Significant protective associations (FDR<0.1) are shown in Table 3 . Statins showed protective effects across models A, C, D, E and G. Significant protective effects against severe infection were seen among infected . CC-BY-NC-ND 4.0 International license It is made available under a perpetuity. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 7, 2020. Significant associations for vaccines (FDR<0.1) are shown in Table 4 . As for vaccines, one of the most consistent associations was observed for influenza vaccines. Protective associations were observed across models B to H, and across all time windows. Flu vaccination was associated with lower odds of infection when is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 7, 2020. ; https://doi.org/10.1101/2020.12.05.20244426 doi: medRxiv preprint 0.38 -0.84); it also showed weakening of effect over time. Other significant associations included tetanus and typhoid vaccines which were observed to be protective against infections. Significant results for other drugs having protective effects and FDR<0.1 are shown in Table 5 Table S6 -10 and a summary is also provided in Table S5 . Analysis restricted to subjects with complete covariate data, with and without IPW We have repeated the analyses to subjects with complete covariate data, with or without the IPW approach. In general, we observed similar drugs with significant results and the top-ranked protective or harmful drugs were similar to the above. Comparing results with and without IPW, the list of significant drugs appeared similar although the OR estimates and SE were adjusted. The full results are presented in Table S11-12. In this work, we have performed a thorough and rigorous analysis on the effect of drugs and vaccines on COVID-19 susceptibility and severity. We uncovered a number of drugs with potentially protective or harmful effects. As an observational study, different kinds of bias such as confounding and selection bias may affect the results. We have performed analysis on infected subjects (models A and B), the whole population (models C, D, is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 7, 2020. ; https://doi.org/10.1101/2020.12.05.20244426 doi: medRxiv preprint E) and the tested population (models F, G, H) to obtain a more comprehensive picture of drug effects under different settings, and to avoid limitations (e.g. selection/collider bias) of some designs. We note that sometimes the different models may yield different results. One main observation is that analysis on the tested population appears to result in more findings of drugs with protective effects. We also observed that some drugs in model F (infected vs tested negative) may show different effects under model E (infected vs general population). Several reasons may explain this finding. First and foremost, confounding by indication is inevitable and may play a more important role when analyzing general population samples. It is possible that apparent harmful effects of drugs are due to the diseases/conditions that the prescription is related to, or to poorer health in general. Based on an machine learning model to predict testing probability (see Figure S1 ), we observed that people who are older, having more comorbidities and taking more medications, suffering from cardiovascular conditions etc. were more likely to be tested. Compared to the general population, the tested group may represent a more 'homogeneous' population, enriched for people with poorer health and more comorbidities in general. Therefore a proportion of confounders, which overlap with factors associated with higher probability of being tested, are essentially controlled for by stratification if we only study the tested subjects. On the other hand, in the general population, there is a higher proportion of healthy subjects, the effect of confounding by indication may be stronger. Another possibility is collider bias due to conditioning on a subgroup of subjects. For example, a drug may be associated with certain conditions which in turn are associated with higher chance of being tested; on the other hand those who have more severe symptoms or complications are more likely to be tested. Conditioning on testing may result in spurious associations between the drug and severity of infection. However, we have tried to minimize this type of bias by the IPW approach, and we did not observe substantial difference in results with or without IPW correction for most drugs. However, we note that even with adjustment by IPW, there is still chance for residual selection or collider bias. For example, some factors associated with Pr(tested) may not be captured in the prediction model. A third possibility to consider is that a drug may truly produce different effects in different subgroups, due to effect modification by other factors or diseases. For instance, a recent study reported that the protective effect of statins is more marked in patients with diabetes 21 . The fact that risk factor associations may differ between a whole-population or tested-population based study has also been noted previously, for example by 17 . Below we highlight drugs that are tentatively associated with altered risk or severity of infection. We will preferentially consider drugs that showed significant associations (with FDR<0.1) across multiple models and time-windows, those with stronger statistical significance, and those with protective effects as confounding by indication is much less likely. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 7, 2020. ; https://doi.org/10.1101/2020.12.05.20244426 doi: medRxiv preprint Interestingly, many drugs with potential protective effects are indicated for cardiometabolic (CM) disorders. Cardiometabolic risk factors, such as obesity, hypertension, DM and CAD, have consistently been shown to be related to risk and severity of infection; as such, it is biologically plausible that drugs for treating CM disorders may be beneficial. Among all drugs, the strongest and most consistent protective association was observed for statins. The Potential mechanisms for the protective actions of statins have been discussed elsewhere [24] [25] [26] . It has been postulated that besides reducing CVD risks, statins may reduce risk/severity of infection by inhibiting inflammation and excessive immune response, producing direct antiviral effects, improving endothelial function and exerting an antithrombotic effect, among other actions [24] [25] [26] . Another group of drugs worth highlighting is ACEI and ARB. There have been intense discussions on whether ACEI/ARB may affect risk or severity of infection from early on, as ACE2 is a receptor for SARS-CoV-2. Nevertheless, a recent study showed that ACE2 is localized in respiratory cilia, and the use of ARB/ACEI does not change its expression. 27 For several other kinds of CM drugs, the associations are not as strong but may still be worthy of further studies. Biguanides (mainly metformin) are observed to be protective for severe COVID-19 infection, both among the infected and at a population level. For example, in a meta-analysis on four observational studies of hospitalized patients mostly with type 2 DM, the use of metformin was associated with a lower risk of mortality (OR = 0.75, 95% CI = 0.67-0.85) 36 . A number of mechanisms have been proposed 36, 37 . For example, besides improving glycemic control and weight reduction, metformin may lead to AMPK activation which potentially reduces viral entry by phosphorylation of ACE2 receptor. It may also lead to mTOR pathway inhibition and prevents hyperactivation of the immune system 36 . Other drugs of interest may include beta-blockers and calcium channel blockers (C08CA, dihydropyridine derivatives). It was suggested that beta-blockers may be useful in preventing hyperinflammation and hence beneficial for COVID-19 38 . For calcium channel blockers (CCBs), a study using cell culture suggested that is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 7, 2020. ; https://doi.org/10.1101/2020.12.05.20244426 doi: medRxiv preprint CCBs, especially amlodipine and nifedipine, were useful in blocking viral entry and infection in epithelial lung cells. 39 . In another retrospective study 40 , both beta-blockers and CCBs were associated with lower mortality. Another relevant study in the UK 41 utilized data from the UK Clinical Practice Research Datalink (CPRD) and found that ACEI/ARB, CCBs and thiazide diuretics were all associated lower odds of diagnosis, while beta-blockers do not show any association after adjusting for consultation frequency. None of the above drugs were associated with mortality in that study 41 . There has been intense interest in whether vaccines indicated for other diseases may protect against COVID-19. Here we observed that a number of vaccines (ATC code J07) showed protection against infection or severe For influenza vaccines, we observed highly consistent protective associations. It has been proposed that 'trained innate immunity', which may involve epigenetic reprogramming of innate immune cells, may enable a vaccine to protect against other diseases 42, 43 . Interestingly, two studies in Italy reported that higher coverage rate of flu vaccine was associated with lower rate of infection, hospitalization and mortality from COVID-19. Another larger-scale study based on electronic records of 137,037 subjects who have received viral PCR tests showed that a number of vaccines (given in the past 1, 2 or 5 years) were associated with lower risks of infection 44 . These included flu and pneumococcal vaccines also implicated in the present study. Taken together, we believe that further experimental and clinical studies are warranted to investigate the non-specific effects of vaccines. We briefly highlight a few other drugs with potential protective effects. Estrogens (G03CA) were among the drugs showing protective associations. As many studies reported higher risks of severe disease in men than in women, it has been hypothesized that estrogen may play a part in the sex-discordant outcomes, for example via its effects on immune response to infections [45] [46] [47] . Thyroid hormones (TH) were also among the top-ranked drugs. It was postulated that TH may ameliorate tissue injury due to hypoxia by suppression of p38 MAPK 48 . Clinical trials on TH are ongoing 48, 49 and our findings support a protective role of TH in COVID-19. Another drug category of note is proton pump inhibitors (PPI). Several studies have suggested harmful effects of PPI on disease severity, which may be related to reduced gastric acid production with subsequent bacterial overgrowth [50] [51] [52] . However, an in-vitro screening study revealed that PPIs may serve as a potent inhibitor of SARS-CoV-2 replication 53 . The difference in findings between the current study and previous works may be due to is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 7, 2020. ; https://doi.org/10.1101/2020.12.05.20244426 doi: medRxiv preprint heterogeneity in study samples and designs, differences in the outcome studied (e.g. hospitalization vs ICU admission used in some other studies; infection risk vs severity of disease) and variations in the covariates being adjusted. Residual confounding, such as by other comorbidities and drugs given, may also affect the results. We noted a number of drugs with potentially harmful effects, but we caution that residual confounding, such as confounding by indication, other comorbidities and general poor health, may lead to bias towards an increased odds of infection or severe disease. For example, people who have poorer health in general may visit their GPs more often and be prescribed drugs (e.g. laxatives, antibiotics, painkillers), which may lead to confounding. Nevertheless, it is possible that some of the top-ranked drugs may indeed increase the risk/severity of infection. For instance, it is slightly unexpected that laxatives were highly significant across multiple models and time windows. Of note, it has recently been postulated that dysregulation of gut microbiome may be associated with susceptibility or resilience to infection 54, 55 , and laxatives represent a main category of drugs that affects the gut microbiome 56 . Interestingly, several associations involve psychiatric medications such as benzodiazepines, antipsychotics and anti-dementia drugs. The association may be due to underlying neuropsychiatric conditions (e.g. anxiety, psychosis, dementia etc.), or the effect of the drugs, or a combination of both. Some of above drugs overlapped with those revealed in a recent study using primary care data in Scotland. In a univariate analysis restricted to non-residents in care homes and those without major conditions, laxatives, anxiolytics, penicillins and opioid analgesics were significantly associated with ICU admission or mortality from COVID-19 when compared to population controls 57 . These drugs were also top-listed as those with harmful effects in our study. This study has a number of strengths. First and foremost, the study was performed on a large cohort of subjects with a sample size close to half a million. The sample was not limited to one or a few medical centers and covered the entire UK population, although this is not an entirely random sample and participation bias still exists 16 . The large and well-characterized sample also enables analysis of infected, tested as well the whole population. We have studies all level-4 ATC drug categories, allowing an unbiased and systematic analysis on the association of different drugs with COVID-19 risks or outcomes. This avoids the risk of publication bias, especially negative results to be unreported. Drugs showing null associations are still be of important public health interest, as this may suggest that patients on such medications may not need to change their regimen in view of the pandemic. Medication history was retrieved from GP records, which minimize recall bias and errors from self-reporting. Another strength is that we performed a variety of statistical analysis to reduce bias, including control for potential confounders, multiple imputation, IPW to reduce effects of testing bias, and study of different time windows and multiple models. Some of our findings were corroborated by previous studies; however, many previous clinical studies were limited to hospitalized or infected individuals, which cannot study the effect of drugs on susceptibility to infection. Selection on hospitalized/infected subjects may be prone to selection/collider bias as discussed elsewhere 16 , therefore we included multiple models with infected, is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 7, 2020. ; https://doi.org/10.1101/2020.12.05.20244426 doi: medRxiv preprint tested as well the whole population as samples, which aims to reduce bias and limitations due to specific study designs. There are also various limitations, some of which are mentioned above. First and foremost, this is an observational study based on a retrospective cohort of UKBB. As this is not a randomized controlled trial (RCT), confounding is inevitable, especially confounding by indication. Although we have controlled for main confounders in the regression model, residual confounding is still likely. Since confounding by indication will likely bias towards increased odds of infection or severe disease, null or protective associations may be more reliable. Confounding by the use of other types of drugs is also possible. Also, the UKBB cohort is not random and participants are in general healthier than the general population 58 . The majority of participants are of European descent so the findings may not be generalizable to other ethnicities. Also, the subjects are mostly >50 years old and drug effects in younger individuals may be different. Regarding drug history, it is worth noting that vaccination records are not complete as individuals may receive vaccination outside GP practices. Over-the-counter prescriptions were not counted, and it cannot be guaranteed that all drugs issued are dispensed by the pharmacy (https://biobank.ctsu.ox.ac.uk/crystal/crystal/docs/tppgp4covid19.pdf ). There is a relatively high missing rate of GP prescription records for deceased COVID-19 patients, which leads to reduced power to detect associations. While the UKBB cohort sample is large, we still have low power to detect associations for drugs that are uncommonly prescribed. Another limitation with the GP records is that only the issue date but no duration or dosage is available. As for the outcome, hospitalization is a rough proxy for severity only. Finally, this study focuses on prior (or pre-diagnostic) use of drugs and their association with infection risk/severity, and does not directly address whether newly prescribed drugs to recently diagnosed patients will be useful or not. Here we observed that a number of drugs, including many for cardiometabolic disorders, may be associated with lower odds of infection/severity of COVID-19. Several existing vaccines, especially flu vaccines, may be beneficial against COVID-19 as well. Due to the observational nature of the study, confounding cannot be excluded, and other bias and limitations may be present. We understand that causal relationship between drugs and disease cannot be reliably concluded from this study alone, and shall regard the findings as more exploratory than confirmatory. Nevertheless, being one of the most comprehensive studies to date on drug associations, we believe the current work provided a valuable resource to prioritize relevant drugs for future meta-analyses, clinical trials or experimental studies. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 7, 2020. The authors declare no conflict of interest. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 7, 2020. ; https://doi.org/10.1101/2020.12.05.20244426 doi: medRxiv preprint Only subjects with available GP prescription records are shown. . CC-BY-NC-ND 4.0 International license It is made available under a perpetuity. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 7, 2020. ; https://doi.org/10.1101/2020.12.05.20244426 doi: medRxiv preprint Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus-Infected Pneumonia Clinical Characteristics of Coronavirus Disease 2019 in China Obesity: A critical risk factor in the COVID-19 pandemic Clinical course and risk factors for mortality of adult inpatients with COVID-19 China: a retrospective cohort study Cardiometabolic multimorbidity is associated with a worse Covid-19 prognosis than individual cardiometabolic risk factors: a multicentre retrospective study (CoViDiab II) CKD is a key risk factor for COVID-19 mortality Comorbidities and the risk of severe or fatal outcomes associated with coronavirus disease 2019: A systematic review and meta-analysis Replicability and Prediction: Lessons and Challenges from GWAS UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age Heterogeneity and longevity of antibody memory to viruses and vaccines Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing Comparison of imputation methods for missing laboratory data in medicine A Fast Implementation of Random Forests for High Dimensional Data in C++ and R Multiple imputation for nonresponse in surveys Collider bias undermines our understanding of COVID-19 disease risk and severity Framework to aid analysis and interpretation of ongoing COVID-19 research Uncovering clinical risk factors and prediction of severe COVID-19: A machine learning approach based on UK Biobank data. medRxiv Beta calibration: a well-founded and easily implemented improvement on logistic calibration for binary classifiers Use of stabilized inverse propensity scores as weights to directly estimate relative risk and its confidence intervals Statin Use and In-Hospital Mortality in Diabetics with COVID-19 Meta-analysis of Effect of Statins in Patients with COVID-19 Statin use is associated with lower disease severity in COVID-19 infection Commentary: Statins, COVID-19, and coronary artery disease: killing two birds with one stone Statins and SARS-CoV-2 disease: Current concepts and possible benefits Potential role of statins in COVID-19 ACE2 localizes to the respiratory cilia and is not increased by ACE inhibitors or ARBs Update Alert 4: Risks and Impact of Angiotensin-Converting Enzyme Inhibitors or Angiotensin-Receptor Blockers on SARS-CoV-2 Infection in Adults Use of inhibitors of the renin-angiotensin system in hypertensive patients and COVID-19 severity: A systematic review and meta-analysis Risk of severe COVID-19 disease with ACE inhibitors and angiotensin receptor blockers: cohort study including 8.3 million people Renin-angiotensin system antagonists are associated with lower mortality in hypertensive patients with COVID-19 Clinical Features of COVID-19 in Patients With Essential Hypertension and the Impacts of Renin-angiotensin-aldosterone System Inhibitors on the ACE-inhibitors and Angiotensin-2 Receptor Blockers are not associated with severe SARS-COVID19 infection in a multi-site UK acute Hospital Trust. medRxiv COVID-19 with Different Severities: A Multicenter Study of Clinical Features Association of Inpatient Use of Angiotensin-Converting Enzyme Inhibitors and Angiotensin II Receptor Blockers With Mortality Among Patients With Hypertension Hospitalized With COVID-19 Metformin and COVID-19: From cellular mechanisms to reduced mortality Metformin in COVID-19: A possible role beyond diabetes Can Beta-2-Adrenergic Pathway Be a New Target to Combat SARS-CoV-2 Hyperinflammatory Syndrome?-Lessons Learned From Cancer FDA approved calcium channel blockers inhibit SARS-CoV-2 infectivity in epithelial lung cells Association of antihypertensive agents with the risk of in-hospital death in patients with Covid-19. medRxiv Antihypertensive Medications and COVID-19 Diagnosis and Mortality: Population-based Case-Control Analysis in the United Kingdom Trained innate immunity as underlying mechanism for the long-term, nonspecific effects of vaccines Unravelling the nature of non-specific effects of vaccines-A challenge for innate immunologists Exploratory analysis of immunization records highlights decreased SARS-CoV-2 rates in individuals with recent non-COVID-19 vaccinations. medRxiv Are sex discordant outcomes in COVID-19 related to sex hormones? Progesterone, Immunomodulation, and COVID-19 Outcomes Sexual Dimorphism of Coronavirus 19 Morbidity and Lethality Use of triiodothyronine to treat critically ill COVID-19 patients: a new clinical trial Triiodothyronine for the treatment of critically ill patients with COVID-19 infection: A structured summary of a study protocol for a randomised controlled trial Treatment with proton pump inhibitors increases the risk of secondary infections and ARDS in hospitalized patients with COVID-19: coincidence or underestimated risk factor Use of proton pump inhibitors and risk of adverse clinical outcomes from COVID-19: a meta-analysis A Preclinical safety study of thyroid hormone instilled into the lungs of healthy rats -an investigational therapy for ARDS In vitro screening of a FDA approved chemical library reveals potential inhibitors of SARS-CoV-2 replication Gut Microbiota Status in COVID-19: An Unrecognized Player? Alterations in Gut Microbiota of Patients With COVID-19 During Time of Hospitalization Impact of commonly used drugs on the composition and metabolic function of the gut microbiota Associations of severe COVID-19 with polypharmacy in the REACT-SCOT case-control study. medRxiv Comparison of Sociodemographic and Health-Related Characteristics of UK Biobank Participants With Those of the General Population