key: cord-0831901-7g9fivhc authors: Burn, E.; Li, X.; Kostka, K.; Morgan Stewart, H.; Reich, C.; Seager, S.; Duarte-Salles, T.; Fernandez-Bertolin, S.; Aragon, M.; Reyes, C.; Martinez-Hernandez, E.; Marti, E.; Delmestri, A.; Verhamme, K.; Rijnbeek, P.; PRIETO-ALHAMBRA, D. title: Background rates of five thrombosis with thrombocytopenia syndromes of special interest for COVID-19 vaccine safety surveillance: incidence between 2017 and 2019 and patient profiles from 20.6 million people in six European countries date: 2021-05-13 journal: nan DOI: 10.1101/2021.05.12.21257083 sha: b4ed0d2cc8ed33b889e4516766c31c2dc0e42257 doc_id: 831901 cord_uid: 7g9fivhc Background Thrombosis with thrombocytopenia syndrome (TTS) has been reported among individuals vaccinated with adenovirus-vectored COVID-19 vaccines. In this study we describe the background incidence of TTS in 6 European countries. Methods Electronic medical records from France, Netherlands, Italy, Germany, Spain, and the United Kingdom informed the study. Incidence rates of cerebral venous sinus thrombosis (CVST), splanchnic vein thrombosis (SVT), deep vein thrombosis (DVT), pulmonary embolism (PE), and stroke, all with concurrent thrombocytopenia, were estimated among the general population between 2017 to 2019. A range of additional adverse events of special interest for COVID-19 vaccinations were also studied in a similar manner. Findings A total of 20,599,134 individuals were included. Background rates ranged from 1.0 (0.7 to 1.4) to 1.5 (1.0 to 2.0) per 100,000 person-years for DVT with thrombocytopenia, from 0.5 (0.3 to 0.6) to 1.4 (1.1 to 1.8) for PE with thrombocytopenia, from 0.1 (0.0 to 0.1) to 0.7 (0.5 to 0.9) for SVT with thrombocytopenia, and from 0.2 (0.0 to 0.4) to 4.4 (3.9 to 5.0) for stroke with thrombocytopenia. CVST with thrombocytopenia was only identified in one database, with incidence rate of 0.1 (0.0 to 0.2) per 100,000 person-years. The incidence of TTS increased with age, with those affected typically having more comorbidities and greater medication use than the general population. TTS was also more often seen in men than women. A sizeable proportion of those affected were seen to have been taking antithrombotic and anticoagulant therapies prior to their TTS event. Interpretation Although rates vary across databases, TTS has consistently been seen to be a very rare event among the general population. While still very rare, rates of TTS are typically higher among older individuals, and those affected were also seen to generally be male and have more comorbidities and greater medication use than the general population. Funding This study was funded by the European Medicines Agency (EMA/2017/09/PE Lot 3). stroke with thrombocytopenia among a general population of 10 million individuals per 28 days. Expected events would however be higher for an older cohort, as patients with TTS are typically older than the general population. Patients with TTS are more often men and appear to typically have more comorbidities and greater medication use, including prior use of antithrombotic and anticoagulant therapies, than the general population. Not only do the rates of TTS being reported among those who had received an adenovirus-based vaccine against SARS-CoV-2 raise a concern, but early evidence from spontaneous reports suggests that TTS among those vaccinated has been more commonly been seen among women and younger persons. While this might be in part explained by potential biases in spontaneous reporting, it also underlines the importance of further evaluation of the safety signal for TTS. As with other adverse events of special interest for COVID-19 vaccines, estimates of TTS vary depending on the data source used. Comparison of observed versus expected TTS rates during the monitoring of COVID-19 vaccines should ideally be done using the same databases for the estimation of post-vaccine (observed) and background (expected) rates, with adjustment for patient characteristics (such as age and sex) done as part of the analysis. In little over a year since the beginning of the coronavirus disease 2019 pandemic, numerous vaccines against SARS-CoV-2 have been developed based on several platforms. 1 A range of these have demonstrated a high degree of efficacy in large phase 3 clinical trials, 2-4 received conditional approvals from regulators, and together they have already been given to over a billion individuals. 5 The benefits of these vaccines are demonstrable with, for example, a large study on mass vaccination in Israel finding the estimated effectiveness of BNT162b2 mRNA vaccine to be 94% for symptomatic COVID-19, 87% for hospitalisation, and 92% for severe COVID-19 from seven days after the second dose. 6 Similarly, the use of the BNT162b2 mRNA and ChAdOx1 in Scotland have been associated with substantial reductions in the risk of developing severe COVID -19 disease. 7 There remains, however, a need to continue assessing the safety of vaccines against SARS-CoV-2 and assess safety signals as and when they arise. While the results of phase 3 clinical trials provided valuable information on the rates of relatively common, but mostly mild, adverse reactions following vaccination against SARS-CoV-2, they were not powered to study the occurrence of rare adverse events of special interest. While the risks of rare but serious adverse events might be low, nationwide vaccination campaigns where millions of people are inoculated can lead to a considerable absolute number of any such events to occur. A particular area of concern has recently arisen relating to the occurrence of thrombosis (often cerebral or abdominal) with concomitant thrombocytopenia among individuals who had received adenovirus-based vaccine against SARS-CoV-2. As of the 28 th April 2021, 242 instances of provide context for the observed rates being seen among those vaccinated. 10 Moreover, a description of the profiles of the individuals who have had TTS events in the past will also help to inform a consideration of whether the profiles of individuals with TTS after a vaccination against COVID-19 differ to those who typically have such events. In this study we set out to estimate the background incidence rates of TTS and to describe the profiles of individuals who typically have these types of events. We did this using electronic medical records collected between 2017 to 2019 and covering over 20 million people across six European countries. In addition, we performed similar analyses for a range of other embolic and thrombotic events and coagulopathies of special interest for COVID-19 vaccinations. An international network cohort study using routinely-collected primary care data from across Europe. Data were previously mapped to the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM), which allowed for the study to be run in a distributed manner, with common analytic code run by each site without the need to share patient-level data between sites. [11] [12] [13] Data sources Data from six electronic medical records databases from France, Netherlands, Italy, Germany, Spain, and the United Kingdom informed the analysis. The Clinical Practice Research Datalink (CPRD) GOLD database contains data contributed by general practitioners (GP) from the United Kingdom. 14 The Integrated Primary Care Information (IPCI) database is collected from electronic healthcare records of patients registered with GPs throughout the Netherlands. IQVIA Longitudinal Patient Data (LPD) Italy includes anonymised patient records collected from software used by GPs during an office visit to document patients´ clinical records. IQVIA LPD France is a computerised network of physicians including GPs who contribute to a centralised database of anonymised patient electronic medical records. 15 IQVIA Disease Analyser (DA) Germany is collected from extracts of patient management software used by GPs and specialists practicing in ambulatory care settings. The Information System for Research in Primary Care (SIDIAP; www.sidiap.org) is a primary care records database that covers . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted May 13, 2021. ; https://doi.org/10.1101 https://doi.org/10. /2021 approximately 80% of the population of Catalonia, North-East Spain. SIDIAP-H is a subset of SIDIAP which includes around 2 million people out of the total 7 million in SIDIAP that are registered in primary care practices with linked hospital inpatient data available as obtained from the Catalan Institute of Health hospitals. 16 Results for SIDIAP-H are presented in this manuscript, with results for SIDIAP also reported in the supplementary materials for comparison. The primary study cohort consisted of individuals present in a database as of the 1 st January 2017, with this date used as the index date for all study participants. These individuals were followed up to whichever came first: the outcome of interest, exit from the database, or the 31 st December 2019 (the end of study period). A second study cohort definition of active patients was considered, where individuals entered the cohort on the date of their first visit occurrence after 1st January 2017. As with the primary study cohorts these individuals were followed up to whichever came first: the outcome of interest, exit from the database, or 31 st December 2019. As a sensitivity analysis, study cohorts were also generated with the additional requirement that individuals had a minimum of one year of history available in the database prior to their index date. For each analysis of a given outcome, any individual with that outcome in the year prior to their index date was excluded. Here we summarise results for five specific TTS events of interest: CVST with thrombocytopenia, DVT with thrombocytopenia, PE with thrombocytopenia, SVT with thrombocytopenia, and stroke with thrombocytopenia. Occurrences of CVST, DVT, PE, and stroke were identified on the basis of diagnostic codes. Given the limited granularity in coding for a number of the databases, stroke encapsulated both ischemic and hemorrhagic stroke. Thrombocytopenia was identified either by a diagnostic code or a measurement of between 10,000 to 150,000 platelets per microliter of blood and was required to have been observed over a time window starting ten days prior to the event of interest up to ten days afterwards. For comparison, we also provide results for each of the outcomes without the need for them to be accompanied by thrombocytopenia. In addition, we provide background rates for coagulopathies that have been identified as potential causes for TTS: heparininduced thrombocytopenia (HIT), disseminated intravascular coagulation (DIC), immune thrombocytopenia, and thrombotic thrombocytopenic purpura (TTP -which included hemolytic uremic syndrome). . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 13, 2021. ; https://doi.org/10.1101 https://doi.org/10. /2021 The outcomes described here are taken from a wider set of adverse events of special interest (AESI) for COVID-19 vaccinations. Three sets of outcome events were identified: 1) venous thromboembolic events, 2) arterial thromboembolic events, and 3) rare embolic, coagulopathies, and TTS events. For venous thromboembolic events, instances of DVT (with one broad definition and another narrow -in this manuscript, we describe results for the narrow definition) and PE events were identified, with venous thromboembolism events defined as the occurrence of either DVT or PE. For arterial thromboembolic events, instances of myocardial infarction and ischemic stroke were identified, along with the composite outcome of either of these events. Instances of stroke, either ischemic or hemorrhagic, were also identified. A wide set of rare embolic and thrombotic events and thrombocytopenias and platelet disorders were considered: DIC, immune thrombocytopenia, TTP, HIT, thrombocytopenia, thrombotic thrombocytopenic purpura, platelet disorder/s, CVST, splenic vein thrombosis, splenic artery thrombosis, splenic infarction, hepatic vein thrombosis, hepatic artery thrombosis, portal vein thrombosis, intestinal infarction, mesenteric vein thrombosis, celiac artery thrombosis, visceral vein thrombosis, and SVT. All study outcomes were identified using code lists reviewed by three epidemiologists (EB, DPA, CR), a clinical neurologist (EMH) and a clinical haematologist (EM). These definitions were reviewed using the aid of the CohortDiagnostics R package, 17 so as to identify additional codes of interest and to remove those highlighted as irrelevant based on feedback from regulators (e.g. puerperium and pregnancy-related disease) through an iterative process during the initial stages of analyses. A detailed description of the definitions used to identify the outcomes of the study is provided at https://livedataoxford.shinyapps.io/CovCoagOutcomesCohorts/. This application summarises the codes used to identify outcomes and their frequency in the databases used in the study, the overlap between cohorts in the databases as a whole, and a detailed summary of the profiles of all the individuals with a code of interest in each of the databases. The characteristics of the study population were extracted relative to their index date, as were those of individuals with a particular outcome of interest relative to the date of their event. The age and sex of individuals was identified, along with their history of conditions and medication use. Using all of an individual's prior observation time, prior diagnosis of autoimmune disease, antiphospholipid syndrome, thrombophilia, asthma, atrial fibrillation, malignant neoplastic disease, diabetes mellitus, obesity, heart disease, hypertensive disorder, renal impairment, chronic obstructive pulmonary disease (COPD), or dementia were identified. Meanwhile, prior medication use was identified using a time window of 183 days prior to four days prior the index date. Drug exposures of antithrombotic . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 13, 2021. and anticoagulant therapies, non-steroidal anti-inflammatory drugs, Cox-2 inhibitors, systemic corticosteroids, lipid modifying agents, antineoplastic and immunomodulating agents, hormonal contraceptives for systemic use, tamoxifen, and sex hormones and modulators of the genital system were identified over this time window. The profiles of the study cohorts and those with an outcome of interest were summarised, with median and interquartile range (IQR) used for continuous variables and counts and percentages used for categorical variables. For each study outcome, the number of events, the observed time at risk, and the incidence rate per 100,000 person-years are summarised along with exact 95% confidence intervals calculated using the Poisson distribution. These results are provided for the study cohorts and stratified by data source as a whole and stratified by age (≤44, 45 to 64, or ≥65 years old) and sex. To aid in comparison with rates being reported after vaccinations, the expected number of events per 28 days for a population of 10 million were calculated based on the incidence rates calculated for the overall study cohorts and age strata. All analytic code used for the analysis has been made publicly available at https://github.com/oxford-pharmacoepi/CovCoagBackgroundIncidence. This study was funded by the European Medicines Agency (EMA). The study outcomes were chosen in collaboration with the EMA so as to best reflect the safety signals of interest. The study protocol was reviewed by the EMA prior to its submission, and registered in the European Union electronic . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 13, 2021. ; https://doi.org/10.1101 https://doi.org/10. /2021 The median age of the background populations ranged from 41 in CPRD to 52 in DA Germany and LPD Italy. More detailed characteristics of each of these populations are summarised in Table 1. Incidence rates for the different outcomes of thrombosis and TTS across the databases are summarised in Table 2 . The incidence rates for CVST ranged from 0.3 (95% CI: 0.2 to 0.5) to 1.2 (1.0 to 1.5) per 100,000 person-years; CVST with thrombocytopenia was only seen in SIDIAP-H, where the incidence rate was 0.1 (0.0 to 0.2) per 100,000 person-years. The incidence rates for SVT ranged from 1.5 (1.2 to 1.8) to 14.6 (13.6 to 15.6) , and from 0.1 (0.0 to 0.1) to 0.7 (0.5 to 0.9) per 100,000 person-years for SVT with thrombocytopenia. The incidence rates for DVT ranged from 85.9 (84.6 to 87.2) to 187. 2 (182.6 to 191.8) , and from 1.0 (0.7 to 1.4) to 1.5 (1.0 to 2.0) per 100,000 person-years for DVT with thrombocytopenia. The incidence rates for PE ranged from 66.1 (63.0 to 69.2) to 92.1 (88.9 to 95.4), and from 0.5 (0.3 to 0.6) to 1.5 (1.3 to 1.7) per 100,000 person-years for PE with thrombocytopenia. Lastly, the incidence rates for stroke ranged from 128.3 (126.0 to 130.6) to 385.1 (379.9 to 390.3), and from 0.6 (0.4 to 0.7) to 4.4 (3.9 to 5.0) per 100,000 person-years for stroke with thrombocytopenia. As with thrombosis in general, incidence rates for TTS were typically higher for older age groups, see Figure 1 . Based on the highest estimates for the overall study cohorts, one would expect approximately 1 case of CVST with thrombocytopenia, 5 of SVT with thrombocytopenia, 11 of DVT with thrombocytopenia, 11 of PE with thrombocytopenia, and 34 of stroke with thrombocytopenia among a general population of 10 million individuals per 28 days. For a cohort aged 65 or over, this would rise to 9 of SVT with thrombocytopenia, 34 of DVT with thrombocytopenia, 38 of PE with thrombocytopenia, and 116 of stroke with thrombocytopenia. The age and sex profiles of those with TTS are summarised in Table 3 and the prevalence of comorbidities and prior medication are presented in Figure 3 , along with those of the study populations. The median age of the six individuals with CVST with thrombocytopenia in SIDIAP-H was 68 years old. The median age of those with DVT with thrombocytopenia ranged from 66 to 76 across the databases, from 70 to 78 for PE with thrombocytopenia, from 59 to 64 for SVT with thrombocytopenia, and from 72 to 78 for stroke with thrombocytopenia. Men predominated in all the affected cohorts, accounting for 51.3% to 80.6% of those with different TTS in the contributing databases. The prevalence of comorbidities and prior medication use was higher for patients with TTS than in the general population. In CPRD, for example, 1.8% of the source population had an autoimmune disease, 5.1% had a history of cancer, 5.5% had diabetes, 4.3% had renal impairment. These compared to 12.6%, 25.2%, 20.5%, and 26.8% for patients with DVT with thrombocytopenia. Similarly, while 2.9% of the study population were taking antithrombotic and anticoagulant . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 13, 2021. therapies in the months preceding their index date, 18.1% of patients with DVT with thrombocytopenia were. Requiring a year of prior history for study participants to be included in the analysis and defining study populations based on their first visit after 2017 had only a small effect on the results, see the Appendix. Incidence rates for DIC, HIT, immune thrombocytopenia, and TTP are summarised in Table 4 . The incidence rate for DIC ranged from 0.2 (0.1 to 0.3) to 3.8 (3.3 to 4.3) per 100,000 person-years, from 0.2 (0.1 to 0.4) to 7.8 (7.4 to 8.2) for HIT, from 2.1 (1.8 to 2.5) to 16.7 (15.7 to 17.8) for immune thrombocytopenia, and from 0.5 (0.4 to 0.7) to 2.8 (2.6 to 3.1) for thrombotic thrombocytopenic purpura. The incidence rates for all study outcomes are summarised in the supplementary materials and in a web application: https://livedataoxford.shinyapps.io/CovCoagBackgroundIncidence/, where the characteristics of outcome cohorts are also described. In this study we have analysed data for over 20 million people from across 6 European countries to establish the background incidence of TTS. With incidence rates of less than 5 per 100,000 personyears, TTS can be considered as a very rare event. These events can generally be expected to occur in older persons, with the average age of those over 60 for most events in most of the databases studied. Moreover, those affected typically had a higher prevalence of comorbidities, such as autoimmune diseases, cancer, and diabetes. They also had a high prevalence of use of medications indicated for the prevention of thrombosis including antithrombotic and anticoagulant therapies, as well as some potentially associated with an increased risk of TTS such as systemic glucocorticoids. Coagulopathies potentially associated with TTS were mostly rare: immune thrombocytopenia was the most common with rates up to almost 17 per 100,000 person-years, followed by HIT (up to 8 per 100,000), DIC (up to 4 per 100,000), and TTP (up to 3 per 100,000). Exact estimates of the background incidence of both rare events, such as CVST, and more common events, such as DVT, have though been seen to vary substantially depending on the data source . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 13, 2021. ; used. This comes even after the use of common analytic code and with databases mapped to a common data model. More specifically, the database which captured both inpatient and outpatient records from Spain showed the highest background rates of most of the studied events, demonstrating the importance of using consistent data sources in vaccine safety research. A number of previous studies have estimated the incidence of venous thromboembolism in the general population, with its incidence rate estimated to be around 100 persons per 100,000 personyears. 18 Approximately two-thirds of venous thromboembolism can be expected to present as DVT, with the other third presenting as PE with or without DVT. 19 The background of stroke has also been the subject of much research, with its incidence generally estimated to be more than 100 persons per 100,000 person-years. 20, 21 The incidence of each of these events is seen to be much higher among older persons. The incidence of SVT and CVST is far less well-known. Estimates for the incidence of CVST have ranged from 0.2 to 2 per 100,000 person-years. [22] [23] [24] [25] Meanwhile there is little research describing the incidence of SVT in the general population, although the incidence of portal vein thrombosis, the most commonly involved vein, has been estimated at around 3 per 100,000 person-years, while the incidence of Budd-Chiari syndrome was estimated at around 2 per 100,000 person-years in the same study. 26 In a recent study data from Denmark and Norway was used to assess 28-day rates of thromboembolic events and coagulation disorders among a cohort of people who had received the ChAdOx1 vaccine and in historical comparator cohorts. 27 In the historical comparator population, which covered 2016 to 2018 for Denmark and 2018 to 2019 for Norway, the incidence rate of CVST, PE, lower limb venous thrombosis, and SVT were estimated at 2, 57, 94, and 4 per 100,000 personyears, respectively. Meanwhile the incidence rate for idiopathic thrombocytopenia purpura and DIC were 7 and 1 per 100,000. These estimates are broadly in line with those from our study. The background incidence of TTS itself, however, has not been previously described in detail. Our findings demonstrate that thrombosis with concomitant thrombocytopenia is very rare, with an unadjusted overall incidence rate of less than 5 per 100,000 person-years for each of DVT, PE, CVST, SVT, and stroke with thrombocytopenia. In particular, CVST with thrombocytopenia was only seen in the database with patient-level linkage to hospital data, where its incidence rate was approximately one per million person-years. Similarly, the incidence rate for SVT with thrombocytopenia was between one and seven cases per million. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 13, 2021. ; https://doi.org/10.1101 https://doi.org/10. /2021 Spontaneous reporting has identified 93 cases of CVST with thrombocytopenia among individuals who had recently received the ChAdOx1 vaccine in the UK. 8 The profile of patients with TTS after vaccination also appears to differ to the typical profiles of those with TTS as seen in our data. While in this study we have seen those with TTS to typically be older than the general population, more commonly male, and with more comorbidities and greater prior medication use, initial studies describing the profiles of patients with vaccine-induced TTS have most often presented the cases of people who were aged under 60, more often female, and with relatively few comorbidities described. 28-31 While these case series of TTS after vaccination are small and their profiles may reflect the particular characteristics of those who were first to receive a vaccine against SARS-CoV-2, this dissimilarity in patient profiles of those with TTS in previous years and those for whom it has been reported following a vaccination is notable. Where we have estimated the incidence of thrombosis in general, our findings have been very broadly in line with previous research. Estimates have, however, been seen to vary substantially across databases. This was particularly the case for estimates for DVT, for example, where a threefold difference was seen between the databases with the highest and lowest incidence rates. Our estimates of TTS and coagulopathy also vary across the databases used. Indeed, this heterogeneity was observed although we used data mapped to a common data model and applied the same analytic code across the databases. Given that the data sources used come from different countries and can be expected to vary in completeness, accuracy, and granularity, it is not necessarily surprising that estimates vary. In addition, given the seriousness of many of the events described here, the degree to which episodes of hospital care are reflected in the data is also likely to be an important driver of the observed heterogeneity, with incidence rates higher for most of the events in the data source with linkage to hospitalisation data. The reasons for the heterogeneity seen between the databases used in this study is discussed in further detail in the Appendix. This study relies on routinely-collected health care data and while this has allowed for the inclusion of a large study population, the recording of TTS has not previously been evaluated in the databases used. A degree of measurement error can thus be expected, and further research is required to validate the recording of TTS. The degree to which the TTS events being described after vaccinations against SARS-CoV-2 are comparable to TTS events previously seen in the general population is as yet unclear. TTS after vaccination appears to occur at unusual sites, with a large proportion of spontaneous reports and . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 13, 2021. ; case series describing cerebral or abdominal thromboses, and with high levels of antibodies to platelet factor 4 often observed despite the absence of an exposure to heparin. 28, 32 In this study we have focused on specific sites of thrombosis with concomitant thrombocytopenia. We believe that this is more instructive than providing a singular background incidence rate for venous thromboembolism with thrombocytopenia, which would be driven in large part by commonly seen events (such as DVT and PE) and would not necessarily reflect the presentation of TTS after vaccination. As the pathophysiology of TTS after vaccination becomes better understood, definitions of the appropriate historical comparator can also be expected to evolve so as to best match the condition being described among those who have been recently vaccinated. Based on data from over 20 million people from six European databases, TTS has been seen to be very rare. While rates varied across databases, the highest incidence rates for DVT, PE, and stroke with thrombocytopenia were 1.5, 1.5, and 4 per 100,000 person-years, respectively. Meanwhile the highest incidence rates for CVST and SVT with thrombocytopenia were 0.1 and 0.7 per 100,000 person-years. TTS was typically seen among individuals older, more often male, and in worse health than the general population. While these findings help to provide context for the rates of adverse events being reported by spontaneous reports following vaccinations against SARS-CoV-2, a full assessment of the safety signal for TTS would benefit from within-database comparisons which account for individual-level characteristics such as age and sex. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 13, 2021. ; https://doi.org/10.1101 https://doi.org/10. /2021 24. M., C. J., M., Z. S., Majid, A. & Jan, S. The Incidence of Cerebral Venous Thrombosis. Stroke 43, 3375-3377 (2012 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 13, 2021. ; https://doi.org/10.1101 https://doi.org/10. /2021 Ethical approvals The protocol for this research was approved by the Independent Scientific Advisory Committee Ethics Committee (project code: 21/007-PCV), and the IPCI governance board (application number 3/2021). Some databases used (IQVIA LPD Italy, IQVIA LPD France, IQVIA DA Germany) in these analyses are commercially available, syndicated data assets that are licensed by contributing authors for observational research. These assets are de-identified commercially available data products that could be purchased and licensed by any researcher. As these data are deemed commercial assets, there is no Institutional Review Board applicable to the usage and dissemination of these result sets or required registration of the protocol with additional ethics oversight. Compliance with Data Use Agreement terms, which stipulate how these data can be used and for what purpose, is sufficient for the licensing commercial entities. Further inquiry related to the governance oversight of these assets can be made with the respective commercial entity, IQVIA (iqvia.com). All authors were involved in the study conception and design, interpretation of the results, and the preparation of the manuscript. EB led the data analysis and wrote the initial draft of the manuscript with DPA. EB, TDS, CR, MA, and SFB had access to the SIDIAP data, EB, XL, AD, and DPA had access to the CPRD data, PR had access to the IPCI data, and KK, HMS, CR, SS, had access to LPD France, LPD Italy, and DA Germany. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 13, 2021. ; https://doi.org/10.1101 https://doi.org/10. /2021 Tables and Figures A3. Heterogeneity in the incidence rates across databases In this study variation in incidence rates has been seen across the databases used. Below we discuss the reasons for this, with a particular focus on the results for deep vein thrombosis as an example. While heterogeneity between studies in the literature can often in part be explained by variation in analytic choices, this is not the case in this study where we applied the same analytic code to each database. Variation in the ´true´ background incidence rates across countries is not in itself unexpected, given differences in population demographics, the prevalence of risk factors for the events of interest, and so on. For example, risk factors for deep vein thrombosis include obesity (https://www.sciencedirect.com/science/article/pii/S0140673605718808), and the prevalence of obesity is well-known to vary substantially across countries (https://link.springer.com/article/10.1007/s00394-014-0746-4). Medical vocabularies vary and the databases used in the study use Read codes (CPRD), ICPC (IPCI), ICD-9 (LPD Italy), ICD-10 (DA Germany and LPDD France), and ICD-10CM (SIDIAP) to represent condition-related concepts. These coding systems differ in the way that they describe clinical events (in particular their granularity) and this can have a meaningful impact on research findings. For example, this can be seen in the literature by the impact on research findings when databases switched from using ICD-9 to ICD-10 codes. (https://onlinelibrary.wiley.com/doi/abs/10.1002/pds.4563). In our study, a substantial impact can be seen for some databases when using the narrow definition for DVT rather than the broad one, see Figure 1 and is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 13, 2021. ; https://doi.org/10.1101 https://doi.org/10. /2021 CPRD and IPCI, in did have a substantial impact for DA Germany (where the most common included code became "Phlebitis and thrombophlebitis of lower extremities, unspecified"), for LPD France (where the most common included code became "Phlebitis and thrombophlebitis of unspecified site"), for LPD Italy (where the most common included code became "Other venous embolism and thrombosis of unspecified site"), and SIDIAP (where the most common included code became "Phlebitis and thrombophlebitis of unspecified deep vessels of unspecified lower extremity"). This provides one example of the underlying variation between the databases in how the presentation of a particular is captured in the medical record. While using a common data model facilitates the analysis of the data sources in a standardised manner, this underlying heterogeneity in the source data is still reflected to an important degree. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted May 13, 2021. ; https://doi.org/10.1101 doi: medRxiv preprint Many of the events considered in our study can be expected to involve hospitalisation. Consequently, the degree to which diagnoses made in the hospital setting are reflected in the data sources used can be expected to have a large effect on the capture of events. We see this for SIDIAP where we ran the analysis in both the complete database and in a subset which was linked at a patient-level to hospital records, SIDIAP-H. An individual in the community with DVT may require hospital treatment and DVT may also occur in persons hospitalised for other reasons. For our narrow definition, we see that the incidence rate of DVT increases from 64.5 per 100,000 person-years in SIDIAP to 106.8 per 100,000 person-years in SIDIAP-H, see figure 3 . In SIDIAP-H, 44% of these DVT were among patients who were hospitalised at the time. While we can expect patient-level linkage of the other data sources to hospital level data to impact their results, it will not necessarily be to the same degree as in SIDIAP. In Catalonia, general practitioners are generally able to view patient hospital records during a consultation in a separate . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted May 13, 2021. ; https://doi.org/10.1101 https://doi.org/10. /2021 program to that used to view primary care records, with only the latter is captured in the SIDIAP database. Consequently, for patient care there is not a need to copy across what is seen in the system with the hospital record into the system with the primary care record which is captured by SIDIAP. This is not the case in the UK, however, where there is a need to add the information from letters received by general practitioners from secondary care facilities into the electronic medical record. Consequently, the impact of hospital linkage is likely to be context-dependent, as well as varying depending on the event under consideration. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted May 13, 2021. ; https://doi.org/10.1101 https://doi.org/10. /2021 SARS-CoV-2 vaccines in development Efficacy and Safety of the mRNA-1273 SARS-CoV-2 Vaccine Safety and Efficacy of the BNT162b2 mRNA Covid-19 Vaccine Safety and efficacy of the ChAdOx1 nCoV-19 vaccine (AZD1222) against SARS-CoV-2: an interim analysis of four randomised controlled trials in Brazil, South Africa, and the UK Unprecedented achievement': who received the first billion COVID vaccinations? BNT162b2 mRNA Covid-19 Vaccine in a Nationwide Mass Vaccination Setting Interim findings from first-dose mass COVID-19 vaccination roll-out and COVID-19 hospital admissions in Scotland: a national prospective cohort study MHRA. Coronavirus vaccine -weekly summary of Yellow Card reporting: Updated 6 May Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers Validation of a common data model for active safety surveillance research Data Resource Profile: Clinical Practice Research Datalink (CPRD) Validity And Limitations Of The Longitudinal Patient Database France For Use In Pharmacoepidemiological And Pharmacoeconomics Studies. Value Heal Construction and validation of a scoring system for the selection of high-quality data in a Spanish population primary care database (SIDIAP) CohortDiagnostics: Diagnostics for OHDSI studies The Epidemiology of Venous Thromboembolism Epidemiology and Risk Factors for Venous Thrombosis Worldwide stroke .1%) 7%) 1%) 3%) 1,425,782 (16.9%) 498,244 (12.6%) 222,433 (17.1%) 322,776 (30.3%) 429,924 (22.5%) Renal impairment .1%) Non-steroidal anti-inflammatory drugs Antithrombotic and anticoagulant therapies 5%) Lipid modifying agents IQVIA Longitudinal Patient Data France, IPCI: Integrated Primary Care Information, IQVIA LPD Italy: IQVIA Longitudinal Patient Data Italy, SIDIAP-H: Information System for Research in Primary Care with hospital linkage. COPD: chronic obstructive pulmonary disease 1,909 IQVIA Longitudinal Patient Data France, IPCI: Integrated Primary Care Information, IQVIA LPD Italy: IQVIA Longitudinal Patient Data Italy, SIDIAP-H: Information System for Research in Primary Care with hospital linkage This study was funded by the European Medicines Agency in the form of a competitive tender (Lot ROC No EMA/2017/09/PE). We acknowledge Prof Johan Van der Lei for the overall management of this research grant. DPA's research group has received research grants from the European Medicines Agency, from the Innovative Medicines Initiative, from Amgen, Chiesi, and from UCB Biopharma; and consultancy or