key: cord-0689337-dhvjb7in authors: Pulley, Jill M.; Jerome, Rebecca N.; Bernard, Gordon R.; Shirey-Rice, Jana K.; Xu, Yaomin; Wilkins, Consuelo H. title: The Astounding Breadth of Health Disparity: Phenome-Wide Effects of Race on Disease Risk date: 2020-09-18 journal: J Natl Med Assoc DOI: 10.1016/j.jnma.2020.08.009 sha: 6a853324d706971445991639b3e7c1535b593e43 doc_id: 689337 cord_uid: dhvjb7in OBJECTIVE: We conducted a phenotype-wide association study (PheWAS) to compare diagnoses among Blacks with those of Whites in one health center in Tennessee using data from 1,883,369 patients. METHODS: We used our deidentified EHR, the Synthetic Derivative, to assess risk of diagnoses associated with Black as compared with White race using Firth logistic regression with covariates including age, sex, and density of clinical encounters. RESULTS: There were anchoring associations in both directions, including the highest increased risk for Blacks of having sickle cell anemia, and strongest decreased risk of basal cell carcinoma. Results included established areas of disparity and many novel associations. CONCLUSIONS: PheWAS is a viable tool for calculating risk associated with any biomarker. The current analysis provide a new approach to generating hypotheses and understanding the breadth of health disparities. Future analyses will further explore causality, risk factors, and potential confounders not accounted for here. H ealth and healthcare disparities, and their myriad influences on the wellbeing of individuals in affected groups, are a major focus of initiatives in the United States. The National Institute on Minority Health and Health Disparities designated health disparity populations include "racial/ethnic minorities, socioeconomically disadvantaged populations, underserved rural populations, and sexual and gender minorities." 1 Health disparities are complex, and we know in particular that racial health disparities are multifactorial and that the 'variable' of race is often correlated with other factors (e.g., socioeconomic level, experience of discrimination, habits, health system interactions), each of which can independently and interdependently influence health. Thoughtful design of strategies to mitigate the untoward effects of disparities requires a sound understanding of both the scope and magnitude of health disparities affecting a group. The phenome-wide association study (PheWAS) provides a powerful and validated methodology for visualizing the effect of an exposure on the relative risk of all diagnoses documented in the electronic health record (EHR). 2, 3 While most commonly used to explore effects of genetic variation, PheWAS is readily adaptable to explore effects of other exposures such as race. To aid in estimating the breadth of racial health disparities, we conducted a PheWAS to compare diagnoses among Blacks with those of Whites at one health center in Tennessee. The analysis was not undertaken to ignore the importance of other factors. Rather, it was intended to assess variance in disease risk holistically, across many diseases, to: 1) visualize and obtain insight into the overall phenome-wide burden, 2) evaluate concordance between individual disease risk in the PheWAS analysis compared to established disparity to demonstrate utility; and 3) identify disparities among rarer diseases that might be overlooked in public health literature. We extracted diagnoses for all individuals with documented White or Black race from our Derivative (SD), a deidentified version of the entire EHR at our medical center. 4 Our EHR currently includes more than 3 million patients of all ages; the current study extracted data from 1990 to 2019. We employed PheWAS to analyze variability in risk of all documented diagnoses associated with Black as compared with White race using demographic data from the SD (99% accuracy compared to genetic ancestry 5 ). International Classification of Diseases, Ninth Revision (ICD-9) and ICD-10-CM codes were converted to phenotype codes (phecodes). 6, 7 For each phecode, a case was defined as having a minimum of two phecodes on different dates; controls were those having no related phecodes as is standard. Firth logistic regression was performed using R with covariates including age, sex, and the number of ages with a clinical encounter recorded. We report associations using the Bonferroni corrected p value of 2.7 Â 10 À5 (minimum detectable bound p ¼ 5 Â 10 À324 ). For comparisons with the published literature, we extracted published odds ratios (OR) or calculated relative risk comparing Blacks and Whites. Figure 1 illustrates the phenome wide results, including phenotypes with increased (top section) or decreased risk (bottom section) among Blacks as compared with Whites, representing a diverse range of disease types and affected organ systems. A dynamic version of the PheWAS results, with hover-over labels for all phenotypes, is available online at https://prod.tbilab.org/phewas_race. Our analysis included 1,883,369 patients, including 269,872 Blacks and 1,613,497 Whites. Mean age at last encounter was 37.8 years (range 0-90 years). Approximately 52.8% (n ¼ 994,930) were females and 47.2% (n ¼ 888,439) were males. We found anchoring associations as the absolute strongest OR in both directions in the data, including the highest increased risk for Blacks of sickle cell anemia 8 (OR 94.7; 95% CI 79.14, 114.51; p < 5 x 10 e324 ), and strongest decreased risk of basal cell carcinoma 9 (OR 0.009; 95% CI 0.005, 0.01; p < 5 x 10 e324 ). Agreement with previous research estimating risk magnitude was also apparent (Table 1) . Notably, almost all pregnancy complications were higher risk in Black women whereas many congenital anomalies carried higher risk in Whites. There were some apparent areas of discordance. For example, the risk of low birth weight among Blacks was lower in our data (OR 1.30; 95% CI 1.25, 1.35; p < 5 x 10 e324 ) as compared with the literature, while the odds of end stage renal disease was greater (OR 5.2; 95% CI 4.9-5.4; p < 5 x 10 e324 ). Further, though the odds of diabetes or cerebrovascular disease were similar between our results and the literature, the odds of downstream sequelae including diabetic retinopathy and end stage renal disease were larger among Blacks in our data than estimates reported elsewhere. Figure 1 . Manhattan plot of PheWAS results comparing phenotype risk between Blacks and Whites. The plot illustrates represents log odds ratio for phenotypes with increased or decreased odds among Blacks as compared with Whites; significance is set at the Bonferroni-corrected threshold of p < 2.7 Â 10 À5 . The disease categories with the greatest racial disparity (as indicated by the highest ORs) include HIV, end stage renal disease, hypertension, uterine fibroids, diabetes, sarcoidosis, asthma, atherosclerosis, and glaucoma (Tables 1 and 2 ). PheWAS recapitulated widely established areas of disparity, but also identified several diseases for which significant disparities do not appear to be as well studied or understood. The neoplasm category of phecodes showed the lowest relative volume of phenotypes among Blacks, with seven phenotypes with higher risk among Blacks and 89 neoplasm-related phenotypes with higher risk among Whites. High risk disease categories with the largest number of patients (shown by the size of the circle in Figure 1 ) generally conform to known prevalence among Blacks (although PheWAS represents health system data, not the general public) include those in Table 2 . In addition, the risk of readily remediable high health disparity conditions, such as vitamin D deficiency (OR 1.54; 95% CI 1.49, 1.58; p < 5 x 10 e324 , remains apparent. Other less well reported phenotypes with sizable populations included dermatophytosis (OR 2.00; 95% CI 1.89, 2.12; p < 5 x 10 e324 , iron deficiency anemia (OR 2.61; 95% CI 2.53, 2.71; p < 5 x 10 e324 ), and fever of unknown origin (OR 1.67; 95% CI 1.64, 1.70; p < 5 x 10 e324 ). While these are likely related to underlying disease such as diabetes, other immune dysfunction, or sickle cell, their appearance in the data might be reflecting the known ripple effect of disparities; that is, that individual health disparities are compounded, producing new, incremental increases in comorbidities over time in the Black population. Immune-related, rare, and mental health diseases among blacks Phenotypes carrying risks of immunocompromise, which may be particularly relevant in times of community outbreaks of communicable disease, are also notable. In addition to HIV and type 2 diabetes, we also see increased risk of various autoimmune diseases conferring risk of immune compromise due to the disease process and/or need for immunosuppressing treatment regimens (Table 2) . Less reported in the public health literature than common diseases, Blacks have an increased risk of many rare diseases (Table 2 and Figure 2 ). Blacks in this analysis also have an increased risk of many psychiatric diagnoses ( Table 2 ). Using a large disease-agnostic, real world database of diagnoses, we applied PheWAS which can calculate risk associated with any biomarker (here, we used race as the social construct). The data are credible, recapitulating known relative risk. Appropriate disease complexity is reflected (such as, a cluster of pregnancy-related complications). Long-term consequences of risk factors (e.g., cerebral atherosclerosis) are also present in the data (e.g., dementia). The spectrum of risks noted above are concordant with those inducing increased risk of COVID-19 infection: hypertension, diabetes, heart disease, asthma, obesity, and immune compromising conditions are likely playing a significant role in the increased COVID-19 disease severity and mortality experienced by Blacks in communities across the United States. The implications of these issues are potentially further worsened by delayed or cancelled health visits among those who cannot access telehealth formats. All of the limitations of the PheWAS method apply to this work, and have been described. 2 We note several of particular relevance to the current report. First, these codes do not separate biologic risk from risks associated with systematic differences in health system factors such as utilization or diagnostic biases; for example, the differences in mental health conditions are also concordant with previous literature on systemic biases in diagnoses among Blacks as compared with Whites. [10] [11] [12] [13] Other important factors also affect health and healthcare disparities and may lead to selection bias, including access to care, trust in the health system, and insurance status. For example, we observed many fewer diagnostic codes indicating neoplasms among Blacks, in contradiction with the published literature. This discrepancy is perhaps explained at least in part by insurance characteristics; many of our cancer clinics do not accept Medicaid; further, cancer incidence, as estimated in the current study, and mortality are different issues. Incorporation of data representing additional key exposures and outcomes such as these into future modeling will further inform our discussion of the breadth and implications of health and healthcare disparities. Despite these limitations, PheWAS represents a useful complement to existing approaches to visualizing health disparities, can highlight diseases of particular relevance to various audiences, and aid in decision making regarding high priority health disparities research and other programs. As stated, any given disease can have many individual (but not independent) risk factors such as genetics, socioeconomics, lifestyle, healthcare access, stressors, environmental exposures, and many others. But these factors converge in the Black population to produce drastically Figure 2 . Forest plot of PheWAS results comparing phenotype risk between Blacks and Whites among rare diseases. The plot illustrates represents log odds ratio for rare disease phenotypes with increased or decreased odds among Blacks as compared with Whites; significance is set at the Bonferroni-corrected threshold of p < 2.7 Â 10 À5 . PHENOME-WIDE EFFECTS OF RACE ON DISEASE poorer health. All of the multifactorial risks that correlate with race and contribute to poorer health are implicitly included within the aggregate results described above, experienced in the real world in their composite by the individuals whose diagnoses comprise these data. As health systems charged with maintaining the health of the public, we need to better understand and recognize the overwhelming disparity that exists among Black patients, both a single disease at a time, and in their totality. Indeed, the preponderance of health risk in Blacks culminates in variable longevity; Whites live on average 4 years longer than Blacks. 14 Poorer health is an important driver of that loss of life, with socioeconomic and other factors being principal underlying components. The project described was supported by CTSA award No. UL1 TR002243 from the National Center for Advancing Translational Sciences. Its contents are solely the responsibility of the authors and do not necessarily represent official views of the National Center for Advancing Translational Sciences or the National Institutes of Health. The national Institute on minority health and health disparities research framework PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data Secondary use of clinical data: the Vanderbilt approach Accuracy of administratively-assigned ancestry for diverse populations in an electronic medical record-linked biobank Evaluating phecodes, clinical classification software, and ICD-9-CM codes for phenome-wide association studies in the electronic health record Mapping ICD-10 and ICD-10-CM codes to phecodes: workflow development and initial evaluation Data & Statistics on Sickle Cell Disease j CDC Vital signs: melanoma incidence and mortality trends and projections -United States Racial disparities in bipolar disorder treatment and research: a call to action Racism as a unique social determinant of mental health: development of a didactic curriculum for psychiatry residents The role of treatment setting and high acuity in the overdiagnosis of schizophrenia in African Americans Racial disparities in psychotic disorder diagnosis: a review of empirical literature Lagging life expectancy for black men: a public health imperative Morbid obesity rates continue to rise rapidly in the United States Delivery Hospitalizations Involving Preeclampsia and Eclampsia Heart disease and African Americans -the office of minority health Diabetes and African Americans -the office of minority health Lupus in women j CDC Racial and ethnic differences in trends in dementia prevalence and risk factors in the United States Stroke and African Americans -the office of minority health Recent trends, risk factors, and disparities in low birth weight in California, 2005e2014: a retrospective study Prevalence of diabetic retinopathy in the United States None.