key: cord-1015042-l5ud3kto authors: Sudharsanan, Nikkil; Didzun, Oliver; Bärnighausen, Till; Geldsetzer, Pascal title: The Contribution of the Age Distribution of Cases to COVID-19 Case Fatality Across Countries: A 9-Country Demographic Study date: 2020-07-22 journal: Ann Intern Med DOI: 10.7326/m20-2973 sha: 230c8989f711fd5023e15d76c4bd6cef91dd8b08 doc_id: 1015042 cord_uid: l5ud3kto BACKGROUND: There is wide variation in coronavirus disease 2019 (COVID-19) case-fatality rates (CFRs) across countries, leading to uncertainty about the true lethality of the disease. A large part of this variation may be due to the ages of individuals who are tested and identified. OBJECTIVE: To measure the contribution of distortions from the age distributions of confirmed cases to CFRs within and across populations. DESIGN: Cross-sectional demographic study using aggregate data on COVID-19 cases and deaths by age. SETTING: Population-based data from China, France, Germany, Italy, the Netherlands, South Korea, Spain, Switzerland, and the United States. PARTICIPANTS: All individuals with confirmed COVID-19, as reported by each country as of 19 April 2020 (N = 1 223 261). MEASUREMENTS: Age-specific COVID-19 CFRs and age-specific population shares by country. RESULTS: The overall observed CFR varies widely, with the highest rates in Italy (9.3%) and the Netherlands (7.4%) and the lowest rates in South Korea (1.6%) and Germany (0.7%). Adjustment for the age distribution of cases explains 66% of the variation of across countries, with a resulting age-standardized median CFR of 1.9%. Among a larger sample of 95 countries, the observed variation in COVID-19 CFRs is 13 times larger than what would be expected on the basis of just differences in the age-composition of countries. LIMITATION: The age-adjusted rates assume that, conditional on age, COVID-19 mortality among diagnosed cases is the same as that among undiagnosed cases and that individuals of all ages are equally susceptible to severe acute respiratory syndrome coronavirus 2 infection. CONCLUSION: Selective testing and identifying of older cases considerably warps estimates of the lethality of COVID-19 within populations and comparisons across countries. Removing age distortions and focusing on differences in age-adjusted case fatality will be essential for accurately comparing countries' performance in caring for patients with COVID-19 and for monitoring the epidemic over time. PRIMARY FUNDING SOURCE: Alexander von Humboldt Foundation. C oronavirus disease 2019 (COVID-19) has led to unprecedented disruptions to health systems and individuals' social, psychological, and economic lives (1) (2) (3) (4) (5) (6) . As the number of COVID-19 cases continues to increase worldwide (7) , individuals are being exposed to a continuous flow of information (and misinformation) about the disease (8, 9) . The lethality of COVID-19 in particular is highly discussed by the news media and general public, especially because wide differences have emerged in the COVID-19 case-fatality rate (CFR) across countries (7) . This wide variation has contributed to confusion among the general public and also among scientists and policymakers as to how fatal infection with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) really is (10) . A large part of the variation in CFRs across countries may be due to the ages of individuals who are tested and identified. One consistent pattern across countries is that the COVID-19 CFR rises sharply over age (Supplement Figure 1 , available at Annals.org) (11) . This relationship means that small differences in the age distribution of cases can have a strong influence on the overall CFR observed in the population. Indeed, there has been discussion about the role that the age distribution of cases plays in observed CFR differences (12) . For example, news sources and scientific journals have reported that lower CFRs in low-and middle-income countries could reflect younger population distributions and that the high CFR in Italy might be due to the large proportion of older individuals with confirmed COVID-19 (10, 13) . However, the contribution of such age-based distortions on observed CFR differences across populations has not been empirically examined and quantified. We measured how much of the wide variation in CFRs across countries is due to differences in the age distribution of cases rather than differences in the virulence of SARS-CoV-2, underlying health of cases (independent of age), and the ability of the health system to effectively care for patients with COVID-19. This information is important for comparing countries' performance in treating and caring for patients with COVID-19 and to measure progress over time. This information can also help policymakers and the public anticipate the likely CFR See also: Our main aim was to determine how much of the differences in the observed overall CFR across countries can be attributed to differences in the age distri-bution of cases. To explore this, our analysis had the following 3 steps. First, we calculated the observed overall CFR in each country by multiplying the agespecific CFRs by the corresponding age-specific share of cases in that age group and summing this product across age groups: Here, is m i the age-specific CFR and is d i the proportion of cases in age group i. Second, we compared the observed overall CFR with 2 age-adjusted CFRs: the age-expected CFR and the age-standardized CFR. To estimate the ageexpected CFR, we conducted the same procedure as for the observed overall CFR, except that instead of using the observed age-specific proportions of cases, d i , we used the share of the overall population in age group i, d i pop : Third, we estimated the age-standardized CFR similarly, this time using a common age distribution of cases, d i std , across all countries (Supplement Table 14 shows the estimated standard distribution). Compared with the observed overall CFR, the ageexpected CFR corrects for within-country distortions by assuming that the age distribution of the entire population is a better estimate of the true underlying distribution of cases than the observed age distribution of COVID-19 cases, which is strongly influenced by who presented with the most salient symptoms and was thus tested. Another way of conceptualizing the ageexpected CFR is that it is the CFR that would occur if, within a given population, individuals of all ages were equally likely to be infected regardless of who showed symptoms or was tested. Of note, because the ageexpected CFR does not use the observed age distribution of cases, it is less dependent on the distribution of testing in a population. This approach of applying agespecific rates estimated from a subpopulation (in our case, only among those who have been tested and confirmed to have COVID-19) to the overall age distribution of a population to estimate population-level rates is often used in environments where high-quality mortality data are available for only a small subset of the population. For example, studies in India, where comprehensive cause-of-death registers do not exist, have estimated overall cause-specific mortality rates by applying age-and-cause-specific mortality rates from a mortality surveillance cohort to the overall population distributions (25) . The goal of age-expected CFRs is to attempt to correct for distortions in who was tested within populations and therefore provide a more accurate picture of the CFR in each country separately. In contrast, agestandardized rates provide a way to compare CFRs across countries. By standardizing the CFRs using a common case-distribution by age across countries, any ORIGINAL RESEARCH Age Distributions of Cases and COVID-19 Case Fatality Across Countries differences that remain among countries are purely due to differences in their age-specific CFRs. We estimated the contribution of these 2 age adjustments as the difference in the SD of CFRs across countries relative to the observed rates. For all estimated rates, the widths of the 95% Cis were within 7% of reported rate for most of the countries and within 14% of the reported rates for the Netherlands and South Korea. Next, we used a form of indirect standardization to explore how much variation we would expect in the COVID-19 CFR across countries purely due to differences in population age distributions and how much "excess" variation we observe due to distortions caused by age distribution among the larger sample of 95 countries. We were unable to use this larger sample of countries for our primary analyses because they did not have information on COVID-19 mortality disaggregated by age. For each country in this analysis, we estimated an overall predicted CFR on the basis of the age distribution of the country and a common set of age-specific CFRs across countries. We constructed the common agespecific CFRs as the mean of the age-specific CFRs across the 9 countries in our first analyses, because they were the countries for which age-disaggregated data were available (Supplement Table 16 shows this common age pattern of COVID-19 mortality). This process completely removed the influence of country differences in age-specific COVID-19 mortality, thus allowing only differences in the age distribution to drive differences in the overall predicted CFRs (this procedure if often referred to as "indi-rect standardization" and can be thought of as standardizing the age-specific CFRs rather than standardizing the case distribution across ages). We then compared the across-country distribution and SD in these predicted CFRs with the across-country distribution and SD in actual reported CFRs. We were, unfortunately, unable to directly age-standardize the rates for this larger set of countries because we did not have age-disaggregated information on cases and deaths. There was wide variation in the observed overall CFRs, with the highest rates in Italy (9.3%), the Netherlands (7.4%), and Spain (6.0%) and lowest rates in South Korea (1.6%), the United States (1.2%), and Germany (0.7%) (Figure 1) . These wide differences led to an SD of the observed overall CFRs of 3.1%. The differences across countries attenuated substantially between the observed overall and age-expected CFRs. For example, Italy's high observed overall CFR decreased by half (to 4.6%) and the Netherlands' rate decreased by nearly two thirds (to 2.6%). Conversely, Germany's low rate doubled (to 1.5%). Not all countries had a large change between the observed overall and age-expected rates. For example, South Korea's rate only decreased by 0.1% to an age-expected rate of 1.5%. Overall, moving from the observed overall to age-expected CFRs decreased the SD of the CFRs across countries from 3.1% to 1.2%. The differences across countries further attenuated between the age-expected and age-standardized CFRs, with a further reduction in the SD to 1.0%, and a median age-adjusted CFR of 1.9%. Given that the SD for the observed CFR between countries is 3.1 and the SD for the age-standardized CFR 1.0, we conclude that differences in the age distribution of cases are responsible for two thirds of the variation in the observed CFRs across countries. Of note, although age standardization reduced the differences in CFRs across countries, even after age standardization, Italy (3.9%), Spain (2.8%), and the Netherlands (2.7%) still had the highest CFRs. Adjustment for age differences, however, affected which countries have the lowest rates. Switzerland became the best-performing country, with an age standardized CFR of just 1.2%, whereas South Korea went from the third-to fifth-best performing country. Germany's considerable advantage among the observed CFRs also disappeared, with an age-standardized CFR on par with the United States and France. Among the larger analysis of 95 countries, there was large variation in observed overall CFRs, with a difference of 28.6 percentage points between the highest rates in Sudan (28.6%) and Angola (25.0%) and the lowest rates in Eritrea (0.0%) and Cambodia (0.0%) (Figure 3) . The variation for the predicted CFRs was substantially smaller, with a difference of just 1.9 percentage points between the highest rate (2.1% in Malta) and the lowest rate (0.24% in Uganda). Overall, the SD of CFRs decreased markedly, from 6.4 among the observed overall CFRs to just 0.5 among the predicted CFRs. We found that distortions from the ages of individuals who were tested and identified as having COVID-19 explains two thirds of the variation in overall CFRs across countries. This suggests that selective testing and identification of older patients and age-distribution differences among countries considerably warp estimates of differences in the lethality of COVID-19. This distortion is especially salient for pairwise comparisons of countries with very different age structures, such as Italy and China, where changes to the age distribution of cases drastically affects the difference in CFRs between countries. We observed a similar phenomenon among our larger sample of 95 countries and found that observed differences among countries in overall COVID-19 CFRs are far larger than what we would expect on the basis of just agecomposition differences across countries. Compared with the observed overall CFRs, our age-expected rates make the assumption that individuals of all ages have an equal likelihood of being infected regardless of whether they are symptomatic or Diagonal boxes are the age-expected case-fatality rates for each country. Off-diagonal estimates are interpreted as, "What would the age-expected case-fatality rate for country X be if it had the age-distribution of country Y?". Age Distributions of Cases and COVID-19 Case Fatality Across Countries tested. This means the age-expected rates are a product of the overall age structure of a country and its age-specific CFRs. Therefore, changes between the observed overall and age-expected CFRs within a country reflect the extent to which the observed age distribution of cases is older or younger than the overall population distribution. For example, we found that the CFR in Germany doubled when moving from the observed to the age-expected CFR, whereas the CFR in Italy, the Netherlands, and Spain approximately halved. This reveals that in Germany, the distribution of individuals who were tested and confirmed as having COVID-19 was substantially younger than the overall population, implying that testing was disproportionately done among younger individuals. Conversely, in Italy, the Netherlands, and Spain, the distribution of confirmed cases was far older than the overall population distribution, suggesting that identification was disproportionately done among older individuals. This may reflect differences in testing strategies. For example, countries where testing was done primarily among those who exhibit severe symptoms and seek care are likely to disproportionately identify older cases, leading the observed overall CFR to be much higher than the ageexpected CFR. Compared with age-expected CFRs, the agestandardized CFRs provide a way to compare COVID-19 mortality across countries while filtering out any differences between countries in both the age distributions of COVID-19 cases and the age distributions of their populations. For example, whereas the observed overall and age-expected rates gave the impression that China and South Korea had much lower CFRs than the other countries, this difference narrowed considerably after age standardization. This change reveals that the comparatively younger distribution of these 2 countries gave a skewed impression of how they were faring relative to countries with older age distributions. Of note, however, agestandardized CFRs are primarily a tool for comparing countries with one another. Indeed, changes within a country between the age-expected and age-standardized rates do not provide any information other than that the age distribution of a country is different from the standard distribution. After age standardization, the highest CFRs were still observed in Italy, Spain, and the Netherlands, and the lowest CFR occurred in Switzerland, followed closely by France, the United States, and Germany. Several factors are likely to explain these residual differences across countries, including differences in the underlying health of the populations; timely identification of and care for COVID-19; health care quality, especially for treatment of chronic conditions; and, more broadly, the general preparedness of health systems The predicted case-fatality rates use the age distribution of the country and the average age-specific case-fatality rates of the 9 countries in Figure 2 . IQR = interquartile range. (26 -29 ). An important future area of research will be to identify the contribution of each of these pathways to ultimately prepare health systems for future waves of the epidemic. Our study has limitations. First, the age-expected CFR relies on the assumption that all individuals in the population are equally likely to be infected (regardless of whether they show symptoms or are ultimately tested). This assumption could be violated for clinical reasons, such as if persons with preexisting chronic conditions are more susceptible to infection, and social reasons, if age is related to the likelihood that an individual congregates in groups or engages in preventive behavior. Evidence in support of both these potential pathways is still emerging with some indication, for example, that individuals with diabetes may be more susceptible to infection (30) , and that the high rate of infection among older Italians relative to other age groups was not just a distortion from testing but was instead related to cultural practices, whereby older Italians were more likely to live in the same household with younger generations (12) . Ultimately, evidence on the true susceptibility and how it varies across the population is still emerging; in the interim, however, we believe our approach of assuming a uniform infection rate by age provides a better estimate of the true infection distribution than the distribution of identified cases. In addition, both the age-expected and agestandardized CFRs make the additional assumption that the age-specific CFRs among individuals who were identified as having COVID-19 are the same as those who had undiagnosed disease. The extent to which this assumption holds probably varies with country-specific testing approaches. In countries with widespread and early testing, such as South Korea and Germany, the observed age-specific CFRs are likely to be more representative of the true age-specific CFRs because they are based on a broader population sample. However, in countries where testing was less comprehensive and more likely to be done among severe cases, such as in the United States and Italy, the observed age-specific CFRs may be higher than the true values because more severe cases were likely to be detected. Similarly, attribution of deaths to COVID-19 may bias age-specific CFRs if countries define COVID-19 deaths differently. In these circumstances, neither the age-expected nor age-standardized rates will correct for this source of bias. Continuously evaluating and reestimating CFRs as countries expand testing will be crucial for alleviating these biases and developing a better understanding of the lethality of the condition. Our study reveals the strong and important role that the age distribution of cases can have on COVID-19 case fatality. However, even after we corrected for age distortions, important differences in CFRs remained across countries. This suggests that differences in the underlying health of a country's population and how effectively the health system cares for identified COVID-19 cases have meaningful effects on the share of individuals diagnosed with COVID-19 who survive. Removing the noise from age distortions and focusing on why age-adjusted CFRs are higher in some countries than in others, and why they change within countries over time, will be essential for formulating best case strategies for preventing and reducing COVID-19 mortality. The Italian health system and the COVID-19 challenge COVID-19 in Europe: the Italian lesson Psychological impact of the COVID-19 pandemic on health care workers in Singapore. Ann Intern Med COVID-induced economic uncertainty COVID-19 and the consequences of isolating the elderly COVID-19 exacerbating inequalities in the US An interactive web-based dashboard to track COVID-19 in real time The COVID-19 social media infodemic The novel coronavirus (COVID-2019) outbreak: amplification of public health consequences by me-ORIGINAL RESEARCH Age Distributions of Cases and COVID-19 Case Fatality Across Countries dia exposure Why death and mortality rates differ. BBC. 2 Demographic perspectives on mortality of Covid-19 and other epidemics. National Bureau of Economic Research. Working Paper 27043 Demographic science aids in understanding the spread and fatality rates of COVID-19 Division of Risk Assessment and International Cooperation, KCDC. The updates on COVID-19 in Korea as of COVID-19): cases in the US. Accessed at www.cdc.gov/corona virus/2019-ncov/cases-updates/cases-in-us The epidemiological characteristics of an outbreak of 2019 novel coronavirus diseases (COVID-19)-China, 2020. China CDC Weekly COVID-19: point é pidé miologique du 15 mars 2020 Accessed at www.epicentro.iss.it /coronavirus/bollettino/Bollettino-sorveglianza-integrata-COVID-19 _26-marzo%202020 Rijksinstituut voor Volksgezondheid en Milieu. Ontwikkeling COVID-19 in grafieken Enfermedad por el coronavirus (COVID-19) dd6_0/data?selectedAttribute=AnzahlTodesfall on 29 European Centre for Disease Prevention and Control. Download today's data on the geographic distribution of COVID-19 cases worldwide United Nations Department of Economic and Social Affairs Million Death Study Collaborators. Causes of neonatal and child mortality in India: a nationally representative mortality survey Long-term and recent trends in hypertension awareness, treatment, and control in 12 high-income countries: an analysis of 123 nationally representative surveys Lower mortality of COVID-19 by early recognition and intervention: experience from Jiangsu Province Smoking is associated with COVID-19 progression: a meta-analysis Are patients with hypertension and diabetes mellitus at increased risk for COVID-19 infection?