key: cord-0618453-6w37btqf authors: Eryarsoy, Enes; Delen, Dursun title: Cross-Country Age Disparities in COVID-19 Cases with Hospitalization, ICU Usage, and Morbidity date: 2020-04-12 journal: nan DOI: nan sha: 64eaa66d4b4d594ccfc0692d17c65883bf8991f6 doc_id: 618453 cord_uid: 6w37btqf In this paper, we examine cross-country differences, in terms of the age distribution of symptomatic cases, hospitalizations, intensive care unit (ICU) cases, and fatalities due to the novel COVID-19. By calculating conditional probabilities, we bridge country-level incidence data gathered from different countries and attribute the variability in data to country demographics. We then provide case, hospitalization, ICU, and fatality estimates for a comprehensive list of countries using the existing data from a variety of countries. The first COVID-19 outbreak took place in the city of Wuhan in the Hubei province of China. Despite strict and robust prevention measures taken in the city, the virus has spread the rest of the world in a matter of a few weeks. Within three months, the World Health Organization (WHO) declared the outbreak a pandemic. While some more significantly than others, the virus has taken its toll on all countries with no exception. The global impact of COVID-19 has been very profound and probably unprecedented since the Spanish flu (H1N1 influenza circa 1918). Due to the novelty of the virus, and the nonexistence of vaccination, health professionals have been trying to cope with the pandemic using symptomatic treatment regimes. The rapid spread of the virus has caused a record influx of patients at hospitals, pushing the primary care health systems to the brink of total collapse. In order to ease the excessive burden on their healthcare systems, the governments are seeking out ways to suppress the transmission of the virus. The rapid spread of the virus has even made the calculation of the rate of spread difficult. One frequently used way of measuring the spread is by computing the average number of secondary cases, or infections, that each case generates. This is known as the R-naught ( 0 ) of the virus. The 0 's time and place dependent nature (typically smaller in the South Asian countries, depending on measures such as social distancing, isolation, partial or complete lockdowns, quarantines by the local authorities) is making modeling the spread of the virus a moving target (De Brouwer et al., 2020) . Even though the literature on COVID-19 is rapidly expanding, there is still a lack of consensus among academics and other scientists on the dynamics of the spread. This can perhaps be attributed to many reasons, such as the unpreparedness to a pandemic at this level, the lack of unified reporting systems due to diversity of health systems across the world, and the novelty of the pandemic itself. Many governments are seeking out forming different strategies that involve mitigating the spread until a method of prevention or a well-defined, and a successful treatment regime is found (Ferguson et al., 2020) . The main focus of such mitigation effects is to alleviate the burden on healthcare systems by spreading out the diffusion of cases over a more extended period of time. While trying to achieve this, governments also face many uncertainties. One such uncertainty involves the absence of proven methods to accurately estimate the potential demand for healthcare services. At the time of this paper's writing, several governments, such as Italy and Spain, already had over 100% health services capacity utilization, while others were about to experience a similar influx of critical patients. It is clear that governments are in need of better understanding the dynamics of the spread for optimal or near-optimal resource allocation decisions. Unfortunately, due to the emergency and the gravity of the pandemic and the lack of scantily found hard evidence cause such decisions to be made through the seat-of-the-pants approaches. Perhaps one of the reasons behind the lack of evidence is that there is no obvious way to map reports and studies pertaining to one country into another. Many regional differences make this mapping and transfer the learnings and knowledge over to another domain particularly difficult. For the case of COVID-19, gender, and age of the patient populations seems to be among the key drivers of such differences. Academics are acting swiftly to enrich the medical literature by reporting their findings on the virus-related population characteristics, diffusion patterns, treatment regimes, case dynamics, hospitalizations, ICU usages, and fatalities. In this paper, we build on the studies and reports that involve age-based clinical fatality risks (CFR), infection fatality risks (IFR), hospitalizations, ICU usages, and fatal outcomes. Using the latest literature as well as expert opinions, we attempt to combine data from different regions in order to estimate and highlight: (i) country-level differences, and (ii) healthcare system demands for individual age groups. Specifically, using the data from six different countries, we study the spread of the virus for different age groups. While it is now known that the virus affects the elderly population more severely than the youngers, studies often report inconsistent results. Several underlying reasons may explain these inconsistencies. Perhaps one of the most plausible reason is the abundance of undocumented cases. In their study, (Li et al., 2020b) highlight that one of the reasons for the rapid spread of the virus is due to no documentation. They estimate that around 86% of all infections were undocumented. Another study from South Korea suggests similar undocumented case percentages at around 55-86% (Kim et al., 2020) . News also suggests that even mortality cases often go unreported. A recent article in The Economist (Fatal flaws, 2020) highlighted stark differences between the number of expected death cases (including those attributed to COVID-19) and the actual death cases. Their estimation, based on regions' normal death rates, suggests the actual death-toll of the novel COVID-19 being more than double of what is being reported in different regions in Italy, Spain, and France. Perhaps this may be one of the reasons for conflicting CFR and IFR figures reported in the literature. While some studies suggested an estimated case fatality risk of as high as 7.2% (Onder et al., 2020) in Italy, other studies suggested a CFR of 3.4% in China (Wilson et al., 2020) , 2.3% using the age-adjusted Diamond Princess cruise ship data (Russell et al., 2020) . More recent works report somewhat lower case fatality rates circa 1.4-1.5%% (Guan et al., 2020; Wu et al., 2020) , both using data from Wuhan. Similar variations also hold for IFR. Studies report IFRs such as 0.5% using the Diamond Princess cruise ship data (Russell et al., 2020) , 0.94%, and 0.657% using Wuhan data (Famulare, 2020) and (Verity et al., 2020) , respectively. Even though these numbers significantly differ from each other, there seems to be (i) convergence to a CFR of 1.5% over time, (ii) CFR/IFR ratio appears to hover around 2-3, indicating as much as 40-70% asymptomatic cases of the virus. One of the apparent reasons behind the differences in reported CFRs is the different demographics in different countries. As the virus affects the elderly more than the young, the virus takes a different toll on each country, depending on its age demographics. There may be several other reasons, including the fact that the age distribution of the infected population may differ from the overall age distribution. Hence, 0 may significantly differ across age groups. For instance, a recent report indicated that the death-toll for under 60-year-old patients in Turkey is as much as four times higher than that of the death-rates in other European countries. As Turkey is one of the younger countries, this may be attributed to the demographical differences across countries or even mobility differences across the same age groups in different countries. In this study, we believe that we make two main contributions to the existing literature. First, we investigate the role of age demographics on identified cases, hospitalization, ICU usages, and fatalities across different countries. And then, by applying conditional probabilities and the existing reports from different countries, we create a country independent conditional probability for each group. Second, by coupling expert-formed scenarios for the US, with agestandardized rates estimated from different countries, we calculate the age breakdown of symptomatic cases, hospitalizations, ICU usage, and fatalities. In this study, we omit some critical analysis, such as taking into account gender differences. In essence, the virus seems to affect males more than females, due to scarcity of specific data, along with making some assumptions. For instance, we assume that deaths, hospitalizations, and ICU usages are proxy measures for COVID-19 spread. We also assume that similar spread patterns apply to each age group across countries-the virus is identical across all countries. We also demonstrate some evidence for the acceptability of these assumptions. The rest of this paper is organized as follows: in the next section, we describe our method and elaborate on the data used. The final section is dedicated to the findings and related discussions. Many of the existing studies focus on modeling the spread of the COVID-19 have been using a few different models such as susceptible-infected-recovered (SIR) and its covariates (mainly SEIR: susceptible-exposed-infected-recovered) (Fang et al., 2020; Liu et al., 2020; Peng et al., 2020; Radulescu and Cavanagh, 2020) , Sidarthe model (Giordano et al., 2020) . These studies often ignore age-dependent variations from one country to another, or are limited to one country, or sometimes two (Ferguson et al., 2020) . In this study, instead of computing the spread of the virus, we look at the results of different scenarios. We base our analyses on expert opinions and the existing data on specific countries. Based on the characteristics of the COVID-19, Center for Disease Control (CDC) estimated that 2.4 to 21 million Americans would require hospitalization, and a death-toll of as much as 480,000 may be expected (Fink, 2020) . According to the same projection, the death toll could be any figure from 200,000 to as high as 1.7 million. Another hospitalizations (as a percentage of reported cases), as well as ICU cases (in terms of cases), and fatalities. One model (Murray, 2020) makes forecasts of the number of days will be needed, while Spain data can be used to calculate this number directly (Ministerio de Sanidad, 2020). After age-adjustments, we use data about hospitalizations as well as ICU usages from Spain. Assuming 10-15 days of average stay (as suggested in the literature), the data are consistent with those reported in (Murray, 2020) . Similarly, estimating fatalities also is difficult. Case fatality rates depend on a number of factors, including: (ii) The delay between the symptom onsets and the time of deaths. (iii) The varying levels of adequacy/inadequacy of the healthcare systems. (iv) The rates of smoking or the prevalence of chronical illnesses. We chose to use CFR of 1.5% for the United States for our analysis. In this study, we use country age distributions (using population pyramids) and data involving different countries to create conditional probabilities. The severity of COVID-19 is also gender-dependent. However, due to the unavailability of data, we did not take gender into account. We report our findings using the mildest of the three scenarios. We use the following notation: We compute other conditional probabilities similarly for events {H, I, D}. We then use the conditional probabilities to simulate the mild scenario breakdowns for the United States. By using population pyramids and the US data, we replicate the same scenario for each individual country and report the results (per 1 million residents). A sample table, including some of the probabilities using the reports by the Spanish Ministry of Health, is given in Table 3 . Using age-corrections via conditional probabilities also shows that reported numbers are quite consistent across-countries (Table 4 and Figure 4 ). We then use country demographics, CDC estimations for the US, and data sets available ( (CDCMMWR, 2020b) population of more than 10 million people. We report all numbers per 1-million for ease of comparison. . (Shim et al., 2020) 4 . (Verity et al., 2020) 5 . (Onder et al., 2020) Studies report different case-and death-related age-breakdowns for a variety of countries. We observed that taking conditional probabilities-based on population age distributions in individual countries-into account, can help mitigate the variability in the reported results ( Figure 2) . Unfortunately, the only source of data that explicitly provided the age breakdowns of hospitalization and ICU cases (P(A|H), and P(A|I)) we could found was Spain (Ministerio de Sanidad, 2020). After comparing the estimated P(H) and P(I)s from Spain data and the data from the Institute for Health Metrics and Evaluation (2020), we concluded the numbers are consistent and decided to use age breakdowns from the Spanish dataset. In this paper, we focus on age-dependent breakdowns of cases, hospitalizations, ICU usages, and fatalities (events) using a range of scenarios. We construct these scenarios by using expert views and existing reports in the literature and based on the US data. We then use conditional probabilities to compute age-standardized breakdowns for the events for all individual countries. Our results propose a few important implications. Firstly, the results highlight the effect of demographical differences across countries on COVID-19 spread. They suggest that, provided everything else remains the same, the death toll difference due to age demographics could be as much as 20 times (Niger vs. Japan, Figure 4) . Secondly, our results have the potential to help decision-makers to accommodate agespecific aspects of the spread. Creating different age-based isolation strategies, depending on the age-demographics of individual countries may be considered. Also, our study attempts to combine several parameters calculated or taken from different academic papers, reports, or data sources together in creating a range of scenarios. While this approach provides a somewhat holistic view of the phenomenon, it also omits other country-level differences such as social isolation policies, prevention strategies, the effectiveness of the individual health-care systems. Our final table (Appendix-1) must be interpreted as a comparison tool for different countries' exposure to the virus. Preliminary Estimates of the Prevalence of Selected Underlying Health Conditions Among Patients with Coronavirus Disease 2019 -United States Severe Outcomes Among Patients with Coronavirus Disease 2019 (COVID-19) -United States COVID-19: four-fifths of cases are asymptomatic, China figures indicate Modeling the COVID-19 outbreaks and the effectiveness of the containment measures adopted across countries Analysis of COVID-19 data on numbers in intensive care from Italy 2019-nCoV: Preliminary Estimates of the Confirmed-Case-Fatality-Ratio and Infection-Fatality-Ratio, and Initial Pandemic Risk Assessment Transmission dynamics of the COVID-19 outbreak and effectiveness of government interventions: A data-driven analysis Worst-Case Estimates for A SIDARTHE Model of COVID-19 Epidemic in Italy Clinical Characteristics of Coronavirus Disease 2019 in China Institute for Health Metrics and Evaluation (2020). IHME | COVID-19 Projections Understanding and Interpretation of Case Fatality Rate of Coronavirus Disease Correcting under-reported COVID-19 case numbers: estimating the true scale of the pandemic Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus-Infected Pneumonia Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV2) The reproductive number of COVID-19 is higher compared to SARS coronavirus Predicting the number of reported and unreported cases for the COVID-19 epidemic in South Korea ActualizaciĆ³n n o 54 Forecasting COVID-19 impact on hospital bed-days, ICU-days, ventilatordays and deaths by US state in the next 4 months The COVID-19 Pandemic in the US: A Clinical Update Case-fatality rate and characteristics of patients dying in relation to COVID-19 in Italy Epidemic analysis of COVID-19 in China by dynamical modeling Management strategies in a SEIR model of COVID 19 community spread Estimating the infection and case fatality ratio for COVID-19 using age Transmission potential and severity of COVID-19 in South Korea Estimates of the severity of coronavirus disease 2019: a model-based analysis Case-Fatality Risk Estimates for COVID-19 Calculated by Using a Lag Time for Fatality Estimating clinical severity of COVID-19 from the transmission dynamics in Wuhan, China COVID-19's death toll appears higher than official figures suggest | Graphic detail | The Economist