key: cord-0792838-ed98nbty authors: Yang, Liren; Qi, Cuifang; Yang, Zixuan; Shang, Li; Xie, Guilan; Wang, Ruiqi; Sun, Landi; Xu, Mengmeng; Yang, Wenfang; Chung, Mei Chun title: Socio-Economic Factors Affecting the Regional Spread and Outbreak of COVID-19 in China date: 2021-07-03 journal: Iran J Public Health DOI: 10.18502/ijph.v50i7.6620 sha: d96e62f1b2ee77711cd75b8c3d7b0ef1b2f8b0d5 doc_id: 792838 cord_uid: ed98nbty BACKGROUND: This study investigated the impact of socio-economic factors on the spread and outbreak of COVID-19 based on Chinese data. METHODS: Cumulative confirmed cases were collected and divided into the First-stage cases cluster dominated by imported cases, and the Second-stage cases cluster dominated by secondary cases, according to the time of emergency state and Wuhan city lockdown. The linear regression was used for data analysis. RESULTS: A total of 12,877 cases in 30 provinces were analyzed in the study. The First-stage cases cluster included 675 cases and Second-stage cases cluster included 12,202 cases. The socio-economic factors were significantly associated with the cases (P<0.05). The GDP and proportion of population moving out of Wuhan were associate with the First-stage dominated by imported cases (β>0, P<0.05). The First-stage cases cluster, proportion of population moving out of Wuhan and urban population were associate with the Second-stage dominated by secondary cases (β>0, P<0.05). CONCLUSION: Socio-economic factors had impacts on the spread and outbreak of COVID-19. The combination of different socio-economic indicators at different stages of the epidemic may help control the epidemic. Since Dec 2019, novel coronavirus disease emerged in the world and has been become a global pandemic. In the early stage, Wuhan City, Hubei Province of China reported confirmed cases (1) . COVID-19 spread and broke out across China rapidly during a few days before the Spring Festival. The WHO declared that COVID-19 was a Public Health Emergency of International Concern on February 5 th , 2020 (2) . As of Jan 2 nd , 2021, COVID-19 has swept more than 200 countries, and the cumulative global confirmed cases was more than 84,000,000 and more than 1,800,000 patients (including health-care workers) died, of which the United States, India, Brazil, Russia, France and Britain were more serious (https://ncov.dxy.cn/ncovh5/view/pneumonia). To prevent the further development of COVID-19, most countries declared a state of emergency and took a lockdown, residents segregated themselves at home. The pandemic has brought a huge blow to social stability, global economy and a great threat to people's health. In the early stage, people lacked enough knowledge in characteristics of the novel coronavirus and its' early situation. COVID-19 spread across China along with population migration and transportation before the Spring Festival. As Jasmina suggested, we also ought to view socioeconomic driving factors as parts of a puzzle that needs to be built to give the full picture of how best to tackle COVID-19 spread (3) . But few researches about COVID-19 paid attention to the socio-economic driving factors for the spread and outbreak of the epidemic. Epidemic diseases are not only medical but also social events, it's necessary to involve socioeconomic factors in analysis (4, 5) . We found that the distribution of COVID-19 in China was quite different among provinces except for Hubei province (the epidemic area). This distribution seemed to be related to the economic conditions among provinces. Research also found that Chinese GDP growth are closely related to the incidence of Class B notifiable diseases (5) . We hypothesized that socio-economic factors may influence the spread of COVID-19. And the transportation network has been an important factor in the spread of SARS, MERS and so on (6, 7) . But are there any other regional characteristics affect the spread and development of COCID- 19? In this study, we aimed to investigate the relationship between socio-economic factors and the spread of COVID-19 and establish quantitative relationships between them. We hoped our findings could reveal the important factors affecting the spread of COVID-19 and help to prevent and control the epidemic. A data analysis among 30 provinces and 100 cities in China was conducted to explore the impact of socio-economic factors (economy, population and transportation) on the spread and outbreak of COVID-19 across China, based on the actual situation on population movement during Spring Festival Travel Rush (the annual population movement from workplace to domicile for family reunion during Spring Festival) and epidemic control. The National Health Commission of the People's Republic of China has determined that data collection and analysis of cases and close contacts of COVID-19 to be part of a continuing public health outbreak investigation and were thus considered exempt from institutional review board approval (8) . The public data of confirmed cases (n=78 064) regarding COVID-19 as of 24:00 February 25, 2020, was collected from the National Health Commission of the People's Republic of China (http://www.nhc.gov.cn). Socio-economic factors were obtained from China Statistical Yearbook(2019) of the National Bureau of Statistics of China (http://www.stats.gov.cn/tjsj/ndsj/) and Chinese Baidu Map (https://map.baidu.com) that were recorded as A. Gross Domestic Product (GDP, Ten Billion Chinese Yuan, refer to final output results of the economic production activi-ties in a certain period (2018) in a region). B. Passenger Traffic (Ten Thousand Persons, refer to the actual number of passengers transported by railway, highway and waterway within a certain period (2018) in a region). C. Passenger Traffic of Railways (Ten Thousand Persons). D. Passenger Kilometers (100 Million Passenger-Kilometer, refer to the product of the number of passengers was transported and the average distance traveled by region in a certain period (2018)). E. Passenger Kilometers of Railway (100 Million Passenger-Kilometer). F. Urban Population (Ten Thousand Persons, refer to the total number of people living in cities and towns (2018) of a region). G. Population Density (population number Per Square Kilometer, refers to the number of population (2018) per unit area (square kilometers)). H. Distance (Kilometers, refer to the shortest highway mileage (January 2020) from a region's administrative Center to Wuhan. I. The Proportion of Population Moving Out of Wuhan (percentage (%), the ratio of the population moved out from Wuhan in provinces or cities divided by the total number of people moved from Wuhan from Jan 1 to Jan 23, 2020). The data of Statistical Yearbook (2019) in the research belong to 2018, and the latest data in 2019 has not been released. Hence, socioeconomic factors of China Statistical Yearbook on GDP, Passenger Traffic, Passenger Kilometers, Urban Population and Population Density in the recent five years were collected and were used to analyze the stability of data in Yearbook and representativeness of the data in 2018. The main analysis was conducted in 30 provinces of China. Taiwan, Hong Kong and Macao were excluded because of special management policy, and Hubei province was excluded as an epidemic area. Then the data verification among100 cities was conducted (including 85 cities out of Hubei province and 15 cities in Hubei province except for Wuhan city). Data on GDP and the proportion of population moving out of Wuhan were fully collected (100%) in all cites. The rest factors could not be fully collected. The actual imported cases and secondary cases among provinces cannot be obtained. The total cases were divided into the First-stage cases cluster dominated by imported cases and Secondstage cases cluster dominated by secondary cases. The cases cluster was dominated by imported cases. In the early stage of COVID-19 before strictly epidemic control, the cases out of Hubei province were mainly imported cases from epidemic areas, which determined the later epidemic development. On Jan 23, 2020, China announced the lockdown of Wuhan city. With the majority of provinces under the state of emergency of the highest level in China, people began to be quarantined at home and the cases import were almost stopped after January 23, 2020. Combining the median incubation period of COVID-19 (4.8 days) and the median days from the onset to the diagnosis (1 day) after Janu 22 (8, 9) , the accumulative confirmed cases as of 24:00 Jan 29 among provinces were regarded as First-stage cases cluster in the study. The cases cluster was dominated by imported cases, which closely related to the contact with imported cases. On Feb 19, a related apartment in China formulated "Relevant Opinions on Adjusting the Emergency Response of COVID-19 Prevention and Control at the Region below Provincial Level", and more than 80% of provinces have completed the production resumption on Feb 26. So, the accumulative confirmed cases from Jan 29 to Feb 25 were regarded as Secondstage cases cluster in the study. The National Health Commission of the People's Republic of China has determined that data collection and analysis of cases and close contacts of COVID-19 to be part of a continuing public health outbreak investigation and were thus con-sidered exempt from institutional review board approval. R software, version 3.6.2 was used for statistical analysis. The Scatter plot was used to analyze the trend between cases and socio-economic factors. Natural log (Ln) transformation in population density was performed (data were not shown). Then linear regression analysis was adopted to explore the associations between the socioeconomic factors and cases. Collinearity analysis was used to examine the multicollinearity. The best subset model selection method was used to find a suitable multi-factor model, and the goodness-of-fit was validated. P value < 0.05 was considered as statistically significant. The outliers were excluded in sensitivity analysis to verify the robustness of the results. The specific data processing and analysis are shown in Fig. 1 . As of 24:00 Feb 25, 2020, there were 78,064 cases of COVID-19 in China and 12,877 cases in 30 provinces. First-stage cases cluster included 675 cases and Second-stage cases cluster included 12,202 cases (Fig. 2) . After adjusted for the Proportion (proportion of population moving out of Wuhan), the GDP, passenger traffic, passenger traffic of railways, passenger kilometers, passenger kilometers of railway and urban population were also positively associated with Total cases and Second-stage cases cluster. Except for the passenger kilometers of railway, population density and distance to Wuhan, the rest factors were still positively associated with First-stage cases cluster. After adjusted for the Proportion and First-stage cases cluster, the GDP, passenger traffic of railways, passenger kilometers, urban population were positively associated with more cases of total cases. In sensitivity analysis for the First-stage cases cluster, the trend of results was consistent with original analysis adjusted for Proportion. The suitable multi-factors models are shown in Table 2 This research explored and quantify the influence of socio-economic factors on the COVID-19 situation for the first time. The results showed that socio-economic factors had important impacts on the confirmed cases. Model 2 (urban population, Proportion and First-stage cases cluster), Model 1 (GDP and Proportion) and Model 2 had more impact on the total cases, the Firststage cases cluster (dominated by imported cases) and Second-stage cases cluster (dominated by secondary cases), respectively. As of Dec 2020, the cumulative confirmed case was more than 84,000,000 and died case was more than 1,800,000 worldwide. The global pandemic of COVID-19 brought multiple impacts, such as economic recession, the stock market fell, industrial chain interruption, commodity supply difficulties. It has been estimated economic losses caused by a major influenza as high as $7.3 trillion-a downturn on par with the Great Depression (10) . Researchers are making contributions to control COVID-19. However, researches mainly focused on clinical characteristics, treatment and epidemiological study (1, 9, (11) (12) (13) , less discussed the impact of socio-economic drivers on COVID-19 but mitigating the risks from infectious diseases requires preemptive measures against their social drivers. We found that Model 1 combined GDP and Proportion was closely related to the First-stage cases. GDP is a core indicator of regional productivity, could effectively reflect regional infrastructure (such as transportation), government management and industrial structure. A region with higher GDP represents a higher level of salary, transportation development and tertiary industry. The demand for talents and labor force is greater (14) (15) (16) , then the imported cases from epidemic area are more. Researches confirmed that population movement sets a condition for the spread of infection, which was consistent with SARS and MERS, the primary mechanism for the spread of infection from a city to other cities is population movement (4, 17) . This explains well the great effect of model 1 on the First-stage cases cluster. Although Model 3 (R 2 =86.6%) also significantly related to Firststage cases cluster with better goodness of fit than model 1. But GDP as a comprehensive indicator, it is more proper to explain the influence compared to other factors such as passenger traffic of railways population in Model 3. The results of the sensitivity analysis for Firststage cases cluster after excluding Zhejiang and Guangdong province were similar to the primary results. Zhejiang and Guangdong province were the top 2 in China's GDP for many years, the mobile population accounted for 1/3 in total provincial population. Guangdong province concentrated the cities of Guangzhou, Shenzhen and Zhuhai with huge scale of economy and population (18, 19) , which contributed to increased risk of COVID-19. For Zhejiang provinces, the industry of manufacturing, traffic and logistics are developed, many people are businessmen with frequent traveling. It is reported that 33,000 people of Wenzhou city (a city in Zhejiang province) returned from Wuhan and surrounding areas (20) . And businessmen had more social activities were more likely to be infected (21) , which may explain so many cases in Zhejiang Province. Results pointed that model 2 combined urban population, Proportion and First-stage cases clus-ter was considered suitable to the Second-stage cases cluster. The Proportion and First-stage cases represent the basis of cases import that influenced the further development of epidemic. Urban population also played an essential role in epidemic development and reflects region's population density and mobility. As urbanization deepens, more population is concentrated in cities, which provides a hotbed for infection such as the outbreak of Ebola virus in African cities, SARS in Hongkong and Pearl River Delta in China (10) . But finally, the population density did not make sense in the study, the difference may be caused by that the population density was calculated in units of provinces rather than cities. In addition, the higher prevalence of other diseases in cities also increases vulnerability to infection. This also explains why the patients of COVID-19 are mostly elderly (9) . Overall, the risk of epidemic is exacerbated by the concentration and interconnectedness of human populations associated with urbanization. We found model 2 combined urban population, Proportion and First-stage cases also had more influence on total cases across 30 provinces. Model 1 influenced the cases import mostly, which impacts the spread of epidemic at the later stage. As a result of timely measures taken in China, the imported cases accounted for a small proportion and the secondary cases accounted for a greater proportion in the total cases. This partly explains why model 2 closely related to the total cases but model 1. Based on these results, we put forward the simple and visual influencing factors on epidemic spread. For regions suffering from epidemic that couldn't control the imported cases well, related agencies should be taken to strengthen monitoring and management for population flow, especially the region with high GDP. For regions control the case's import of epidemic well, more strict measures should be adopted to curb the population contact to prevent further outbreak of epidemic, while to control the cases import from other epidemic areas. In the future, when a new epidemic emerges, at the initial stage dominated by imported cases, related departments should combine population monitoring from epidemic area and visual GDP to predict the spread of an epidemic, promote regional prevention measures, restrict population import as early and more strictly. At the later stage dominated by secondary cases, the areas with many imported cases and urban populations should strengthen inner population contact control to curb the further outbreak of the epidemic. The study explored and quantified the impact of socio-economic factors on COVID-19. And the data in the research was from authority agency, which reduces the error. The study promoted simple and visual indicators to prevent and control the epidemic. It is helpful to avoid the loss in life and health due to insufficient prevention and control of epidemic. It is not only provided phased and regional prevention and control suggestions for the prevention and control of COVID-19, but also provided prevention suggestions for epidemics in the future. But there were limitations in the study. First, some socio-economic factors were not the actual data during the epidemic of COVID-19. But the analysis of the recent 5 yr data showed that data in Statistical Yearbook of different yr are highly correlated. We will conduct more analysis when data be updated. Second, the sample number of provinces was small, which affected the robustness of analysis results. But there are only 34 provincial administrative units in China, and socio-economic factors in smaller administrative units may be incomprehensive. Hence, the analysis in 100 cities was adopted for verification analysis. We hoped to obtain more comprehensive data for analysis in the future to improve the robustness of the results. Socio-economic factors had an important impact on the COVID-19 situation and should be considered to prevent and control the COVID-19 and an emerging epidemic. Socio-economic factors contribute to the precise, phased and regionalized prevention and control measures. Ethical issues (Including plagiarism, informed consent, misconduct, data fabrication and/or falsification, double publication and/or submission, redundancy, etc.) have been completely observed by the authors. Clinical features of patients infected with 2019 novel coronavirus in Wuhan Coronavirus disease (COVID-19) pandemic Can mathematical modelling solve the current Covid-19 crisis? Ecosocial processes influencing infectious disease emergence and spread Multivariate time series analysis on the dynamic relationship between Class B notifiable diseases and gross domestic product (GDP) in China Modeling the worldwide spread of pandemic influenza: baseline case and containment interventions The role of the airline transportation network in the prediction and predictability of global epidemics Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus-Infected Pneumonia Epidemiological and clinical features of the 2019 novel coronavirus outbreak in China. medRxiv Economic growth, urbanization, globalization, and the risks of emerging infectious diseases in China: A review Microneedle array delivered recombinant coronavirus vaccines: Immunogenicity and rapid translational development Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding Treatment with convalescent plasma for critically ill patients with SARS-CoV-2 infection Relating traffic fatalities to GDP in Europe on the long term Short-term and long-term effects of GDP on traffic deaths in 18 OECD countries Bright lights, big city: Causal effects of population and GDP on urban brightness The Threat and Response to Infectious Diseases (Revised) Characterizing a large outbreak of dengue fever in Guangdong Province Spatiotemporal analysis of indigenous and imported dengue fever cases in Guangdong province Exploration and application of rapid risk assessment method in prevention and control of COVID-19 in urban areas: a case study based on data of Wenzhou Epidemiologic and Clinical Characteristics of 91 Hospitalized Patients with COVID-19 in Zhejiang, China: A retrospective, multi-centre case series Thanks to all of those who participated in the study, without whom the work would not have been possible. This study was funded by the Na- The authors declare that there is no conflict of interest.