key: cord-0903736-9ipykkh6 authors: Samany, Najmeh Neysani; Toomanian, Ara; Maher, Ali; Hanani, Khatereh; Zali, Ali Reza title: The most places at risk surrounding the COVID-19 treatment hospitals in an urban environment- case study: Tehran city date: 2021-08-30 journal: Land use policy DOI: 10.1016/j.landusepol.2021.105725 sha: e3052256965e24dff4409de3ea52eb294dfae4fe doc_id: 903736 cord_uid: 9ipykkh6 Investigations on the spatial patterns of COVID-19 spreading indicate the possibility of the virus transmission by moving infected people in an urban area. Hospitals are the most susceptible locations due to the COVID-19 contaminations in metropolises. This paper aims to find the riskiest places surrounding the hospitals using an MLP-ANN. The main contribution is discovering the influence zone of COVID-19 treatment hospitals and the main spatial factors around them that increase the prevalence of COVID-19. The innovation of this paper is to find the most relevant spatial factors regarding the distance from central hospitals modeling the risk level of the study area. Therefore, eight hospitals with two service areas for each of them are computed with [0–500] and [500–1000] meters distance. Besides, five spatial factors have been considered, consist of the location of patients’ financial transactions, the distance of streets from hospitals, the distance of highways from hospitals, the distance of the non-residential land use from the hospitals, and the hospital patient number. The implementation results revealed a meaningful relation between the distance from the hospitals and patient density. The RMSE and R measures are 0.00734 and 0.94635 for [0–500 m] while these quantities are 0.054088 and 0.902725 for [500–1000 m] respectively. These values indicate the role of distance to central hospitals for COVID-19 treatment. Moreover, a sensitivity analysis demonstrated that the number of patients’ transactions and the distance of the non-residential land use from the hospitals are two dominant factors for virus propagation. The results help urban managers to begin preventative strategies to decrease the community incidence rate in high-risk places. December formerly dispersal at a very fast rate (WHO, 2019; WHO, 2020) . In different geographical areas, the pandemic seems to be scattering at fluctuating rates. COVID-19 spreads more quickly than other breathing diseases (e.g., SARS Severe Acute Respiratory Syndrome) (Coronavirus disease, 2019; Bhaganagar and Bhimireddy, 2020) . According to the number of confirmed cases till 20 January 2021, Iran is the 16 th country in the world in terms of infection with 1,3320,000 infected people and 57,057 deaths (Organization, 2020) . Tehran, the Capital of Iran as a hob of different cities, has a high statistical prevalence of COVID-19 due to its large population and high volume of daily traffic (Rahmatinia et al., 2020) . The COVID-19 pandemic, as a community health reserve, has transported to light many tasks in survives and livings in Tehran. Presently, control of the COVID-19 epidemic is a universal J o u r n a l P r e -p r o o f concern and has become a crucial challenge in Iran (Asadzadeh et al., 2020) . Among different challenges on the COVID-19-episode, detection of involving spatial criteria could help us to prevent form more infection. The first-place people will refer to after the disease is the hospitals. According to the studies, hospitals are the most susceptible places to coronavirus infection. They indicated that the risk of COVID-19 infection would increase with hospitalization or contact with people admitted to hospitals and medical centers (Han et al., 2021; Ertem et al., 2021; Sun et al., 2021; Mohammadi et al., 2021; Arman et al., 2021) . Ertem et al. (2021) indicated that keeping social distances from polluted people is the main aid for infection prevention. Arman et al. (2021) stated that the low distance to hospitals centered for COVID-19 will increase the possibility of COVID-19 incidence. Based on Tobler's first law of geography (Tobler, 1970) , the nearer locations to the hospitals, the more prone area to the virus infection due to more dense population and movement compared to the distant sites. Therefore, assessing urban infrastructure, people activity (financial transaction as an index of patients' sign), and the number of patients in each hospital could specify the main criteria on COVID-19 propagation in different distances from the hospitals. The focus of this paper is to find the most places at risk surrounding the hospitals using a multilayer perceptron artificial neural network. The main contribution is discovering the influence zone of the COVID-19 treatment hospitals and the main spatial factors around them that increase the prevalence of COVID-19 in Tehran. The innovation of this paper is related to finding the most relevant spatial factors regarding the distance from central hospitals and modeling the risk level of the study area due to COVID-19 propagation via MLP-ANN. Thus, eight leading hospitals encountered with COVID-19 with two service areas for each of them have been J o u r n a l P r e -p r o o f computed with [0-500] and [500-1000] meters spread. Also, five spatial factors have been considered, including the location of patients' transactions, the distance of streets from hospitals, the distance of highways from hospitals, distance of the non-residential land use from the hospitals, and the number of patients in the hospitals. The involving factors have been considered based on the expertise knowledge (25 experts include 12 urban planners, 7 physicians, and 6 GIS specialists). The proposed method utilized an AI-based approach to detect the association between derived spatial urban factors and predict the risky zones around the hospitals dynamically. Different studies evaluated the relationship between environmental and infrastructural factors of COVID-19 outbreak cities using data mining approaches Saez et al., 2020; Wu et al., 2020; Velásquez and Lara, 2020) . From the knowledge discovery perspective, recent studies present a wide range of research on the COVID-19 infection in urban areas. Mele and Magazzino (2020) assessed the association between financial evolution, contaminating emanations, and COVID-19 deaths, discovery a fundamental link among PM 2.5 , CO 2 , NO 2 releases, and COVID-19 deaths. They used time series and D2C algorithms and concluded that a specified pollution concentration produced by financial progress could increase COVID-19 contamination via production of the respirational system more vulnerable to contamination. Their results provide new strategies for controlling COVID-19 prevalence in India. Wan (2020) analyzed the events and travel performance of urban residents as the key for the deterrence and control of epidemic situations due to an improved spatiotemporal trajectory data mining method. Through the applied approach, a lot of valued travel info can be derived and calculated. The J o u r n a l P r e -p r o o f regular behavior of separate users and the spatial distribution features of group users' measure can be explored. The consequences of the implemented method revealed the traffic-congested roads as an efficient aid in preventing more infection. Li et al. (2020) used a spatiotemporal design of depressing symptoms produced by COVID-19 by means of the social media and Correlation Explanation (CorEx) learning algorithm. They investigated a robust association amid stress symptoms and the number of enlarged COVID-19 cases for major U.S. cities. The achieved results showed that people's risk insight is sensitive to the release of COVID-19 related media messages and public news. Pansini and Fornacca (2020) studied the spatial appeal of COVID-19 and its association with different ground indexes of air quality in China, Iran, Italy, Spain, France, Germany, the UK, and the USA. The proposed algorithm investigated social media data to compute stress indicators underneath COVID-19 pandemic spatiotemporally. The consequences proved more viral contaminations in the areas distressed by high PM 2.5 and NO 2 values. Rajendran and Jayagopal (2020) studied the impact of lockdown on densely populated districts using SIR and SIER as regression-based methods. Their achievement demonstrated that three restrictions, including the lockdown, the social distancing, and putting on facemask, could control the wide prevalence. Also, they assessed the age/gender-wise social distancing. The critical findings were to execute lockdowns that defeat the broadcast of the diseases and insulate persons with comorbidity. Pourghasemi et al. (2020) monitored COVID-19 outbreak trend in Iran (from 19 February to 14 June, 2020) using random forest (RF). Alteration discovery of the COVID-19 risk maps with a chance forest model specified that the utmost key urban components were the distance from bus stations, mosques, bakeshops, hospitals, ATMs (automated teller machines), banks, and the least temperature of the coolest month. Sugg et al. (2021) developed a GIS-based multivariate analysis and identified risk issues of COVID-19 cases in the nursing J o u r n a l P r e -p r o o f home including population density, per-capita pay, average domestic size, and smaller configuration. Guo et al. (2021) assessed the relationships amid COVID-19 occurrence and diurnal temperature, close humidity, and wind speed using a distributed lag non-linear model. The results showed a stronger relationship with temperature relative to humidity or wind speed. An inverse association was recognized between the COVID-19 frequency and temperature. Sannigrahi et al. (2020) investigated the socio-demographic factors' impacts on the global casualties caused by the COVID-19 using geographically weighted regression (GWR). They applied the global and local spatial relationships between the main variables and COVID-19 cases and deaths in the European and selected total population, poverty, and income as the most effective criteria for COVID-19 outbreak. Mansour et al. (2021) derived four fundamental factors in increasing the COVID-19 incidence rates geographically. These factors contain global ordinary least squares (OLS), spatial lag, and spatial error regression models (SLM, SEM), also two local regression methods include geographically weighted regression and multiscale geographically weighted regression (MGWR). They concluded that the local models could express the dynamic relationships amongst variables. To clarify the applied data mining methods, the assessed factors and study area are summarized in Table 1 . The investigation of Table 1 revealed that the role of proximity to the hospital and influential spatial factors around them on the COVID-19 outbreak did not assess until now. However, the primary spatiotemporal maps of COVID-19 outbreaks show that hospitals with COVID-19 infected people are the primary source of virus propagation. This fact could be proved by considering two service areas for the central hospital allocated for COVID-19 treatments and assessing different criteria in the movement of infected people and disease vectors. As a result, introducing an influence zone for hospitals and introducing involving spatial factors leads to a better decision-making process to manage the COVID-19 pandemic. To our knowledge, this is the first research, which applies a GIS-based approach using AI to study spatial patterns of disease incidence. The consequences support the urban managers to initiate some new strategies to decrease the community incidence rate in the high-risk zones. The key idea of this research is a plan of hospitals' influence zones and involving spatial factors of the COVID-19 outbreak. In this regard, Tehran is a high-populated city with central hospitals to treat of COVID-19 infected people. This section describes the study area and the procedure of the proposed method. Tehran, is the Capital of Iran with a population of around 8 million people. It is ranked as the 24 th populated metropolitan in the world. The city is separated into 22 urban districts. Most of the city districts are located in Tehran County's central district, while two districts are located in the counties of Shemiranat and Ray (Tehran, 2021) . More than 200 large clinics and hospitals in Tehran, of which around 140 hospitals accept COVID-19 patients in the study period. Three other medical universities supervise these hospitals in the city and suburbs. Due to many logistics limitations, at the beginning of the pandemic, many people from the different country provinces proceed to these hospitals. However, after designing and implementing a hospitalization protocol, there is a normal distribution in the cities. Less patient is moving from a town to the Capital, which leads to a more accurate statistic. In this research, the top eight hospitals in COVID-19 treatment, located in the city core and eliminated hospitals in the suburbs, have been considered. The hospitals in the major/minor roads have been selected and instead of using the hospitals with a specific purpose (e.g., maternity), the general hospitals have been assessed. The main reason for choosing these eight hospitals was the highest number of patients. Other hospitals received lower COVID-19 patients significantly in the study period in Tehran. Fig. 1 shows the study area and the considered hospitals for assessing the pattern of the COVID-19 outbreak. The chief idea of this paper is to assess the influence zone of the hospitals and involving spatial factors of the COVID-19 outbreak in an urban zone. To attain this goal, an artificial neural network with multi-layer perceptron architecture is developed since they could detect unknown patterns that shares identical distinctive features. Based on this framework, the proposed method is described as depicted in Fig. 2 . The main spatial criteria considered as input are as follows: distance from main streets, distance (1) Based on the normalized maps, the training data are selected and they are classified into train, validation, and test data considering 70%, 15%, and 15% of all data correspondingly. The multilayer perceptron is designed to assess the relationships between the input and target values. In this paper, two MLP networks consist of an input layer (with five neurons), one hidden layer (with ten neurons), and an output layer (with one neuron) are designed. Where each layer is completely organized with weighted associates to the consequent layer. These networks convert inputs to outputs over a sigmoid function. The output of each network is computed by the activation function as Eq. 2: Where stands for the activation function, x h is the value of the neurons in layer h and W ho is the interconnection between h th hidden layer node and o th output layer node. The activation function is considered as Eq. 3: The activation level of the nodes in the hidden layer is calculated similarly. An error is defined According to the differences between the calculated output and the target values Eq. (4): Where n is the number of patterns in the data set and L stays the number of output nodes. The process tries to decrease the error by regulating the interconnections between layers. The weights are tuned using a gradient descent backpropagation (BP) algorithm. During the training process, MLP begins with an accidental set of initial weights, and formerly training continues till the set of W ih and that of W ho is improved to a predefined error threshold is met between x o and t o (Altun & Gelen, 2004) . To assess the reliability of the designed networks, the number of hidden neurons is changed and the best design (based on least RMSE, and best performance) will select for valuation. The proposed algorithm is implemented in the study area to assess the influence zones of the hospitals. The two designed MLPs have been programmed and accomplished in MATLAB 2016 on a computer with an Intel Core i7 @ 3.20 GHz Processor with 4 GB RAM. For each selected hospital, two specific service areas within [0-500] and [500-1000] meters distance derived from the road network. Fig. 3(a) , represents the distribution of the existing and selected hospitals in the study area. As the main factors, the location of transactions around the hospitals is illustrated in Fig. 3(b) . Also, the density map of quarantined during June-August 2020, is depicted in Fig. 4 . First, the density map of COVID-19 quarantined patients is shown in Fig. 4(a) , then 'distance to residential land use', 'distance to non-residential land use', 'distance to the main street', 'distance to highways' are illustrated in Fig.s (b-e) . Finally, the density map To validate the robustness of the designed MLP, the number of hidden neurons considered as 8,10, and 12, while the best results due to the RMSE, R, and performance curve is related to 10 neurons in the hidden layer. The results of the proposed method in the first scenario (within [0-500] meters distance) are depicted in Fig. 5 (a-h) for eight hospitals. The hospital names are omitted due to privacy reasons. Hospital#8. The R-values of training, validation, test dataset, and the RMSE of the implemented ANN within [0-500] meters are considered as described in Table 2 . J o u r n a l P r e -p r o o f 20 Fig. 6 shows the relationships between the input and target in the first scenario. According to the R-value, the stronger relations between the input parameters and target are related to the 'Hospital#1', 'Hospital#3', and 'Hospital #4'. The weaker relation is associated with 'Hospital#6'. It means that by decreasing the distance from 'Hospital#1', 'Hospital#3', and 'Hospital #4', the role of non-residential land use and patients' transaction has been intensified. Hospital#8. The R-Values of training, validation and test dataset, also the RMSE of the implemented ANN within [500-100] meters, are considered as described in Table 3 . The results of the proposed method in the second scenario (within [500-1000] meters distance) are depicted in Fig. 9 for eight hospitals. According to the trained MLP, the output risk map of COVID-19 is generated as depicted in Fig. 10 . The purpose of sensitivity analysis is to determine the impact of each of the input parameters on the output. Indeed, the sensitivity analysis indicates the influence of uncertainty sources (input and model) on the output uncertainty (Neysani Samany et al., 2009; Rezaei et al., 2020; Nadizadeh Shorabeh et al., 2020) . In each stage, one of the factors is omitted, and the R-value has been computed as it has been depicted in Fig. 11 . In other words, by omitting one factor, the designed MLP is iterated and the R-value shows the significance of that factor. The greater the quantity of decline, the more significant factor. J o u r n a l P r e -p r o o f In this study, the focus was to determine the influence zones of the central hospital for COVID-19 treatment, and the association between the spatial risk factors and location of home patients have been assessed. The spatial risk factors include the location of patients' transactions, the distance of streets from hospitals, the distance of highways from hospitals, the distance of the non-residential land use from the hospitals, and the number of patients in the hospitals. The proposed approach has some specific characteristics that make it applicable in different provinces in Iran, also in other countries worldwide. The first key feature of this study is related to defining two service areas for each hospital, which provides a comparative framework for assessing the effect of closeness to the hospitals on COVID-19 incidence. Furthermore, considering more service areas or using gradual distance The other promotion of this study is referred to the method of implementation, which used the MLP-ANN with sigmoid activation function as it can discover the nonlinear relationships between the risk factors and COVID-19 spatial statistics. At the same time, the sensitivity J o u r n a l P r e -p r o o f analysis specified the more decisive risk factors of COVID-19 incidence. Another advantage in comparison with other similar studies is network service areas instead of the raster data model. However, the style of current research in which people travel through a road to reach a specific location such as land use (e.g., fast food, supermarket), requires a network analysis. In addition, the proposed method can predict the geographical distribution of patients in the study area according to the designed input and trained ANN. Though the compensations of the proposed method are evident, there are some shortcomings to this research that should be considered in future studies. The use of financial transactions has some challenges. One significant issue is the duration of stay of suspicious people in the financial transaction location. According to medical studies, the duration time of contact has a major effect on the infection spread (Ashcroft et al., 2021) . The more transaction in a location, the higher the probability of space. Another challenge of using financial transactions during the hospitalization period is using the Bankcard by accompanying and not the patient. This might be correct in some cases, but in reality, the patient was with the family during the disease incubation period and all of them might be a disease vector. Moreover, a similar weight was given to all location/ land uses to simplify the calculation. However, as an example, a location in a more population-dense area can be at higher risk rather than a site in a less populated place. Another dimension of limitation in this study is accessing diverse datasets. Although we have had a complete dataset from the headquarters in the study area for COVID-19, still a number of datasets can improve the results. An important dataset is the traffic dataset that can help to determine the dynamic population in the study area. Such datasets can be derived from cellphone J o u r n a l P r e -p r o o f tracking datasets but accessing a mobile dataset is almost impossible for the study area due to privacy reasons. More detailed information about the land use datasets can be counted as other important resources to enrich the research. In this study, the focus was the location of land uses. Yet, additional attributes help get better results, such as the number of employees in each workplace showing the spread density in hotspots. As a result, there are other parameters and effecting factors that can be selected as a weight for any of these models and were skipped due to the complexity of the prototype and data access limitations. This paper aimed to find the most places at risk using the multi-layer perceptron artificial neural network technique in Tehran city. Eight hospitals with two service areas with 500 and 1000 meters for each location, and five spatial factors have been considered (location of patients' transactions, distance of streets from hospitals, distance of highways from hospitals, distance of the non-residential land use form the hospitals and number of patients in the hospitals). The implementation results revealed a meaningful relation between the distance from the hospitals and patient's density in the study area. In addition, the sensitivity analysis consequences demonstrated that the number of patients' transactions and the distance of the non-residential land use from the hospitals are two dominant factors for virus propagation. However, the distance of streets from hospitals and the number of patients in the hospitals have the less influence on the COVID-19 outbreak. The achieved consequences established the ability of the proposed technique to develop for big data as it clusters the vast data into several divisions in different distances. The consequences will support the urban managers to initiate some new strategies to decrease the community J o u r n a l P r e -p r o o f incidence rate, such as: (1) according to the critical role of urban land use, some new restrictive rules for unnecessary land uses due to the closeness to hospitals should be determined. These rules should be intensified based on the number of patients in a hospital, and (2) the spots with a high density of financial transactions of COVID-19 patients should be recognized and restricted. Finally, the financial transaction location can be used as a measure to track patients' behavior besides personal cellphone location monitoring records. More into details, instead of tracking people's mobile movement, individual payment location records may track suspicious people and monitor the patients during/ after quarantine. This is due to the sensitivity of mobile location records and more restriction to access such datasets. In addition, the land uses where the purchase is located can be investigated more in detail to define different scenarios such as lockdown and/or limitations for each commercial union. Enhancing performance of MLP/RBF neural classifiers via a multivariate data distribution scheme Risk factors of developing critical conditions in Iranian patients with COVID-19 Seismic vulnerability assessment of urban buildings and traffic networks using fuzzy ordered weighted average Information technology in emergency management of COVID-19 outbreak Quantifying the impact of quarantine duration on COVID-19 transmission. eLife Local atmospheric factors that enhance air-borne dispersion of coronavirus -High-fidelity numerical simulation of COVID19 case study in realtime Vulnerability mapping and risk analysis of sand and dust storms in Ahvaz COVID-19 incidence in 190 countries: An observational study Spatial distribution characteristics of the COVID-19 pandemic in Beijing and its relationship with environmental factors Combined measures to control the COVID-19 pandemic in Wuhan Spatial-temporal analysis of urban environmental quality of Tehran Multi-Layer Perceptron Artificial Neural Network Based IoT Botnet Traffic Classification Modeling spatiotemporal pattern of depressive symptoms caused by covid-19 using social media data mining The relationship between air pollution and COVID-19-related deaths: An application to three French cities Sociodemographic determinants of COVID-19 incidence rates in Oman: Geospatial modelling using multiscale geographically weighted regression (MGWR) Pollution, economic growth, and COVID-19 deaths in India: a machine learning evidence Artificial neural network and logistic regression modelling to characterize COVID-19 Spatial modeling of areas suitable for public libraries construction by integration of GIS and multi-attribute decision making: Case study Tehran Developing a SINTACS-based method to map groundwater multi-pollutant vulnerability using evolutionary algorithms Environmental Science and Pollution Research 3D continuous K-NN query for a landmark-based wayfinding location-based service, 3D Geo-Information Sciences Automatic landmark extraction from geotagged social media photos using deep neural network Modeling spatio-temporal relevancy in context-aware systems using voronoi continuous range query and multi-interval algebra FIA 5 : as customized Fuzzy Interval Algebra for modeling spatial relevancy Knowledge discovery web service for spatial data infrastructures WHO Coronavirus Disease (COVID-19) Dashboard Higher virulence of COVID-19 in the air-polluted regions of eight severely affected countries Walking toward Metro Stations: The Contribution of Distance, Attitudes, and Perceived Built Environment 2020) Spatial modelling, risk mapping, change detection, and outbreak trend analysis of coronavirus (COVID-19) in Iran Impact of suspending traffic management plan in Tehran on air quality and COVID-19 mortality and morbidity Accessing Covid19 epidemic outbreak in Tamilnadu and the impact of lockdown through epidemiological models and dynamic systems Application of two fuzzy models using knowledge-based and linear aggregation approaches to identifying floodingprone areas in Tehran Effects of long-term exposure to air pollutants on the spatial spread of COVID-19 in Catalonia Spain Examining the association between socio-demographic composition and COVID-19 fatalities in the European region using spatial regression approach Mapping community-level determinants of COVID-19 transmission in nursing homes: A multi-scale approach Spatial inequalities of COVID-19 mortality rate in relation to socioeconomic and environmental factors across England A computer movie simulating urban growth in the Detroit region Forecast and evaluation of COVID-19 spreading in USA with reduced-space Gaussian process regression Ricardo Manuel Arias Application of a radial basis function neural network for diagnosis of diabetes mellitus Application of semantic location awareness computing based on data mining in COVID-19 prevention and control system Natural and human environment interactively drive spread pattern of COVID-19: a city-level modeling study in China The authors are thankful to the Shahid Beheshti University of Medical Sciences for the support in the preparing of COVID-19 data.Word count: 5909 The authors contributed fully and equally to this work Najmeh Neysani Samany and Ara Toomanian specified the research design and implemented the methodology. Ali Maher and Khatereh Hanani investigated the sources, literature, and the applied policy. Ali Reza Zali analyzed the data and also provided an extensive discussion. All authors supported a detailed revision and inscribed the paper, read, and agreed with the final manuscript. ☒ The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.☐ The authors declare the following financial interests/personal relationships which may be considered as potential competing interests.