key: cord-352717-g247rjh9 authors: Guirao, B.; Campa, J. L.; López-Lambas, M. E. title: The Assessment of the HSR Impacts on Spanish Tourism: An Approach Based on Multivariate Panel Data Analysis date: 2016-12-31 journal: Transportation Research Procedia DOI: 10.1016/j.trpro.2016.12.027 sha: doc_id: 352717 cord_uid: g247rjh9 Abstract Literature review shows that little research has done so far to estimate how tourism indicators are affected by new High Speed Rail (HSR) lines. In 2012, a multivariate panel analysis developed by Chen and Haynes was applied to the Chinese regions to quantify the HSR impact on tourism output. The Chinese experience confirmed that, during the period 1999-2010, emerging high speed rail services did have significant positive impacts on boosting tourism in China. Since them, no similar empirical tool has ever been tested in Europe. The aim of this paper is to analyze and validate the suitability of this tool to assess empirically the effects of HSR on Spanish tourism during the period 1999-2014, and to enhance the abovementioned model with a tourism database. With more than 20 years’ HSR experience, and operating the longest HSR network in Europe (2,900 km), Spain offers a good scenario for this model application because Spanish tourism sector represents 10.2% of its gross domestic product (GDP). Results clearly show that there is a direct linkage between the evolution of certain Spanish tourism outputs and the construction of the HSR network. However, authorś recommendations include future new research on some variables limitations like the type of tourism output considered or the consideration of alternative explanatory indicators. In a climate of worldwide financial crisis, the exorbitant cost of building a new HSR line requires empirical evidence of its economic and social efficiency and this fact has generated a great debate in the scientific community (Betancor and Llobet, 2015; Albalate and Bel, 2015; Albalate et al, 2015) . Especially in countries with high tourism attractiveness, future HSR line projects need all possible benefits to be studied. Tourism impacts on HSR could be considered as a benefit variable although we recognize that tourism is a complex phenomenon to analyze (Sinclair and Stabler, 1997; Caccomo and Solonandrasana, 2001) , a form of complementary demand for which the main components are transport, food, and accommodation (Morley, 1992) . The link between tourism and transportation infrastructure has been widely analyze in the literature (Khadaroo and Seetanah, 2008) and there is a general feeling that an increase in the accessibility to the tourism destination usually leads to an increase in tourism demand. But HSR is a relatively new mode of transportation and represents a contemporary revolution in transportation technology which has been promoted only in some countries around the world. For that reason, although literature review shows how tourism has affected HSR demand (Guirao and Campa, 2015) , little research has done so far to empirically estimate the opposite effect: how tourism indicators are affected by new HSR lines, which is the main goal of this paper. The majority of published studies describe specific HSR experiences, mainly study cases (Guirao and Soler, 2008; Bazin et al., 2011) and some observed HSR effects on tourism but, as it is described below, they do not lead to a general conclusion. Using destination choice, Delaplace et al. (2014) studied HSR impacts in Paris and Rome tourism. Masson and Petiot (2009) used an economic geographic model (Krugman, 1991) , to discuss the influence of the southern European high speed rail (HSR) between Perpignan (France) and Barcelona (Spain) on both tourism activity and economic development. Using a non-empirical discussion, they argued that HSR can facilitate the development of tourism activities, and particularly business and urban tourism. With a similar methodology, using a gravitational model (the iso-tourist line from a time-space replacement concept) Wang et al. (2012) studied in China the projected HSR effects on tourism: redistribution and transformation of the tourist market, market competition on a larger scale, and reallocation of the urban tourism centre. But the methodology developed by Wang et al. was not empirical and did not allow the estimation of tourism ouput, like tourism demand or tourism revenue. Apart from economic geography models and the choice destination approach, the existing literature on tourism demand is dominated by econometric models that tend to follow a single-equation panel data approach (Lim, 1997; Song and Li, 2008; Song and Wong, 2003) . Chen and Haynes (2012) were the first to apply this methodology to HSR impact on tourism. Through a multivariate panel analysis, they investigated the impact of Chinese high-speed rail systems on the tourism industry, selecting only the numbers of incoming foreign tourists and tourism revenue as the dependent variables. The regional and geographic characteristics in the east and west, north and south of China vary substantially and the selected unit of their analysis was the provincial level, which includes 27 provinces and 4 municipalities. This experience confirmed that, during the period 1999-2010, emerging high speed rail services did have significant positive impacts on boosting tourism in China. In this context, the study of the suitability of the Chen and Haynes´s tool when applied to other European case studies seems to be the more reasonable first step towards the development of a systematic tool to evaluate the integral impact of HSR on the tourism industry. The man aim of this research lies in the first assessment of the dynamic panel data model tested in the Chinese scenario when applied to a European country: Spain. This paper is divided into the following parts: first, literature review in section 1; description and results of the model tested in China (section 2), application to the Spanish case study (section 4); and finally, the main conclusions (section 4). The model proposed by Chen and Haynes (2012) is based on a multivariate panel analysis using the data from 1999 to 2010 at the provincial level. Table 1 shows variables (and their main descriptive statistics) used by Chen and Haynes in their model. In relation to the dependent variables, the criteria used by authors to select the tourism output were focused on the availability of the related data base: number of foreign tourist arrivals (taf), number of oversea tourist arrivals (tat) and tourism revenue (tr). Three different models, using the same structure and independent variables but different tourism output, were tested in order to quantify the impact of the HSR lines on China tourism. It is important to remark that domestic tourism demand data was not publicly available and was not considered in the research and this is an important drawback of the methodology. Moreover, the numbers of total oversea tourist arrivals differs from the numbers of foreign tourist arrival in that the former not only includes the later, but also tourists from Macau, Hong Kong and Taiwan. Fortunately, tourism revenue output includes revenues from total oversea tourist arrivals. In relation to explanatory variables (dependent variables), rail features are defined by three indicators: lengtht of the railway network (including conventional lines), rail traffic (ridership) and the presence of HSR lines in each province (dummy variable). A preliminary multicollinearity data analysis found that the correlations among two of these rail variables (rail length and ridership) were not low and this fact forced the authors to present two separated models per tourism output (see Equations 1 and 2). In relation to the rest of independent variables, authors used two economic indicators: the annual nominal exchange rate (exc) from US dollar to RMB (RenMinBi, the legal tender of China) and the Gross Provincial Product per capita (gpppc). Some multidimensional features of local tourism are also collected: number of World Heritage Sites, number of scaled restaurants, number of museums, or number of public libraries (lib). Impedance variables were also took into account as the number of accidental pollution releases or the dummy variable associated to the outbreak of SARS (Severe Acute Respiratory Syndrome), originated in China in 2003. Attracting dummy variables were also considered to control the influences of events, like the 2008 Beijing Olympic Games and the 2010 Shanghai Expo respectively. Finally, there is another additional variable in the model (called year, that does not appear in Table 1 ), which finally collects the sequence of the observations (372 valid observations) and tries to measure time trends. Equations 1 and 2 shows the structure of the models, which were implemented using a fixed effects (FE) formulation (to take into account the unobserved individual effects). The terms i and t denote tour destination provinces and time period respectively while Y represents the dependent variables (foreign tourist arrivals, total overseas tourist arrivals or total overseas tourist revenue). Consequently, three pairs of estimations were carried out by Chen and Haynes and the error term εit is the sum of the unobserved province-specific effects μi and the white noise error term υit. The results of the application of the six models (2 per tourism output) demonstrate the effectiveness of Chen and Haynes´ estimation. The adjusted R square of the 6 models range between 0.67 and 0.81, indicating that the selected variables can explain between 67 to 81 percent of variances of the different tourism outputs. Equation 2 Among the results obtained, surprisingly, the local site characteristics such as numbers of five stars hotels, high scale restaurants, world heritage sites and museums are not statistically significant in all the tested models. Only the deterrent destination variable SARS shows a strong significant but negative impact on all the tourism outputs while the number of pollutions is not statistically significant. On the other hand, the used origin variable, the exchange rate (exc), results a key determinant on tourism outputs together with the destination gross provincial product per capita (gpppc) variable. In relation to rail variables, and this is the main result obtained by Chen and Haynes, the key policy variable HSR is the most relevant, being statistically significant at 5% significance level. For instance, if the province has the HSR service, it is likely to have an additional 20 percent of foreign visitors, or 23 percent total oversea visitors, or 25 percent total oversea revenue in that year. With these findings, the authors confirm that HSR services do have a significant influence on promoting tourism but recognize the validity of previous HSR investments only in that particular time and spatial context. China results have encouraged authors of this paper to study the implementation of the multivariate panel data analysis to Spain. Immediately after, next section contains a demonstration and discussion of the modelling process and the results for the Spanish case. The direct application of Chen and Haynes´s tool to the Spanish case raises several difficulties, mainly associated to the adaptation of the statistical database to the variables used in the original model. In order to use similar criteria, a provincial level data base was considered and 47 provinces were selected (after removing those located on the Spanish islands). The multivariate panel analysis was carried out taking into account the data base limitations and the period from 1999 to 2015 was selected for the study. In accordance with the original model, the dependent variables were focused on similar tourism demand indicators: number of foreign (non-resident in Spain) tourists (taNR) and tourism revenue from not resident in Spain tourists (tr). Table 2 shows the list of variables (and their main descriptive statistics) introduced in the Spanish models. Rail variables were similar to the ones considered in China case study (rlen and rider) and, before designing the equations for the Spanish case study, a preliminary multicollinearity data analysis found that the correlations among the explanatory variables rider and rlen were important (similarly to Chen and Haynes´s research). This fact forced the authors to present two separated models per output variable (rail ridership and rail network length). That means that four models have been designed and their results have been compared to the Chinese findings. In the time period considered (1999) (2000) (2001) (2002) (2003) (2004) (2005) (2006) (2007) (2008) (2009) (2010) (2011) (2012) (2013) (2014) (2015) no special impedance variables were took into account, like the Chinese accidental pollution releases or the outbreak of diseases of SARS. Instead, three special dummy variables related to the promotion of big international cultural and sports events were introduced as attracting indicators: the Forum 2004 held in Barcelona (Forum), the case of Valencia and the America's Cup in 2007 (AmCup), and finally the case of the International Exhibition in Zaragoza held in 2008 (expoZ). There is another additional variable in the model (called year), not linked to origin, destination or policy features, which finally collects the sequence of the observations and tries to measure time trends. In case of China, authors only used the annual nominal exchange rate (exc) from US dollar to RMB (RenMinBi), but this variable has been modified in the Spain case study. According to the statistics of foreign tourism (Instituto Español de Turismo, 2009), more than 75% of the Spanish foreign tourism is coming from the European Union and for that reason, our exchange variable has been defined as the difference between the provincial GDP and the European (Eurozone) GDP, referring currency to year 1999. Other modifications from the original model have to do with the type of hotels considered in the analysis: in Spain, four star hotels are the most demanded ones for both domestic and foreign visitors (Instituto Español de Turismo, 2015) , and for that reason the hotel variable (hotel) has included the number of 4 and 5 star hotels (China case study only included 5-star hotels). In relation to the unobserved individual effect in a panel model, four fixed effects model (FE) were developed, which consisted in statistical models that represents the observed quantities in terms of explanatory variables that are treated as if the quantities were nonrandom. Table 3 shows the determinants of foreign tourism in Spain obtained from model 1 (using rail network length) and a comparison with Chen and Haynes findings using the same tourism output. The Tables 3-6 shows the determinants of tourism output in Spain obtained from the model 1-4 (using rail ridership or rail length) and a comparison with Chen and Haynes results. The first finding, in the Spanish case, is related to the local site characteristics such as numbers of five stars hotels, high scale restaurants, world heritage sites and museums. In China, these variables were not statistically significant in any of the designed models, while in Spain, there are some indicators, like the hotel variable, which are statistically significant at 1% level. The coefficients of the hotel variable, considering the four models, range between 0.941 and 0.591. Other local site characteristics, like museums or libraries reached high levels of significance, although their coefficients are much lower than hotel variable values. The insignificant results for the event variables are similar for the Chinese and the Spanish experience. The special events such as the 2008 Beijing Olympic Game and the 2010 Shanghai Expo and number of pollution incidents are not statistically significant even at the 1% level. However, in China, the year dummy of 2003, which represents the epidemic of SARS in that year, is highly statistically significant. The estimation makes sense since SARS has a reasonably strong negative effect on foreign tourist arrivals In the Spanish model no deterrent event variable was introduced and results are consistent with the Chinese results. Among all the explanatory variables, it is not surprising to find that the gross provincial product per capita (gpppc) variable has a significant impact on the number of foreign tourists, either in China or in Spain. Another key determinant is the exchange rate (exc), which represents the price level of tourism. In the case of China, a decrease of exc denotes the appreciation of RMB and a depreciation of US dollars, in other words, the price of a Chinese tour rises. On the contrary, the increase of exc would indicate the decrease of tour price in China, which thus may further attract more international tourists. In relation to rail variables, while ridership and length are not really significant, the dummy indicator HSR plays an important role (being statistically significant at 1% significance level) in the foreign revenue models of China and Spain. This result is the key finding of this paper, although coefficients obtained in Spain are much lower than the ones obtained for China. In China, the highest coefficient is around 0.25, which can be interpreted as provinces with HSR services in that particular year are associated with approximate 25 percent additional tourism revenue than either year when they do not have the service or the provinces that do not have the services. In Spain, the highest coefficient (see models 3 and 4) is 0.017, which means that provinces with HSR services in that particular year are associated with approximate 1.7 percent additional tourism revenue than either year when they do not have the service or the provinces that do not have the services. In conclusion, results shows how HSR in Spain has affected positively to foreign tourism (especially revenue coming from foreign tourism), although his impact is not as important as the one registered in China. There are other explanatory variables in the model with identical significance but higher coefficients, like the gross provincial product or the number of hotels. In Spain, HSR serves mainly domestic trips and one of the main drawbacks of the Chen and Haynes´s model is that it does not consider domestic tourists, and this percentage is quite high in Europe. Moreover, the HSR impacts on tourism in a country like China, where the alternative interurban transport network is less developed than in Europe, are probably of a higher magnitude and easier to detect by an econometric model. Further research is also needed on using variables related to the territorial distribution of the HSR lines and the services offered by the operating companies. In these commuter touristic relations, for example, most of the visitors do not use to stay at hotels and consequently, they are not registered as tourists by official statistics. In conclusion, the model needs a greater detail of data disaggregation together with a better understanding of the territorial distribution of the HSR lines through new explanatory variables. The HSR variable behavior is a key issue in this paper, being highly significant either in China or in Spain, although coefficients obtained in Spain are much lower than the ones obtained for China. Results show how HSR in Spain has affected positively to foreign tourism (especially revenue coming from foreign tourism), although his impact is not as important as the one registered in China. There are other explanatory variables in the model with identical significance but higher coefficients, like the gross provincial product or the number of hotels. In Spain, HSR serves mainly domestic trips and one of the main drawbacks of the Chen and Haynes´s model is that it does not consider domestic tourists, and this percentage is quite high in Europe. Nevertheless, it must be highlighted that the most important tourism output considered in the Spain case study (number of foreign tourists) is linked to the stay at hotels. Due to the lack of other national data, like the number of visitors staying in apartments or in second residences, which have not been taken into account by the model, results should be interpreted with caution. Their consideration as well as to deepen in the understanding of the existing impacts should be subject to further research with a new and ad-hoc data base elaboration. Finally, the model needs a greater detail of data disaggregation together with a better understanding of the territorial distribution of the HSR lines and the type of HSR service offered by the operating companies. This paper only represents the first step towards a better understanding of the HSR impacts on the tourism industry. The Economics and Politics of High-Speed Rail. Lessons from Experiences Abroad When supply travels far beyond demand: institutional and regulatory causes of oversupply in Spain's transport infrastructure High speed railway, service innovations and urban and business tourism development Tourism industry and High Speed Rail, is there a Linkage: evidence from China's High Speed Rail development Can High Speed foster the choice of destination for tourism purpose Impacts of the new high-speed service on small touristic cities: the case of Toledo, in The Sustainable City V. Urban Regeneration and Sustainability. Wessex Institute of Technology The role of transport infrastructure in international tourism development: a gravity model approach Increasing returns and economic geography Review of international tourism demand models Can the high-speed reinforce tourism attractiveness? The case of the high-speed rail between Perpignan (France) and Barcelona (Spain). In Technovation A microeconomic theory of international tourism demand The choice of journey destination: a theoretical and empirical analysis The Economics of Tourism Tourism Demand Modeling and forecasting-A review of recent research Tourism demand modelling: a time varying parameter approach Behavior of HTR Fuel Elements in Aquatic Phases of Repository Host Rock Formations