key: cord-0827956-6zx7j4tn authors: Fotiadis, Anestis; Polyzos, Stathis; Huan, Tzung-Cheng T.C. title: The good, the bad and the ugly on COVID-19 tourism recovery date: 2020-12-13 journal: Ann Tour Res DOI: 10.1016/j.annals.2020.103117 sha: 4f5c53eb643a0e721bc8deee6f0d8ac7777254de doc_id: 827956 cord_uid: 6zx7j4tn This paper is to produce different scenarios in forecasts for international tourism demand, in light of the COVID-19 pandemic. By implementing two distinct methodologies (the Long Short Term Memory neural network and the Generalized Additive Model), based on recent crises, we are able to calculate the expected drop in the international tourist arrivals for the next 12 months. We use a rolling-window testing strategy to calculate accuracy metrics and show that even though all models have comparable accuracy, the forecasts produced vary significantly according to the training data set, a finding that should be alarming to researchers. Our results indicate that the drop in tourist arrivals can range between 30.8% and 76.3% and will persist at least until June 2021. Differently from the recent epidemic outbreaks such as SARS, Ebola, and H1N1, the coronavirus (COVID-19) remain the world's deadliest epidemic outbreak that comes along with a systemic global healthcare crisis, financial crisis, and economic downturn known as COVID-19 recession. To limit the spread of the COVID-19, governments across the globe have taken drastic measures by locking down the entire country or the most affected cities and also by prohibiting entry to their borders, resulting in an immense hit for the global tourism industry, particularly the travel and hospitality sector. The COVID-19 outbreak has forced many tourism destinations to stop their operations following lockdown measures and travel bans, as well as canceled bookings and limited logistics. According to an estimate of the UNWTO (2020b), international arrivals dropped by 22% in the first quarter of 2020 and are expected to register a decline between 60 and 80% for the whole year, translating into a loss of between US$910 billion to 1.2 trillion. Moreover, social distancing has suppressed the hospitality industry where several accommodation facilities were forced to instantly stop their operations and/or significantly downsize them. Among other types of businesses, Hotels and accommodations are considered the preliminary hotspots that transform local epidemic outbreak into a pandemic and the preliminary point The term "crisis" refers to sudden and unexpected events that can result in major unrest and threats for citizens. In this case, a public health crisis is a difficult circumstance that affects individuals in several geographic regions or a whole country. In the case of a global health crisis, this frequently originates in a particular region before spreading to an entire country and the entire planet as the current COVID-19 crisis. Therefore, the choice of traveling and visit to a destination depends on tourists' perceptions regarding their safety and security (Taylor & Toohey, 2007) and the imagery formed by how the media or social media report the crisis. As a result, it becomes difficult for the tourism industry to face the challenges posed by health crises as these crises are often subject to negative media coverage (Novelli et al., 2018) . Following the SARS epidemic crisis, several studies have revealed the adverse effects of the outbreak on the global tourism industry and particularly on the economies of Southeast Asian neighboring countries (Hai, Zhao, Wang, & Hou, 2004; Hanna & Huang, 2004; Overby, Rayburn, Hammond, & Wyld, 2004; Pine & Mckercher, 2004; Siu & Wong, 2004) . In their study, Abdullah et al. (2004) show that the SARS outbreak in 2003 has caused a 2.6% decrease in international travel during the first 4 months of 2003. For the Asia-Pacific region, these authors reported a decrease of 10% in March and 50% in April. As the most affected region, Hong Kong has registered a decline of 64.8% in March and 67.9% in April following the SARS outbreak during that same year (Abdullah et al., 2004) . One should expect important changes in the behavior of Chinese tourists (Wen et al., 2020) as many tourism destinations globally are waiting for the recovery efforts. Interestingly, one should expect tourists to choose less crowded destinations that facilitate the practice of social distancing (Seraphin & Dosquet, 2020) . Therefore, one may expect a limit in the interaction between employee and customer in the provision of accommodation services. Within this vein, Wen, et al. (2020) suggest the importance for managers to prepare for hotel development once the COVID-19 pandemic crisis is over and also suggest the need for managers to pay attention to possible experiences stemming from this crisis. In a recent study, Gössling et al. (2020) analyze the effect of prior epidemics and pandemics and the effect of the COVID-19 pandemic on the global tourism industry, the airline and hospitality industries succeeding the travel restrictions and the great lockdown. Based on the results of their study, Gössling et al. (2020) caution how a pandemic outbreak can change society, national economies, and the tourism industry. Accordingly, they conclude that the negative impact of epidemic outbreaks will be greater on the tourism industry and supporting sectors in the world's poorest economies. The negative effects may be further aggravated by changing work behavior in hotel workers (Stergiou & Farmaki, 2020) . As mentioned earlier, the negative impact of epidemic outbreaks is not only disastrous for the tourism industry but also for the supporting sectors. The study by Williams and Kayaoglu (2020) illustrates the impact of an epidemic outbreak on the tourism industry and the supporting sectors to the tourism product and service delivery, as well as employment. These authors point out that nearly one-tenth of the European non-financial economic activities were linked to tourism and account for 9.5% of employment among the active population in the EU in 2016. Williams and Kayaoglu (2020) also point out that accommodations and the food and beverage sectors in the EU contributed to 19.7% and 58.7% respectively for the overall employment in the tourism industry. On this basis, it clear that the great lockdown and shutdown of the tourism-related businesses and supporting businesses following the COVID-19 pandemic have resulted in an unprecedented socio-economic impact. Tsionas (2020) forecasts post-COVID-19 gradual adjustment in the tourism, hospitality, and related industries. The results show that reopening gradually that requires only nonnegative profits is quite feasible, whereas reopening that requires the same profit level as in the pre-COVID-19 period remains significantly more difficult and appears achievable by reopening at capacity neighboring 33%. Based on these results, Tsionas concludes that lower capacities necessitate governmental supports which are likely to differ substantially from hotel to hotel. Within the same vein, Assaf and Scuderi (2020) argue that the tourism industry has been confronted with numerous crises in the past and thus caution that the present crisis resulting from the COVID-19 outbreak remains by far the most damaging one. This argument is also shared by Karabulut et al. (2020) and by Dolnicar and Zare (2020) . Consequently, the tourism industry and government have an important role to play in the recovery efforts as the tourism industry will look different from post-pandemic (Assaf & Scuderi, 2020) . Consistently, it is expected that the consequences of COVID-19 will result in a significant decline in value industries hotels, airlines, cruise lines, and car rentals (Sharma & Nicolau, 2020) . The decline is significant enough in each industry to show concerns over the long-term outlook for each of these industries. What is apparent, nevertheless, is that the most critical concerns are largely connected to the cruise industry. Our first approach extends the decomposable time series model of Harvey and Peters (1990) and splits the model into three components, the trend, the seasonality, and the "irregular" component. The latter component is essentially the training set used. This model specification is essentially similar to the GAM methodology of Hastie and Tibshirani (1990) and has been introduced by Taylor and Letham (2017) . We build on their model to accommodate the particular needs of our forecasting task. The general form of the model can be described as follows: where g is the general trend of the time series, s is the seasonal component and k represents the effect of the training set, to capture irregular effects on the time series. In this manner, we can model systematic (periodic and non-periodic) changes in the data set, through the seasonality and the trend components respectively, while unsystematic outcomes are modeled by the "special" component. Finally, the error term ε will capture the residual, idiosyncratic changes not accommodated by the model. The errors are assumed to be normally distributed and this has been verified in our implementation both visually and by using the Jarque-Bera and the Shapiro Wilk (Shapiro & Wilk, 1965) test for normality. Under this approach, the seasonality component is an additive factor in the modelling approach, in a technique similar to exponential smoothing. The main advantage of the GAM approach is its ability to incorporate new components with a simply additive transformation. Taylor and Letham (2017) present a "holiday" component that captures the effects of special dates in each calendar year. We choose to replace this component with our training data set, thus decomposing the time series into a different set of predictors. Also, this model has very good performance not only in terms of its predictive accuracy but also in terms of its fitting speed, using different methodologies. Similarly to the original version of the algorithm, we fit the model using the Limited-memory Broyden-Fletcher-Goldfarb-Shanno algorithm (L-BFGS). L-BFGS is a popular algorithm for parameter estimation in machine learning due to its ability to minimize the target function f(x), a differentiable scalar function, over an unconstrained set of real values for the vector X. Its fast performance provides us with the flexibility to calculate different models using the four different training sets and a worst-case scenario, as discussed in section 0. For the trend component of the model, we select the non-linear, saturating growth approach over the linear trend with breakpoints. This is justified since the tourism data series demonstrates non-linear patterns, without specific breakpoints that boost demand on a global level (Hassani et al., 2017) . However, demand breakpoints can be captured in our model, as we will see below. The trend component is as follows: where P represents the global population at each period, h is the growth rate and m is an offset parameter. The growth rate is assumed to be non-constant and exhibits structural breaks. For B breaks in the data series at points b j (j = 1 to B), we can define δ as a vector of growth rate adjustments, and thus calculate the growth rate, h, at any given point by. At each changepoint, j, we can define the growth rate adjustment by We then define a vector α(t) such that Combining Eqs. (2), (3), (4), and (5), we get the final growth model as: For the seasonality component, we follow the flexible model of periodic effects of Harvey and Shephard (1993) , which defines a Fourier series as follows: The seasonality parameters vector [φ 1 , ω 1 , ………, φ n , ω n ] has normal distribution around zero means. The parameters for the estimation are selected using AIC. For yearly seasonality, which is displayed by our data, AIC sets N = 10 (Taylor & Letham, 2017) . The second approach used is the Long Short Term Memory (LSTM) neural network, as implemented by Polyzos et al. (2020) . LSTM networks are very efficient in explaining autoregressive data series and tourist arrivals are a prime candidate for the implementation of this methodology (Athanasopoulos & de Silva, 2012; Law et al., 2019) . Similarly to Polyzos et al. (2020) , we crossvalidate the forecasting network using backtesting, whereby we split the data sample into groups that are used for training and validation and we calculate performance metrics for each window. Under this cross-validation technique, we essentially develop different models on the subsets of our data and test them against a validation data set. This calibrates the logic gates of the model and increases the robustness of our predictions, helping us achieve the desired prediction accuracy level. The LSTM algorithm is a deep learning algorithm and was first introduced by Hochreiter and Schmidhuber (1997) and it is particularly useful when attempting to model time series with high degrees of autocorrelation, using the fact that it is able to adapt to long-term dependencies (Gers et al., 1999) and overcome the errors of similar algorithms in the back-propagation of information contained in recent input events (Bengio et al., 1994) . In this manner, LSTM networks can utilize the information contained in recent input and this information is used for long periods after the input time. LSTM is essentially a special case in the group of the recurrent neural network, which utilizes sequential information by selectively passing inputs across time steps during data element processing of data elements (Cho et al., 2014) . The LSTM topology (source: Schmidhuber and Hochreiter (1997) Fig. 1) features a recurrent learning unit inside the network and, also, several decision gates that utilize two important attributes: the longer states from the starting units and the shorter states from the last unit of information. This feature has permitted LSTM networks to achieve great success in solving time series forecasting problems (Law et al., 2019) . The general setup follows that of Recurrent Neural Networks (RNN) and includes an input and an output layer, with many hidden layers in between. However, the process involves an attention mechanism that can assign different weights to the various inputs of the model, thus permitting it to learn the importance of new input during data processing. A stateful LSTM methodology suggests that cell states are preserved after each iteration and are simply updated with the new information. The modelling approach includes building a forecasting network ϑ which will predict n future values based on a vector of T previous values of the same data series. The model is as follows: The decision functions at each gate (forget gate f, include gate i and output gate o) and the hidden layer (h) are as follows: where σ represents a sigmoid function, W f, i,o represents the weight vector of inputs, h t-1 is the hidden layer from previous periods, y t is the new input vector and b f,i,o is the bias of each gate. The bias coefficient is a common feature of all machine learning functions and can either be set beforehand or calculated during the training phase. The sigmoid function is implemented as the gating function for the three gates of the LSTM network because it outputs a value between 0 and 1. Thus, the gate can either let no flow or complete flow of information pass through the gates. However, to overcome the vanishing gradient problem, we implement the tanh function in the hidden layer. The tanh function is commonly used in LSTM networks to determine which candidate values will be added to the internal state since its output can be either positive or negative, allowing for increases and decreases in the cell state. To determine the new candidate values (vector e C t ) that can be added to the neural cell's state, LSTM uses the following equations. These are determined as follows: Thus, the new cell state, C t , is as follows: We implement early stopping on the LSTM models in order to prevent overfitting. This is an important addition to deep learning models, as too many training epochs may result to overfitting but too few can yield an underfit model. Early stopping is considered a robust algorithm that prevents overfitting and we make sure that each epoch results in an improvement of the evaluation metrics of the model. Our implementation includes two LSTM layers and a dropout layer. The dropout layer helps prevent overfitting by reducing the sensitivity of the model to the specific weights of individual neurons. This is performed by discarding randomly selected neurons during training. It is important to remember that we have implemented a stateful LSTM, which means that all cells "remember" their previous state with each iteration. We have chosen to add two layers as this has been shown to be able to handle most complex problems, while more layers can be difficult to train, with little gain in terms of accuracy. The same is true for the process of selecting the number of neurons, where we find that nine neurons are adequate since adding more does not further improve accuracy. Also, by limiting our layers to two, we can implement sigmoid activation function, as discussed earlier. The model is optimized using the Adam optimization algorithm which is a stochastic gradient-descent method. This methodology is computationally efficient and yields robust results with low memory requirements (Kingma & Ba, 2014), which is an appropriate choice given the number of variations in our methodology. The literature defines three principal metrics for the evaluation of prediction accuracy. These are the Mean Absolute Percentage Error (MAPE), the Mean Absolute Error (MAE), and the Rooted Mean Square Error (RMSE). We implement these measurements to evaluate the accuracy of the different models designed. The formulae for the metrics are given below: where y t and̂y t are the observed and fitted values of the variable at time t, respectively. Data & training periods We use monthly data regarding international tourist arrivals sourced from the UNWTO Tourism Data Dashboard (UNWTO, 2020a), starting from January 1998 up to May 2020. For reference purposes, we extended the value of May 2020 to June 2020, due to lack of available data. Thus, our forecasts begin from July 2020, where borders around the world started to gradually open up. The figures retrieved relate to the number of international tourist arrivals worldwide, in millions of tourists. The period is long enough to include all the necessary events to perform the rolling-window training, as well as training on the three specific events that interest us. Since both methodological approaches consider the autoregressive patterns, we choose not to deseasonalize the data series, as the seasonality component is captured by both methodologies. The descriptive statistics for the data are presented in Table 1 , while the data is demonstrated graphically in Note: This figure demonstrates the international tourism flows from January 1999 to May 2020. We zoom in on the three period of interest for training reasons, namely the period of SARS epidemic, that of the MERS outbreak and on the GFC period. Fig. 2 , where we also zoom in on the three periods of interest (SARS epidemic, MERS outbreak and the GFC period). We choose to discuss these periods since all three represent points of interest in the tourist demand bibliography. The literature suggests the SARS epidemic one of the most important crises in tourism (Overby and al., 2004) . However, despite widespread fears, the results of the SARS epidemic were limited to the countries with a larger number of cases. In addition, the MERS outbreak has also had detrimental effects on tourism flows, particularly due to the fact the many middle eastern countries are global tourism destinations (Choe et al., 2020) . Finally, the GFC of 2007-2009 is often discussed as the biggest crisis in the tourism sector of the recent years (Sheldon & Dwyer, 2010) . The periods have been included as training periods in order to produce forecasts for the results of the current pandemic. , which reveals the correlation between a given time series and lagged versions of itself, in the international tourist arrivals time series. LSTM models require data with a high degree of autocorrelation (Gers et al., 1999) and thus this is the first step of the LSTM modelling approach. The optimal lag is set according to Akaike Information Criterion (AIC) at 12 months, where it is evident that autocorellation is maximized. This is unsurprising as the data is highly seasonal. At the 12-month lag, the calculated autocorrelation exceeds the threshold value of 0.5, suggesting that this lag can be considered as for the optimal lag setting and thus help to build the forecasts that are needed employing the LSTM approach. After the 12-month lag, autocorrelation decreases again and thus this value is the optimal value. We construct ten different models (five for each methodology) as follows. The first model uses the entire sample as a training data source. The next three models are constructed using training data from major crises of the recent past, namely the SARS epidemic, the GFC and the MERS outbreak. The last model in each methodology is a "worst case" scenario approach, which uses the minimum values for each data point (month) in the data series. By training the models with this data, we are able to present a pessimistic scenario for international tourist arrivals forecasts. We duplicate these training periods once in the GAM model and once in the LSTM model, resulting in ten different forecasting models. After constructing the models, we perform the rolling-window sampling plan (Fig. 4) in order to compute accuracy metrics and determine the optimal modelling approach. Using the setup of Polyzos et al. (2020) , which splits the data into slices with a three-year training period (36 data points) and a one-year testing period (12 data points), we create six subsamples that allow us to test our prediction model. The sampling window is shifted with each subsample in order to create more datasets (the data subsamples) and to permit us to compute metrics on the accuracy of each model's predictions. Fig. 4 shows the six data slices with forecasts created using the LSTM model. We calculate three accuracy metrics for each model. We also include a simple ARIMA (Auto Regressive Integrated Moving Average) model, trained only over the entire data sample, which is a popular methodology, as a baseline model, for comparison purposes. The reported metrics helps us determine the relative performance of each methodological approach and of each training set. The results are displayed in Table 2 , which also includes the average value that the model produces for each metric. Note: The two columns are not synched, but rather they demonstrate the values of the metrics for each data series separately. This means that, for example, the minimum number of tourist arrivals, which is 3 million, did not occur in January 1998 (in fact it was in April 2020). January 1998 is simply the minimum value in the Date column. A. Fotiadis, S. Polyzos and T.-C.T.C. Huan Annals of Tourism Research 87 (2021) 103117 The table shows that both methodologies outperform the ARIMA model and produce forecasts with comparable accuracy, with the LSTM model being somewhat more efficient for the specific data series. Also, we see that using the entire data sample as a training set yields lower prediction errors, which confirms the efficiency of our metrics, as this is an intuitively expected result. In addition, in both cases, the worst-case scenario will produce the highest errors, regardless of the approach or the metric uses, which is also an anticipated result. It is important to note that we do not use raw error values in any of our metrics (as would be used for example in the Mean Percentage Error metric). Fig. 5 demonstrates our forecasts on international tourist arrivals from July 2020 onwards, using our two methodological approaches and the five different training sets. The graph clearly shows the significant losses of the tourism sector due to the pandemic, which seems to have detrimental effect on tourist flows. In terms of the current situation, the peak in the summer of 2019 is followed by the current trough with the reduction in tourist arrivals being as high as 97.6% (Year-to-year change from May 2019 to May 2020, actual data). This effectively means that the current tourist season should be counted as an almost total Moving on to our forecast summaries (Table 3) , we can see that all models predict that the significant losses are persistent even up to the next tourist period. Firstly, the yearly drop in December 2020, as compared to December 2019, is calculated to be around 56.1% (the averages of both methodologies coincide), with the best scenario predicting a drop of 41.7% and the worst scenario suggesting a drop as steep as 75.2%. In the two-year change (June 2019 to June 2021), our predictions average a 48.5% drop, with different scenarios ranging from 30.8% to 76.3%. Both the values and ranges of our predictions suggest that recovery of the tourism sector is hardly nigh. The next year holds difficult predictions for the industry and our average forecast for June 2021 suggest that the loss of tourism demand takes the sector back to the tourist flows of 2005, which means that 15 years of growth will be lost due to this pandemic. In addition, the ranges of the prediction values show that the drop will not necessarily be smoother as the months progress. The worstcase for June 2021 is worse than that of December 2020, in percentage, year-to-year terms, despite the fact that the average predictions seems to be better. In terms of methodologies, we see that GAM predictions are somewhat more conservative when compared to LSTM predictions. However, we can see that the events included as training sets affect the prediction outcome in a different manner according to the methodology. For example, despite lower predictions in general, the GAM model yields higher predictions on average when the SARS or the GFC training set is used. This means that, particularly in machine learning algorithms, the training set used is important in determining the final outcome of the prediction model. It is important to note, however, that except for the worst-case scenario, all training sets yield similar accuracy metrics and thus there does not seem to be a consistent pattern of performance in the different training sets. Finally, it is important to note that, barring the worst-case scenario, the GFC training set produces the lower forecasts in both methodologies, confirming existing literature that the GFC was the deepest crisis faced by the tourist sector before the current pandemic (Sheldon & Dwyer, 2010) . Using a multi-perspective approach to evaluate the impact of crises has important practical implications for economic actors that still look forward to starting with phase one recovery. Crises are undesirable events that greatly affect the tourism industry. By viewing conflicts as crises, introducing a holistic and multi-perspective framework provides researchers with substantial insights on how tourism-related businesses can develop and manage post-conflict goals (Reddy et al., 2020) . In this paper, we produced a 12-month forecast for international tourist arrivals, using two methodologies and five different training sets. Our empirical findings show that the COVID-19 pandemic will result in losses of around 50% for the next year and that these losses will be persistent at least until the next summer and will backtrack the growth of the tourism industry as much as 15 years. In addition, we show that the worst-case scenario predicts that a deterioration of tourist arrivals in the coming months, suggesting that the tourism industry needs to prepare for a potentially worsening crisis. The predicted drop is in line with UNWTO forecasts of a 60%-80% for 2020, but our results do not agree with expectations of recovery in early 2021 (UNWTO, show that it may take up to a year for the trend in tourist arrivals to return to pre-crisis levels. In addition, our work shows that it is important to compare different training sets when designing machine learning algorithms, since each training set may yield different results. It is noteworthy that the different training sets used produce forecasts with comparable accuracy across both methodologies. However, the predictions of each model vary to a great extent and this is an important conclusion and one that is often overlooked by the literature. Machine-learning research tends to compare different methodologies across the same training set, whilst our work suggests that different training sets should also be explored. Our results have important implications both for policymakers and for researchers. In terms of policy, our predictions suggest that the crisis is far from over in the tourism sector. Without a medical solution to the pandemic, tourists will be reluctant to travel, despite the protective measures taken by transport and hospitality companies. The current tourist season should be counted as an almost total loss for the industry, since the drop for the current year currently exceeds 98%. Our forecasts for the coming months show a drop of around 50% in December 2020, compared to the previous December, while predictions in some of our scenarios for June 2021 are even graver. In addition to being faced with reduced revenue, the industry is also facing increased costs when implementing the protective measures put in place by authorities. The persistence of this crisis, as signified by our predictions, suggests that firms in the tourism sector will need to re-evaluate their business models to incorporate for the reduced demand and increased costs. The new status quo, which will linger for at least a year based on our predictions, suggests that profitability in the industry needs to be re-evaluated and new tourist products may need to emerge, reshaping the sector. These changes may be temporary or may persist even after the crisis has passed. As a response, authorities will need to subsidize the protective measures they propose in order to support tourism, especially in countries where tourism revenue contributes highly to GDP. Government subsidies on these measures will also help confirm that the measures will be implemented as they are planned by medical professionals, since poor or scanty implementation can have the same detrimental effects as no implementation, with the added cost of false security. It is important that the tourist sector embraces medical guidelines as they represent the shortest path out of the current crisis. Finally, our work illustrates the importance of implementing different methodologies when researchers attempt to produce data forecasts. Additionally, methodologies that utilize training data should be implemented using many different data sets, from different periods, for this task, since we have shown that the results generated can differ significantly. This is an important finding and one that is of particular significance in light of the recent development of machine learning methodologies and their increasing adoption by the academic community. Researchers should cross-validate their findings with different data sets, in order to compare the outcomes and add further substance to the results. Our conclusions and suggestions are built on the outcomes of the particular methodologies employed, given the specific training sets. We acknowledge that research incorporating different methodologies and different variables could yield different results. As Kock et al. (2020) note, previous research assumptions on tourism may need to be revised during the COVID-19 era. Our findings depend heavily on post-COVID-19 travel behavior and also, on the discovery and distribution of a safe and effective vaccine that can be made widely available worldwide. On the other hand, if a vaccine is available only in certain regions, vaccine tourism can be a prime candidate for tourism revenue recovery. Similarly, proliferation of travel insurance packages could also alter travel patterns (Uğur & Akbıyık, 2020) . The results of the current research would need to be revised when there is a foreseeable end to the pandemic. An appealing methodological addition would be to prepare the data for multi-step forecasting, which is a useful extension of LSTM models that uses each forecast as a new data point and retrains the model using it as an observation. What is more, the COVID-19 pandemic is still evolving and the numbers worldwide are still rising rapidly. As the crisis is developing, the outcomes of our predictions may need to be adjusted. A particular issue comes from data availability. Safer predictions could be postulated after the summer of 2020, since the summer months of July and August have typically been the yearly peaks of tourist traffic. We believe that it would be interesting to explore similar predictions once data up to September 2020 become available. Furthermore, our forecasts do not consider the potential changes in tourist behavior and choices, following the pandemic (Li, Nguyen, & Coca-Stefaniak, 2020) . The differences in transportation choices or vacation duration could have a significant effect on our projections. Finally, our work is focused solely on international tourism flows, meaning that our results do not account for regional particulars which affect tourism flows in the coming months. In addition, we cannot take into account domestic tourism flows. We believe that the numbers in staycation tourism will increase as a response to border closures and this may alleviate the negative effects of the pandemic, at least in part. 100-150 words answering the question: what is the contribution to knowledge, theory, policy or practice offered by the paper? Our results have important implications both for policymakers and for researchers. In terms of policy, our predictions suggest that the crisis is far from over in the tourism sector. Without a medical solution to the pandemic, tourists will be reluctant to travel, despite the protective measures taken by transport and hospitality companies. Our work also illustrates the importance of implementing different methodologies when researchers attempt to produce data forecasts. Additionally, methodologies that utilize training data should be implemented using different data sets for this task, since different results can be generated. This is an important finding and one that is of particular significance in light of the recent development of machine learning methodologies and their increasing adoption by the academic community. 100-150 words answering the question: how does the paper offer a social science perspective/approach? From a social science perspective and as mentioned in last paragraph, our results have important implications both for policymakers and for researchers. This paper makes three important contributions to the extant literature. First, we add to the discussion on the potential economic costs resulting from the current COVID-19 pandemic. Second, it compares two methodologies that can be used to produce tourist demand forecasts and shows that they have comparable performance. Thirdly and last, we demonstrate the importance of the training set in machine learning modelling, showing how different training sets produce different results that could potentially lead researchers to different conclusions. Impact of severe acute respiratory syndrome (SARS) on travel and population mobility: Implications for travel medicine practitioners Multivariate exponential smoothing for forecasting tourist arrivals Learning long-term dependencies with gradient descent is difficult A charter for sustainable tourism after COVID-19 The impact of the Middle East respiratory syndrome coronavirus on inbound tourism in South Korea toward sustainable tourism Learning phrase representations using RNN encoder-decoder for statistical machine translation COVID19 and Airbnb -Disrupting the disruptor Economic policy responses to a pandemic: developing the COVID-19 economic stimulus index International tourism and outbreak of coronavirus (COVID-19): A cross-country analysis Learning to forget: Continual prediction with LSTM Pandemics, tourism and global change: A rapid assessment of COVID-19 The short-term impact of SARS on the Chinese economy The impact of SARS on Asian economies Estimation procedures for structural time series models Structural time series models Forecasting accuracy evaluation of tourist arrivals Generalized additive models Long short-term memory The role of the hotel industry in the response to emerging epidemics: A case study of SARS in 2003 and H1N1 swine flu in 2009 in Hong Kong Biosecurity, automation technologies and economic resilience of travel How pandemics affect tourism: International evidence Understanding the COVID-19 tourist psyche: The evolutionary tourism paradigm Tourism demand forecasting: A deep learning approach Gradient descent with early stopping is provably robust to label noise for overparameterized neural networks Coronavirus impacts on post-pandemic planned travel behaviours. Annals of Tourism Research, Article 102964 No Ebola…still doomed" -The Ebola-induced tourism crisis The China syndrome: The impact of the SARS epidemic in Southeast Asia The impact of SARS on Hong Kong' s tourism industry Tourism demand and the COVID-19 pandemic: An LSTM approach Perceived risk and the non-institutionalized tourist role: The case of Israeli student ex-backpackers COVID-19: An opportunity to review existing grounded theories in event studies Mountain tourism and second home tourism as post COVID-19 lockdown placebo? Worldwide Hospitality and Tourism Themes An analysis of variance test for normality (complete samples) An open market valuation of the effects of COVID-19 on the travel and tourism industry The global financial crisis and tourism: Perspectives of the academy Economic Impact of SARS: The Case of Hong Kong Ability and willingness to work during COVID-19 pandemic: Perspectives of front-line hotel employees Forecasting at scale Perceptions of terrorism threats at the 2004 Olympic games: Implications for sport events COVID-19 and gradual adjustment in the tourism, hospitality, and related industries Impacts of COVID-19 on global tourism industry: A cross-regional comparison. Tourism Management Perspectives, 36, 100744. UNWTO (2020a). UNWTO tourism data dashboard International tourist number could fall 60-80% in 2020 Macroeconomic and health care aspects of the coronavirus epidemic: EU, US and global perspectives COVID-19: Potential effects on Chinese citizens' lifestyle and travel COVID-19 and undeclared work: Impacts and policy responses in Europe Threat of infectious disease during an outbreak: Influence on tourists' emotional responses to disadvantaged price inequality He has published more than 40 research papers in international academic journals such as Tourism Management and Journal of Business Research along with several books and book chapters His major fields of interest are agent-based simulations, banking crises He also serves as the President of Tainan University of Technology, Taiwan. His research interests are tourism research methodology The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.