key: cord-0927503-7zsjtfvk authors: Al-Maadid, Alanoud; Alhazbi, Saleh; Al-Thelaya, Khaled title: Using Machine Learning to Analyze the Impact of Coronavirus Pandemic News on the Stock Markets in GCC Countries date: 2022-04-28 journal: Res Int Bus Finance DOI: 10.1016/j.ribaf.2022.101667 sha: 564da5278b07ab2a2b4fe339373d01242993dca4 doc_id: 927503 cord_uid: 7zsjtfvk COVID-19 has resulted in high volatility in financial markets across the world. The goal of this study is to investigate the impact of COVID-19-related news on the stock markets in Gulf Cooperation Council (GCC) countries. The study utilizes machine learning approaches to assess the role of COVID-19 news in stock return predictability in these markets. The results reveal that the stock markets in the United Arab Emirates (UAE), Qatar, Saudi Arabia, and Oman were impacted by coronavirus-related news; however, this news had no impact on the stocks in Bahrain. Moreover, the results indicate that the impacted markets were influenced differently in terms of the quantities and types of news. The extremely unpredictable spread of the coronavirus (COVID-19) has led to unprecedented global economic decline. Global economic activity has been heavily influenced by several inevitable reductions in travel, trade, and production. The outbreak of this pandemic has resulted in high volatility in financial markets across the world. Baker et al. (Baek et al., 2020) reported that the COVID-19 pandemic impacted stock markets more powerfully than any other infectious disease outbreak that came before. Goodell (Goodell, 2020) presented a comparative study of the impacts of COVID-19 and other natural disasters on the global economy, and the study indicated that COVID-19 is unique in terms of its global scope, wide range and long-term economic damage. In the U.S. stock market, the observed volatility levels surpassed some of the highest volatility levels that occurred in late 1929, the early 1930s, October 1987, and December 2008 . Between the last week of Feb. 2020 and the last week of March, the S&P 500 index lost 33% of its value. Similarly, financial markets in South Korea, Australia, Hong Kong, and China dropped by more than 5% daily (Salisu & Vo, 2020) , the Nikkei index in Japan plunged more than 30%, and the U.K.'s main index dropped by more than 35% (Dayong Zhang et al., 2020) . The analysis of the impact of the pandemic on stock markets is ongoing research, and many studies have adopted different analytical approaches to illustrate and quantify the influence of the pandemic on stock markets from different perspectives. These include the impacts of government measures (Ashraf, 2020a) (Zaremba et al., 2020) (Haroon & Rizvi, 2020a) , spillovers and contagion between markets (Akhtaruzzaman et al., 2020) (Okorie & Lin, 2020) , the numbers of confirmed infections and deaths (Al-Awadhi et al., 2020) (Ashraf, 2020b) (Harjoto et al., 2020) , and related news (Cepoi, 2020)(Ambros et al., 2020) (Buckman et al., 2020) . Generally, news is an important factor that greatly impacts stock markets; investors are usually influenced by media reports when making their decisions. Although news might be incomplete or biased, it impacts investors, especially retail investors, because acquiring and processing real data is costly in terms of time and cognitive efforts (Engelhardt et al., 2020 ). An extensive literature discusses the impact of news on financial markets (K. Li, 2018) (Sinha, 2016) (Beetsma et al., 2013) (Smales, 2014) (Al-Maadid et al., 2020) . This includes different types of news, such as macroeconomic announcements (K. F. Chan & on the bootstrap aggregation technique (bagging), where its weak learners are trained independently and in parallel, and the result of the model is the average of the outputs of all weak learners. On the other hand, XGB is based on a boosting technique, where its weak learners are trained sequentially, the data instances that are predicted incorrectly by weak learners are spotted, and the learner weights are increased to improve the accuracy of the following learners. Third, the data used in this study cover wider aspects and longer periods of COVID-19 news. In this study, we consider different types of COVID-19-related news, including negative news on Twitter, business news, Google search volumes, numbers of confirmed local cases, numbers of confirmed global cases, numbers of local deaths, and numbers of global deaths. Additionally, the data in this study cover the period from the beginning of 2020 to the beginning of October 2020. This period includes the situation before any local cases were reported and the period of global stock decline, as well as the period during which the markets recovered. Thus, these data capture the role of COVID-19 news in these different situations, not only during the periods before and during the stocks experienced their worst decline in March 2020, as used by other studies. Third, despite the significant increase in the volatility of the GCC stocks as a result of COVID-19, only the study in (Bahrini & Filfilan, 2020) investigated the effects of the pandemic on these markets; however, the study only investigated the impacts of confirmed cases and deaths during the period from April 1, 2020, to June 26, 2020. Therefore, our study contributes to filling the gap in the literature regarding the understanding of the impact of COVID-19 on GCC stocks. Overall, this study finds that COVID-19-related news affected stock markets in the UAE, Qatar, Saudi Arabia, and Oman but had no influence on the stock market in Bahrain. Moreover, the influence of news varied among the impacted markets; the markets in the UAE and Qatar were influenced more than the markets in Saudi Arabia and Oman, which were influenced only slightly. The rest of this paper has the following structure. Section 2 presents a review of the relevant literature. Section 3 describes the utilized dataset. Section 4 illustrates the methodology and the machine learning models used in this study. The empirical results are presented and discussed in Section 5, while Section 6 concludes the study. The literature on the impact of COVID-19 on different aspects of life is evolving. The massive level of media coverage associated with this pandemic has affected investors' behaviors in stock markets. Akhtaruzzaman et al. (Akhtaruzzaman et al., 2021) found that the media played the main role in transmitting financial contagion across stock markets. To study this impact, Buckman et al. (Buckman et al., 2020) used news sentiment analysis. Their study employed the Daily News Sentiment Index (DNSI) in the U.S., exploiting the lexical approach and machine learning to process economics-related news articles and provide real-time data representing news sentiment scores. The news sentiment index was compared with the University of Michigan's index of consumer sentiment. The two indices were found to be correlated and exhibited similar or close levels of response to different economic events. The results showed that the increasing coverage of COVID-19 through mid-March was synchronized with the decline in the news sentiment index. Similarly, Lee (Lee, 2020) used the DNSI in addition to the Google search volumes of terms related to COVID-19 to study the impact of COVID-19 sentiments on S&P sector indices. The results showed that the DNSI is positively related to all sectors but with distinct effects across the sectors. The mutual impact between stock markets and the media was also addressed in (Mamaysky, 2020) , where news sentiment analysis was used to explore the mutual relations between COVID-19 news articles from Reuters and the prices of five asset classes, including the S&P and VIX indices. The study ran a linear regression analysis to infer the causal relations between news and financial economic indices. The author concluded that there is a strong mutual relationship between stock prices and news. It was found that news can be used to infer prices in the future and vice versa. Similarly, to study the impact of media coverage on stock market volatility, Haroon and Rizvi (Haroon & Rizvi, 2020b ) used a panic index (represented by the Ravenpack finance index) to perform sentiment analysis based on COVID-19 news. They employed exponential generalized autoregressive conditional heteroskedasticity (GARCH) models to study the impact of the media on the price returns of multiple stock market indices in the U.S. based on sectoral index data. However, the study found that media coverage and news sentiment had little to J o u r n a l P r e -p r o o f moderate impacts on price volatility. The same conclusion was found by Ambros et al. (Ambros et al., 2020) , who performed a regression analysis to study the impact of news sentiment on the high-frequency price returns of nine stock market indices. Nevertheless, the study acknowledged that COVID-19 news partially influences market uncertainty. Cepoi (Cepoi, 2020) investigated the reactions of stock markets to COVID-19-related news in the six countries with the most infections, including the U.S., the U.K., Germany, France, Spain, and Italy. It was found that the stocks in these countries presented asymmetric dependencies with news related to COVID-19. In the same way, Salisu and Vo (Salisu & Vo, 2020) examined how COVID-19 news has influenced the stock markets in the twenty countries hit worst by the pandemic. To select these countries, they relied on local official statistics of reported cases and deaths. The same countries were also selected in a study by Cepoi (Cepoi, 2020), along with China, Switzerland, the Netherlands, Belgium, South Korea, Turkey, Austria, Canada, Portugal, Brazil, Norway, Australia, and Sweden. The study evaluated the role of health news collected through Google search trends on the predictability of stocks in these countries during the period from January 1 to March 30, 2020. The study revealed that the use of health news improved stock forecasts and had a statistically significant negative effect on stock returns. Esgelhardt et al. (Engelhardt et al., 2020) found that stock markets have mainly been driven by news attention during the COVID-19 pandemic. The authors used a sample of 64 national stock markets, representing 94% of the global gross domestic product (GDP), and used Google search volumes to explore the impact of COVID-19 news on these markets. Moreover, the study estimated that the cost of news hype was approximately USD 200 billion. The study in (Kamal et al., 2021) used the event-study methodology to explore the short-term effects of pandemic news on maritime shipping stocks in the NYSE during eight major COVID-19 news events. The authors considered four of these events as pessimistic and four of them as optimistic. The pessimistic events included the identification of the first cases in China, the outbreak of the pandemic in Italy, the announcement of the World Health Organization (WHO) declaring COVID-19 as a global pandemic, and reaching the new record of over 1 million cases in the US. The optimistic events included the news of easing the lockdown restrictions in many countries, the announcement of extra economic stimuli in the US, the announcement of Oxford University regarding the strong immune reaction to the developed vaccine, and the announcement of the first vaccine approval in Russia. The study found that investors exhibited adverse reactions to pessimistic news and positive overreactions to optimistic news. Similarly, Goodell and Huynh (Goodell & Huynh, 2020) used the event-study methodology to assess the impact of sudden COVID-19-related news on industrial stocks in the US alongside an analysis of the attention levels paid by investors to COVID-19. They found that public attention was associated with abnormal returns in several industries, which were also associated with COVID-related news announcements. Duan et al. (Duan et al., 2021) studied the impact of media sentiments related to COVID-19 on Chinese stocks. The authors collected 1.8 million pieces of official news and 4.5 million Sina Weibo microblogs. They built two sentiment indices by classifying each entry in the dataset as an optimistic, neutral, or pessimistic entry by using a support vector machine (SVM). Then, they used a regression model to investigate the relations between these two indices and Chinese stock market returns and found that the stock returns were positively predicted by the sentiments with both platforms. Baek and Lee (Baek & Lee, 2021) used the daily death rate and recovery rate of COVID-19 as proxies for bad and good news, respectively, and they employed the BEKK-multivariate GARCH model to examine the influences of these rates on the US stock market. The results showed that the stock market was significantly and positively impacted by the volatility of the death rate (bad news) and negatively associated with the volatility of the recovery rate (good news). Furthermore, the study found that the US stock market was affected by bad news much more than by good news. daily data of each market are downloaded from the Investing.com website. The daily closing prices are used to compute the daily returns by setting ̂ ( ) The daily returns of the closing prices of the five GCC stock market indices are illustrated in Figure 1 . The dataset of each market contains economic and COVID-19-related news indicators. Both data types are obtained from Bloomberg. The economic indicators represent the control variables, including local interest rates, local GDP surprises, i oil prices and U.S. stock volatility index (VIX) data. The GDP surprises are calculated as the differences between the Bloomberg consensus forecasts that are usually released before the actual GDP announcements and the actual GDP releases; this indicator captures the surprise that might be experienced by the local stock market. We include the oil prices and VIX indicators because of their effects on market predictability (Alhazbi et al., 2020) . The correlations between the stock markets in GCC countries and oil prices are well documented and exhibit volatility comovement (Alqahtani et al., 2019); the effect of oil prices was significantly larger during COVID-19 than during the previous period (Abuzayed & Al-Fayoumi, 2021) . Additionally, it is found that VIX has a significant positive influence on GCC stocks, and U.S. stock uncertainty spreads to the GCC markets (Benlagha & El Omari, 2021). Furthermore, the GCC markets exhibit connectedness (Hung, 2021) , so to capture the spillover effects on each market, we also include the daily returns of the other GCC markets in the control variables. The COVID-19-related news indicators are also obtained from Bloomberg; these indicators include the number of daily news items on Twitter that are related to each country, the daily ratio of negative items to total items, the number of daily articles in newspapers and magazines that discuss coronavirus and business news related to each country, the daily ratio of negative articles to total articles, and daily Google search trend data. Generally, Google search trends are indicators of the relative popularity levels of all searches on Google at a specific time and place. In our study, Google search trends represent an indication of the community's interest in news about the coronavirus pandemic in each country. Additionally, to capture the severity of the health situation in each country and the global severity, we include the numbers of cases, deaths, and recoveries at both local and global scales. A statistical description of the data is presented in Tables 1, 2, 3, 4, and 5 for Bahrain, Oman, Qatar, Saudi Arabia, and the UAE, respectively. 1 here Insert table 2 here Insert table 3 here Insert table 4 here Insert table 5 here In this study, we utilize machine learning approaches to analyze the impact of COVID-19-related news on GCC stocks. We use three types of machine learning models to perform the analysis: decision trees (DTs), an RF, and XGB. To assess the role of COVID-19 news in the predictability of the stock returns in each country, we conduct two experiments by using the three models for each country. In the first experiment, we use only the control variables (economic indicators), which include the interest rate, GDP surprises, VIX, oil price, and daily returns of other GCC markets. In the second experiment, we use the economic indicator data in addition to the news data. The news data include the number of daily news items on Twitter, the daily ratio of negative Twitter news items, the number of daily business news items, the ratio of daily negative business news items, the daily Google search trends, the number of daily confirmed local cases, the number of daily local deaths, the number of daily global cases, the number of global deaths, and the number of daily local recoveries. The data are split into training (80%) and testing (20%) sets, and we use 10-fold cross validation to measure the performance of each model in both experiments. The difference between the two experiments in terms of the accuracy of each model represents the impact of the utilized news on the model performance, which is measured by using the root mean square error (RMSE). The RMSE measures the discrepancy between the predicted values and the actual values; therefore, smaller RMSE values indicate high model accuracy. The RMSE is calculated as follows: where ̂ represents the predicted price returns produced by the regression model and y represents the actual price returns. All the experiments are conducted in Python with the Scikit-learn (Pedregosa et al., 2011 )(Pedregosa et al., 2011 library. The data are normalized between 0 and 1 via min-max normalization to scale the data variability according to the following formula. where represents the actual input variable and ̂ represents the scaled values. The following subsections describe the three models used in this study. A regression tree is constructed via top-down induction by splitting the given samples based on the input feature set; the optimal split is obtained based on one input feature, and the best gain metric criteria are considered at each step. Ultimately, a set of rules is maintained based on the best metric criteria. This procedure is repeated recursively until splitting does not produce better predictions or the subset of leaves has values similar to those of the target variable. The recursive procedure can also be stopped if the maximum depth limit is reached (Loh, 2011) . Different metrics are commonly used to determine the best subset splits for measuring the homogeneity of the predicted variables for each subset. In our experiments, the MSE is used as the objective function of the splitting criterion, where different subsets are compared, and the best split is selected. For example, to split a particular set into two subsets S A and S B , one must seek the plane that minimizes the squared errors of the two subsets, as follows (Loh, 2011) . where y i and ̂ represent the target variable before and after performing the split decision, respectively. Therefore, to determine the level of improvement yielded by the split rule, variance reduction is employed in cases where the target variable is continuous. To measure the variance reduction I v for any split rule at any node N in the regression tree, the regression decision tree uses the following formula (Loh, 2011) . In both experiments, we set three stopping criteria: the maximum depth is five, the minimum number of samples is equal to two for each split, and the minimum number of samples that must be in a leaf is set to one. An RF is composed of a collection of DTs trained by using a bootstrap sample drawn with replacement from the training data. Each tree in the forest is constructed based on an input random vector that aims to add a randomness factor to the process to avoid overfitting the data. The collection of tree predictors is denoted by h(x;η k ),k = 1,....,K, where x represents the input data and η k denotes independent and identically distributed random vectors. For regression problems, the output of an RF is an estimation of a continuous variable Y. The final predictions of the tree predictors are then averaged to produce final prediction values, as follows: The bootstrap aggregation technique is used by the RF algorithm to conduct the training process. For each DT, a sample from the training data is selected with replacement to train the tree. The bootstrapping procedure reduces the variance of the model and does not increase the bias. Unlike a traditional DT, an RF performs a random split on each node based on a random sample of input features. As in the DT mode, we set three stopping criteria: the maximum depth is five, the minimum number of samples is equal to two for each split, and the minimum number of samples that must be in a leaf is set to one. Furthermore, we set the number of trees (estimators) to 100. These hyperparameters are the same in both experiments. XGB is a DT-based method developed based on boosting tree models that uses a gradient boosting technique. This algorithm outperforms many machine learning techniques on small-to-medium structured/tabular data. The difference between this algorithm and other traditional boosting tree models is that it enables parallel CPU computing via multithreading. It also utilizes different methods to prevent overfitting. Gradient boosting is employed by the XGB algorithm to combine many DT learning models into a stronger learner in an iterative manner. The residual at each iteration is used to improve the prediction accuracy of the previous predictor to optimize the objective loss function. The objective function also includes a regularization term to prevent overfitting. The objective function is given by where β represents the parameters adjusted by the training data, the loss function is denoted by L, and ω is a regularization term that is used to prevent overfitting based on model complexity measures. The where the number of leaves in the tree is denoted by T and the complexity of each node is also used in the regularization function and denoted by γ. The penalty scaling parameter is denoted by λ, and the scoresof-leaves vector is denoted by w. A second-order Taylor expansion is then utilized for the loss function instead of the first-order expansion, which is normally used in the traditional gradient boosting approach. The objective function can be defined using the MSE loss function (Chen & Guestrin, 2016): where the first and second derivatives of the MSE loss function are denoted by g i and h i , respectively. q(.) is a function that assigns data points to the corresponding leaf. The loss function can also be expressed by the sum of the loss values of all leaf nodes as follows: More details about the XGB model can be found in (Chen & Guestrin, 2016) . As in the RF, we set the number of trees in this model to 100, the maximum tree depth is set to six, the learning rate is set to 0.2 in both experiments, and the gamma regularization term is set to 0.08. Table 6 illustrates the hyperparameters used for the three models. To identify which indicators in the news dataset have greater impacts on the markets in the UAE, Qatar, and Saudi Arabia, we use the feature importance metric offered by the scikit-learn machine learning library. It is the score that is assigned to each feature and represents the extent to which the feature influences the model relative to other features. In other words, this metric represents the model accuracy reduction induced when the corresponding feature is removed. Since we perform 10-fold cross validation and we have three models for each country, we average the importance of each feature over the thirty trained model iterations for each country. In scikit-learn, the importance of a feature in a DT is calculated by measuring how effective the feature is at reducing the impurities of nodes, so it is computed as the normalized total of the weighted impurity decreases in all nodes that are split by using that feature. The impurity of a node in a DT for regression is calculated by using the MSE (which is also called variance reduction), as follows: where n is the number of instances in a node, y is the value of an instance, and µ is the mean of the instances in this node. The weighted impurity decrease (reduction) of a split in node based on feature j is calculated as follows: where the weight is the ratio of the number of instances in a node to the total number of instances before splitting. "Left" and "right" represent the left and right children of a node in the tree, respectively. The normalized importance of feature j is calculated by dividing the total variance reduction achieved when the feature is used in all nodes by the total variance reduction achieved by using all features as follows: The feature importance for the RF and XGB models is calculated by dividing the total feature importance in all the trees in the model and the number of trees in the model. Table 7 and Figure 2 present the results of the two experiments conducted for each country with each of the three models. The results show that the XGB model has the highest accuracies in both experiments for all stock markets, and its RMSEs are the smallest in both experiments compared to those of the other two models. The RF model ranks second in terms of accuracy for all markets in both experiments, and the DT model has the lowest accuracy. The results indicate that there are differences between the GCC countries in terms of the impact of coronavirus news on model predictability. The accuracies of the three models are improved when COVID-19 news indicators are included among the input features for market predictions in the UAE, Qatar, Saudi Arabia, and Oman. However, including these indicators does not improve the accuracy of any of these models in the case of the Bahrain stock market. Moreover, the improvements in the prediction abilities of the three models vary among the impacted markets; the improvement in the Oman market case is very small compared to those in the UAE, Saudi Arabia, and Qatar market cases. Furthermore, the results show that the improvements yielded by the three models are larger in the UAE and Qatar cases than in the case of Saudi Arabia. This reveals that COVID-19 news has an impact on stock market returns in the UAE, Qatar, Saudi Arabia, and Oman but has no impact on the stock market in Bahrain. It also indicates that COVID-19 news has very little impact on the stock market in Oman compared to that on the markets in the UAE, Qatar, and Saudi Arabia. Furthermore, the impacts on the stock markets in the UAE and Qatar are greater than the impact on the market in Saudi Arabia. This finding coincides with that of the study by (Al-Maadid et al., 2020) , which found that the leading stock markets (in the UAE, Qatar, and Saudi Arabia) are more sensitive to news in general than the markets in Oman and Bahrain. A possible explanation for this difference maybe be related to the characteristics of these markets. The markets of Bahrain and Oman are the smallest in terms of their total market capitalizations and trading values. Table 8 compares the market capitalizations and trading values of these markets by the end of 2019, i.e., before the COVID-19 pandemic. It is also worth mentioning that according to the MSCI classification, the markets in Oman and Bahrain are considered frontiers, while the markets in the UAE, Qatar, and Saudi Arabia are considered emerging markets. The markets in Qatar and the UAE were upgraded from frontier to emerging markets in 2014, while the market in Saudi Arabia was upgraded in 2019. Usually, frontier markets have less liquidity than emerging markets. Less liquidity with small market capitalization and small daily trading values, such as in the cases of Oman and Bahrain, allows large institutional investors in the market to control price changes and reduce volatility; this is usually achieved by the institutions that are owned by the government. Furthermore, comparing the individual daily trading value percentages to those of the institutions, the differences are found to be greater in the UAE, Qatar, and Saudi markets than in Oman and Bahrain. Previous studies (W. Li et al., 2017) (Merli & Roger, 2013) found that individual investors herd more than institutional investors during crises. Accordingly, the levels of herding among the investors in the three larger markets (the UAE, Qatar, and Saudi Arabia) are expected to be higher than those in the markets of Bahrain and Oman. Because herding behavior is increased by investors' fears, its impact on stock markets was more visible during COVID-19 due to the uncertainty regarding the future of the global economy. Additionally, the economies of these countries vary in terms of their degrees of openness and connectedness with the global economy; for example, one of the sectors most influenced during COVID-19 was the real state sector in the UAE because Chinese businessmen are major investors in this sector. Additionally, the cancellation of large events such the Hajj (pilgrimage) in Saudi Arabia and Dubai's 2020 Expo resulted in the loss of hundreds of millions of dollars (Salman & Ali, 2021) . Accordingly, the investors in these countries are expected to be more sensitive to any global news that might affect the economy and stock markets. Table 9 and Figure 3 illustrate the importance of COVID-19 news features to the impacted markets. The results show that in the case of the UAE, the top three features are Google search trends, Twitter news, and the number of local recoveries. In the case of Qatar, the top three features are Twitter news, Google search trends, and the number of global deaths. In the case of Saudi Arabia, the top three features are negative business news, Twitter news, and the number of local cases. In the case of Oman, Google search trends are the most important, followed by the number of global cases and the number of global deaths. We note that Twitter news has a significant impact on the stock market in Qatar. Its importance is 34% and represents the top indicator. It also greatly impacts the stock markets in Saudi Arabia (8.6%) and the UAE (9%), and it is ranked second in terms of importance in both cases but is only slightly important in the case of Oman (2%). Google search trends also exhibit high importance in the cases of the UAE (13%), Oman (13%), and Qatar (9%). Negative business news has high importance in the case of Saudi Arabia (16%) but has no importance in the other impacted markets. These differences among the impacted markets in terms of feature importance might be related to the different sources of news that investors pay attention to in these markets, which are affected by a variety of factors, such as internet penetration and social media popularity. According to a report (Mashino, 2019) , Qatar and the UAE have the highest rates of social media use among their populations, followed by Saudi Arabia, while Oman has the lowest rate. More research is needed to reveal the differences among the individual investors in these markets. On the other hand, we note that the number of local death cases has a very low impact on the four markets because the general numbers of coronavirus-related deaths in the GCC countries were lower than those in other countries, such as China, the U.S., and European countries. In contrast with the study of (Ashraf, 2020b), who researched the impact of COVID-19 news on the GCC during the period between April 1 st , 2020, and June 26 th , 2020, our study covers a longer period, so it captures the periods before any local cases were reported and during the global stock decline, as well as the market recovery period. This might explain the inconsistency in the findings, where we find that the number of local deaths has a low impact on the four markets. Furthermore, the study in (Ashraf, 2020b) did not investigate the impact on each market individually. In this study, we use three machine learning models, namely, DTs, an RF, and XGB, to investigate the impacts of different types of COVID-19-related news on the stock markets in the GCC countries. The results indicate that the stock markets in the UAE, Qatar, Saudi Arabia, and Oman were influenced by news related to COVID-19, but the stock market in Bahrain was not impacted. This finding is consistent across all three models, and the results show that the XGB model has the highest accuracy, followed by the RF, while the DT model has the lowest accuracy. Furthermore, the results reveal that the UAE and Qatar were impacted more than Saudi Arabia, and the impact was nonsignificant in the case of Oman. This difference might be due to the differences among these markets in terms of their sizes, types, and investor types. The markets in Bahrain and Oman are the smallest markets in the GCC countries; they are classified as frontiers, while the markets in the UAE, Qatar and Saudi Arabia are considered emerging markets. Usually, most of the daily trading that takes place in the markets of Oman and Bahrain is achieved by institutional investors who can control the prices and manage the volatility in these small markets. On the other hand, individual investors have the main role in the three largest markets, so hoarding behavior is more pronounced in these markets during crises. Overall, the results of this study demonstrate that the GCC markets vary in terms of their responses to COVID-19-related news, which has important implications for investors in these markets, who should diversify their portfolios across the markets. These results can be used by investors and portfolio managers to develop risk management strategies that take news related to large global events or disasters into consideration. These strategies include developing monitoring and analysis tools to track the intensity of the global news volume during crises and assess its impact on GCC stock markets. To mitigate risk, different types of news sources should be included, including traditional outlets and social media. Furthermore, the results indicate that the impacted markets were influenced differently by various news types; for example, Twitter news greatly impacts the stock markets in Qatar, the UAE, and Saudi Arabia. However, negative business news has a high impact only on the Saudi Arabian market. Future work can be performed to further investigate the relationships between Twitter news and the stock markets in the GCC over a longer period beyond the COVID-19 pandemic. Additionally, more research is needed to investigate the impacts of COVID-19 news on the different sectors in these markets. One of the limitations in this study is that only U.S. stock volatility (VIX) is considered for capturing the spillover from international markets to the GCC markets; contagion from other markets (such as China) can be addressed in future research. Al-Maadid, A., Caporale, G. M., Spagnolo, F., & Spagnolo, N. (2020) . The impact of business and political news on the GCC stock markets. Risk spillover from crude oil prices to GCC stock market returns: New evidence during the COVID-19 outbreak Google search intensity and its relationship to the returns and liquidity of Japanese startup stocks. Pacific-Basin Finance Journal Financial contagion during COVID-19 crisis COVID-19 media coverage and ESG leader indices Death and contagious infectious diseases: Impact of the COVID-19 virus on stock market returns COVID-19 Sentiment and the Chinese Stock Market: Evidence from the Official News Media and Sina Weibo Google search and stock returns: A study on BIST 100 stocks What Drives Stocks during the Corona-Crash? News Attention vs COVID-19 and finance: Agendas for future research Did Congress trade ahead? Considering the reaction of US industries to COVID-19 How do equity markets react to COVID-19? Evidence from emerging and developed countries Flatten the Curve and Stock Market Liquidity -An Inquiry into Emerging Economies. Emerging Markets Finance and Trade COVID-19: Media coverage and financial markets behavior-A sectoral inquiry Financial connectedness of GCC emerging stock markets Stock market reactions of maritime shipping industry in the time of COVID-19 pandemic crisis: an empirical investigation Exploring the Initial Impact of COVID-19 Sentiment on US Stock Market Using Big Data Reaction to news in the Chinese stock market: A study on Xiong'an New Area Strategy Web Media and Stock Markets : A Survey and Future Directions from a Big Data Perspective Differences in herding: Individual vs. institutional investors. Pacific-Basin Finance Journal Classification and regression trees Financial Markets and News about the Coronavirus THE SPREAD OF SOCIAL MEDIA IN THE GCC AND THE POTENTIAL FOR ITS UTILIZATION IN B2C BUSINESS Impact of macroeconomic factors and country risk ratings on GCC stock markets: evidence from a dynamic panel threshold model with regime switching What drives the herding behavior of individual investors? Google search and stock returns in emerging markets Stock markets and the COVID-19 fractal contagion effects Scikit-learn: Machine learning in Python Predicting stock returns in the presence of COVID-19 pandemic: The role of health news Covid-19 and its impact on the stock market in GCC Underreaction to News in the US Stock Market Non-scheduled news arrival and high-frequency stock market dynamics Investment Strategy Optimization Using Technical Analysis and Predictive Modeling in Emerging Markets Investor herds and oil prices evidence in the Gulf Cooperation Council (GCC) equity markets Infected Markets: Novel Coronavirus, Government Interventions, and Stock Return Volatility around the Globe A Data-Driven Design for Fault Detection of Wind Turbines Using Random Forests and XGboost Financial markets under the global pandemic of COVID-19 Highlights  Among GCC markets, the stock markets in United Arab Emirates (UAE), Qatar, Saudi Arabia, and Oman were impacted by coronavirus-related news, while Bahrain stock was not influenced.  The impacted markets were influenced differently in terms of the quantity and the news types.