key: cord-0797920-q9v67fxn authors: Zebende, G.F.; Santos Dias, R.M.T.; de Aguiar, L.C. title: Stock market efficiency: An intraday case of study about the G-20 group date: 2022-01-24 journal: Heliyon DOI: 10.1016/j.heliyon.2022.e08808 sha: 9c7cf2898cc60b18ba3f72a97706eea9481f8424 doc_id: 797920 cord_uid: q9v67fxn Given the importance of the financial markets in the global context, data analysis and new statistical approach are always welcome, especially if we are referring to G-20 group (the world's richest countries). As we know, the pandemic outbreak of COVID-19 has affected the global economy, and its impact seems to be inevitable (as it was in 2020). From the perspective of what was raised above, this paper aims to analyze the stock market efficiency in 21 indexes of G-20. We are going to do our analysis with intraday scale (of hour), from May 2019 to May 2020. In order to be successful in this analysis, we applied the DFA and the DCCA methods, to identify or not two points: i).. Are G-20 stock market efficient in their weak form? ii).. With open/close values, it is possible to identify some type of memory in G-20 group? The answer to these points will be given throughout this paper. For this purpose, the entire analysis will be divided into two different time-scale: Period I, time-scale less than five days and Period II, with time-scale greater than ten days. In the pandemic times of COVID-19, our results show that taking into account the DFA method, for time-scale shorter than 5 days, the stock markets tend to be efficient, whereas for time-scale longer than 10 days, the stock market tend to be inefficient. But, with DCCA method for cross-correlation analysis, the results for open/close indexes show different types of behaviors for each stock market index separately. The outbreak of COVID-19 has caused global concerns. On January 30, World Health Organization (WHO) declared the pandemic to be a global health emergency. The easy spread of this virus has caused uncertainty in the global population. This epidemic has also changed people's lifestyles, and (bi)millions of people have been placed in isolation to reduce the transmission of the virus. Companies have closed to control the spread of the virus, causing loss of income and leading to significant levels of unemployment. Worldwide, flights have been canceled and transportation systems have been shut down. The clear consequence of this episode is that economic activities have been disrupted and many countries and stock exchanges have dropped sharply [1, 2, 3, 4] . As the disease has spread rapidly around the world, there is an enormous amount of data concerning the COVID-19, as never before seen (in many countries with under-reporting) for: active, recovered, and dead cases. The social isolation, a way to contain the spread of the G20 group (with intraday data). For this proposal we applied the DFA method, which will assess the presence or not of long memories in these time-series, and also, with the DCCA method, we analyze the cross-correlation between Open/Close indexes of the stock markets individually, in order to assess whether the stock market adjust efficiently, see [23] for more details. Therefore, the main objective of this paper is to contribute in the informational efficiency for G-20 group in the context of COVID-19. For this implementation, the paper was organized as follows in the text: Section 2.1, presents the state of the art regarding the literature on efficient market hypothesis; Section 2.2, introduces the methodology; Section 3, exhibit the results; and Section 4, concludes the paper. The efficient market hypothesis, EMH, of Fame [24] claims that the financial markets are efficient in relation to the information. In other words, an investor is not able to consistently achieve returns above the market average (with a certain level of risk), considering only the publicly available information at the present time. There are three assumptions: • The weak hypothesis, that considers that the prices reflect all publicly available historical information; • The semi-strong hypothesis, where the prices reflect all publicly available information, and also that the prices change instantly to reflect the new public information; • The strong hypothesis, where the prices reflect instantly the hidden or privileged information. Many articles have been published within this premise, such as: [25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36] . Therefore, according to the efficient market hypothesis, on the weak form, we intend to contribute with a new statistical approach, following the methodology presented right below. The two statistical methods to analyze the stock market indexes, the first for auto-correlation and the other for cross-correlation analysis (with its referred coefficient), will be described shortly below in two sub-sections. [14] The DFA algorithm, for long-range auto-correlation analysis, is presented here in six steps: 1 -Consider a time-series { } with = 1, ..., (time-series length). We integrate to obtain where < > is the time-series mean, and = 1, ..., ; 2 -The signal, , is divided into ( − ) overlapping boxes of equal length (time-scale). Each box containing ( + 1) values, that starts at and ends at ( + ); 3 -For each box, we compute the local trend (a linear least-squares fit) on , in order to obtain the adjusted value, ̃, , with ( ≤ ≤ + ); 4 -The integrated signal is subtracted with ̃, (detrended) to obtain: 5 -Now, for a given time-scale of length , -( ) is calculated by: More details about the DFA algorithm can be seen at the link below. https://physionet .org /content /dfa /1 .0 .0/ Therefore, if long-range auto-correlation appear, then -∼ , and is a self-affinity parameter representing the long-range autocorrelation, such that [37] : long-range persistent DFA method is able to identify seasonal components [38, 39, 40] , and -( ) can be applied to discriminate different types of signals [41] . However, we can applied the DFA generalization to analyze crosscorrelation between two time-series, that is, the DCCA method [15] , through its robust coefficient, , described just below. [16] For two time-series, { } and { }, of equal length , we compute with = 1, ⋯ , . Then, we generalized the DFA algorithm (six steps described above in DFA algorithm) to find 2 ( , ) ≡ 1∕( +1) ∑ + = ( − , )( −̃, ) (covariance of the residuals) and the detrended function by: as a function of for SPX return index with Open/Close cases. This figure is split in two-time domain: Period I with < 5 and Period II with > 10 days. But, for quantify the level of cross-correlation, the DCCA crosscorrelation coefficient can defined as the ratio between the detrended cross-correlation function, 2 -, , and the detrended autocorrelation function, -and -, for the time-series { } and { }, respectively: Some properties of naturally appear, the most important is that: In this case, = 0.0 means there is no cross-correlation between { } and { }, and it splits the level of cross-correlation between positive and the negative case. This coefficient has been tested on selected time-series [21, 42] and proved to be quite robust, mainly for statistical analysis between non-stationary time-series [43, 44, 45, 46, 47, 48] , among other cases. It is noteworthy that there is the generalization, for more than two time-series analysis, what we call multiple DCCA coefficient, For this project we acquired the Open (first value at this time) and the Close (last value at this hour) data indexes, related to the 20 largest world economies, briefly identified as G-20 group. The indexes were obtained from Thomson Reuters platform (values in local currency, ), with intraday resolution (hour) and with one year of data, see Table 1 with these indexes for more details. In addition to the top twenty financial market indexes, we included the S&P 500, or SPX, a stock market index that measures the stock performance of 500 large companies listed on stock exchanges in the United States. It is one of the most commonly followed equity indexes, and many consider it to be one of the best representations of the U.S. stock market. As each index based on its own criterion, and that they usually have a very high standard deviation around their mean value, here we will analyze the return, commonly applied in economic and financial time-series analysis, i.e.: As an example, Fig. 1 present the SPX index and respectively the return, with Open(black line)/Close(red line) values. In the naked eye, by the Fig. 1 (a) , it is practically impossible to identify an appreciable difference between the values. This figure shows that the return has mean around zero allowing us to analyze basically the fluctuations (volatility). Also, the time-series comprises a complex time, with the beginning in January of the global pandemic outbreak (COVID-19) . The return reveals a high volatility in the subsequent months, and the Open/Close indexes go together in this process. For an initial analysis, in the section 3.2 we present the descriptive statistical for all indexes, with Open/Close values. For all indexes the mean value is around zero, but with different standard deviation values, see Fig. 2 (a) . The standard deviation, sd, is presented in descending order (for open case), with Merval, IBOV, and SENSEX being the highest values, and JPNK400, JCI and IMOEX the lowest values. In the point of view of the degree of asymmetry of the distribution, Skewness (sk), the values are in general negative, see Fig. 2 (b) , that is, the distribution has a heavier left tail. Finally, on the rule of classical descriptive analysis, we measure the Kurtosis, "the peakedness", of the distribution. The excess of kurtosis, with kurt=0, tells us that the distribution is well adjusted to the normal distribution. The results showed that the distributions are leptokurtic (more concentrated than the normal distribution), mainly in the MERVAl index, see Fig. 2 (c) . We can see that the results (sd, sk, and kur) for Open/Close values are very similar for all indexes. Therefore, if we want to produce a more sophisticated statistic, that give us a better information about As an example of the procedure for calculating the fluctuation function, × , we choose the SPX index, as shown in the Fig. 3 . This figure shows the DFA method applied to Open (left panel) and Close (right panel) return index. With the fluctuation function, , we realize there are two possible intervals for further analysis, that are: 1. Period I, with < 5 days; 2. Period II, with > 10 days. To confirm our premise, the complete result for all indexes can be accessed in the Supplementary Figures, with × . Also, the complete DFA analysis can be found in the Table 2, here with value at Open/Close cases, described in the Periods I and II, respectively. It should be noted that in the Table 2 , the column h/day, means the number of hours on the specified stock market on a trading day. Continuing the analysis via DFA, Fig. 4 summarizes all results for . Most indexes presented an exponent similar, if we look at these two time periods. This fact can be seen easily by visual comparison between Figs. 4 (a,c) and Figs. 4 (b,d) . Also, we can see in Figs. 4 (a,b) for Period I, that in the average ≃ 0.50 (see the last line in the Table 2), theoretically designated here as a stock market weakly efficient. However, individually we can identified 0.5 < < 0.5. Therefore, in G-20 group, MERVAL, SASEIDX, and MEXBOL are among those who have the greatest persistence ( > 0.5), while IMOEX, DAX, and CAC are anti-persistent stock market ( < 0.5). However, for the Period II, the stock market indexes show persistent behavior, and on average the Open/Close indexes had the same values, = 0.65. But, each index has its own value, with FTSEMIB, SPTSX, and DAX the biggest and SHCOMP, SASEIDX, and MEXBOL the lowest values. With the result presented by the DFA method, via exponent, it was not possible in general to identify a significant difference between the Open/Close return index. However, it is possible to measure how much these values are mutually related, and for this task we can apply the DCCA method, and the robust DCCA cross-correlation coefficient, . The objective in this sub-section was to apply the between Open/Close return indexes. As an example of this application can be viewed in Fig. 5 with SPX index. Right now we can see as a function of the time-scale , with cross-correlation coefficient divided between ± weak, ± medium, and ± strong values (horizontal dash lines). For more technical details see [43] . Also, in the figure of ( ), we can see those two time Periods (I and II), with < 5 and > 10 days (vertical red lines). With measured between Open/Close it is possible to see that its value is greater than 0, meaning there is a positive cross-correlation between the Open/Close return, but this cross-correlation is not perfect, with < 1 for small timescale, i.e., < 5 days. On the shortest time-scale, = 4 hours, appears in the transition from weak to medium cross-correlation. However, tends to strong cross-correlation quickly, with this effect in less than one business day. For long time-scale, namely > 10 days, ∼ 1. The rest of the results for all indexes are presented in the Figs. 6 and 7 respectively, with as a function of . The results show that for time-scale > 10 days for all G-20 group, tents to a perfect cross-correlation between the Open/Close return. However, for time-scale < 5 days there is a noticeable difference. It is important to highlight that the results for short time-scale only appear because originally intraday data, with time-scale of hours, was used. Continuing, it is clear to say that, for small time-scale, < 5 days, the results for divided the G-20 group in three different samples: emerging (with weak cross-correlation), intermediate (with medium cross-correlation), and consolidated (with strong crosscorrelation) economies. The exception noted here was seen for the United Kingdom, because this stock market presents hybrid results, and can have two interpretations: first due to the pandemic COVID-19, that all markets are currently subjected or because a causality referring to the BREXIT, see these papers [7, 51] . The COVID-19 pandemic outbreak has affected the global economy, and its impact on all financial markets is/was inevitable. In light of these events this article analyzes the stock market efficiency, in its weak form, among the G-20 group of more advanced economies with intraday data of price (in time-scale of hour), from May 2019 to May 2020 (one year). Here we analyze mainly two issues: (i) Are these stock markets efficient? If yes, (ii) could the presence of long-memories call into question the diversification of portfolios? In order to answer such questions, we performed two types of analysis with the return of prices, by auto-correlation (with DFA) and by cross-correlation (with DCCA). Starting with DFA method, and with the premise that the most efficient stock market will be those with = 0.50, we were able to identify first two time-scale, one with < 5 (Period I) and the other with > 10 days (Period II). For < 5 days the stock markets tend to be more efficient, with ≃ 0.50 on average. But, looking at individually, the following comparison can be performed between ( -open; -close): (i) for the Period I, -open ≃ -close, with a value close to 0.5, i.e., most efficient in their weak form. (ii) for the Period II, of long time-scale, the results present long-range auto-correlations with ≃ 0.65 on average, going out of its weak form of efficiency. Therefore, the DFA method was unable to differentiate the opening from the closing index. Then, in order to improve the study of the efficiency of stock market with Open/Close values we applied coefficient. The results show that for > 10 days, the value of tends to 1.00 (perfect DCCA cross-correlation). However, for small time-scales, < 5 days, show existence of weak cross-correlations in emerging economies (except UKX), and cross-correlations values ranging from medium to strong analyzing the consolidated economies. Thereby, it was possible to identify, in a short time-scale (less than 5 days), the most efficient stock markets. These results clearly answer the questions raised at the beginning of this paper. Finally, as a suggestion, we will investigate the entire effect of the pandemic on the G-20 indexes. In this sense, our proposal will be to analyze these indexes for two periods, one before the COVID-19 and another during/"after" the COVID-19. Preliminary result in this direction can be see in the Supplementary Figures, for S&P 500 index. Gilney F. Zebende, Rui M. Santos Dias and Lazaro de Aguiar: Conceived and designed the experiments; Performed the experiments; Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data; Wrote the paper. This work was supported by the Conselho Nacional de Desenvolvimento Científico e Tecnológico (310136/2020-2). Data associated with this study is available online at Thomson Reuters (https://finance .yahoo .com /quote /TRI /?guccounter =1) The authors declare no conflict of interest. Supplementary content related to this article has been published online at https://doi .org /10 .1016 /j .heliyon .2022 .e08808. Environmental perspective of COVID-19 Global economic and financial effects of 21st century pandemics and epidemics, SSRN Electron Stock markets' reaction to COVID-19: cases or fatalities? The COVID-19 outbreak and affected countries stock markets response An Introduction to Econophysics: Correlations and Complexity in Finance Beyond econophysics (not to mention mainstream economics) An econophysics approach to study the effect of BREXIT referendum on European Union stock markets Testing the random walk hypothesis for real exchange rates Integration in central European capital markets in the context of the global covid-19 pandemic COVID-19 pandemic and its influence on safe havens Detrended correlogram method for non-stationary timeseries analysis Heterogeneity in economic relationships: scale dependence through the multivariate fractal regression Simplified calculations of time correlation functions in non-stationary complex financial systems Mosaic organization of DNA nucleotides Detrended cross-correlation analysis: a new method for analyzing two nonstationary time series DCCA cross-correlation coefficient: quantifying level of crosscorrelation Short-term predictability of crude oil markets: a detrended fluctuation analysis approach An investigation of Forex market efficiency based on detrended fluctuation analysis: a case study for Iran The cross-correlations of stock markets based on DCCA and time-delay DCCA, Nonlinear Dyn Modeling traffic flow correlation using DFA and DCCA DCCA cross-correlation coefficient differentiation: theoretical and practical approaches Several fundamental properties of DCCA crosscorrelation coefficient Zebende, between openclose stock markets Efficient capital markets: a review of theory and empirical work Does the stock market rationally reflect fundamental values? Dividend yields and expected stock returns Empirical evidence on weak form of efficiency in Indian stock market Are Islamic equity indices more efficient than their conventional counterparts? Evidence from major global index families Testing weak form of efficient market hypothesis: empirical evidence from South Asia Do foreign exchange and equity markets co-move in Latin American region? Detrended cross-correlation approach How long is the memory of the US stock market? Investigation of market efficiency and financial stability between s&p 500 and London stock exchange: monthly and yearly forecasting of time series stock returns using arma model Global financial crisis and weak-form efficiency of Islamic sectoral stock markets: an MF-DFA analysis Stock market efficiency: a comparative analysis of Islamic and conventional stock markets Are stock prices a random walk? An empirical evidence of Asian stock markets Random walks and market efficiency in Chinese and Indian equity markets Self-Organized Biological Dynamics and Nonlinear Control Cross-correlation between time series of vehicles and passengers Analysis of the variability in the sdb star KIC 10670103: DFA approach Autocorrelation and cross-correlation in time series of homicide and attempted homicide Analysis of the eeg bio-signals during the reading task by DFA method Statistical test for Δ cross-correlation coefficient Statistical tests for power-law cross-correlated processes Measuring cross-correlation between non-stationary series with DCCA coefficient DCCA cross-correlation analysis in time-series with removed parts Effect of weather on agricultural futures markets on the basis of DCCA cross-correlation coefficient analysis DCCA cross-correlation coefficient apply in time series of air temperature and air relative humidity applied between air temperature and relative humidity: an hour/hour view Detrended multiple cross-correlation coefficient Detrended multiple crosscorrelation coefficient applied to solar radiation, air temperature and relative humidity Differential market reactions to pre and post BREXIT referendum G.F. Zebende thanks CNPq (Conselho Nacional de Desenvolvimento Científico e Tecnológico) (Grant 310136/2020-2), Brazilian agency.