key: cord-0067939-m8ybta2k authors: Aslam, Muhammad title: Assessing the Significance of Relationship Between Metrology Variables under Indeterminacy date: 2021-09-27 journal: MAPAN DOI: 10.1007/s12647-021-00503-8 sha: e97865c79aa7acb4cee4acaa9e1ea6183e532d0c doc_id: 67939 cord_uid: m8ybta2k This paper introduces a Z-test for two correlation coefficients under neutrosophic statistics. The necessary steps to implement the proposed test are given. The statistic of the proposed test under indeterminacy introduced the first time. The application of the proposed test is given using temperature and wind speed data. From the analysis of the energy example, it is concluded that the proposed test is more efficient than the Z-test for two correlation coefficients under classical statistics in terms of the measure of indeterminacy, information, and flexibility. Based on the comparative study, it is recommended to apply the proposed test in the areas of energy and weather. In practice, the practitioners, experimenters, and energy experts are interested to investigate the relationship between variables at different places and years. This objective can be obtained from the study of the correlation analysis. The Z-test for two correlation coefficients under classical statistics is quite helpful to the significance between two variables recorded at different places and years. The main aim of this test is to see either the relationship between a pair variable recorded at one place/year is significantly different from a pair of variables recorded at the second place or another year. This type of study will guide them to estimate/forecast the relationship between these variables for the next years. Kumar and Kumar [1] studied the relationship between metrological and COVID-19 pandemic. Weaver and Wuensch [2] , Lee [3] , Chen [4] , Giakoumis [5] , Shan et al. [6] and Rezaee et al. [7] studied various applications of correlation analysis in a variety of fields. Statistical analysis has been widely applied for analyzing the wind speed for forecasting purposes. The correlation analysis is also helpful in analyzing the wind speed and temperature relationship. For example, the energy experts or meteorologists may interest to see the relationship between the wind speed and temperature in the year 2019 and the wind speed and temperature in the year 2020. Bechrakis and Sparis [8] used the correlation to study the relationship between wind speed of various stations. Su et al. [9] studied the relation between wind speed and wind turbines. Shen et al. [10] presented the work on the correlation between energy variables. More applications of statistical method in forecasting and analysis of wind speed can be seen in Rahimiyan [11] , Arias-Rosales and Osorio-Gómez [12] , Katinas et al. [13] , Min et al. [14] , Barhmi et al. [15] , and Ben et al. [16] . The Z-test for two correlation coefficients under classical statistics is workable under the assumption that the data follow the normal distribution and have all determined observations. In real life, the wind speed and temperature data are recorded in intervals. In such cases, when the data are intervals or there is uncertainty in parameters or observations, the statistical methods using fuzzy logic are applied. Damousis et al. [17] studied the relationship between energy-related variables using fuzzy logic. Some statistical tests developed using the fuzzy logic can be seen in Montenegro et al. [18] , Petković [19] , Grzegorzewski and Ś piewak [20] , Sezer et al. [21] and Nie et al. [22] . The neutrosophic logic is said to be more efficient than the fuzzy logic as it gives additional information about the measure of indeterminacy, see Smarandache [23] . Smarandache and Khalid, [24] proved its efficiency over the fuzzy logic and interval-based analysis. Several applications of the neutrosophic logic can be read in Abdel-*Corresponding author, E-mail: aslam_ravian@hotmail.com M APAN-Journal of Metrology Society of India https://doi.org/10.1007/s12647-021-00503- 8 Basset et al. [25] , Smarandache [26] and Nabeeh et al. [27] . Smarandache [28] introduced the neutrosophic statistics that gives efficient results when data are in the interval, imprecise and indeterminate. Chen et al. [29, 30] discussed the measure of indeterminacy evaluation for neutrosophic numbers. Aslam [31] presented the wind forecasting method under neutrosophic statistics. More details can be seen in Aslam [32] and Aslam [33] . A rich literature on wind speed analysis using correlation under classical statistics and fuzzy logic is available in the literature. The existing tests are unable to give information about the measure of indeterminacy. By exploring the literature and best of our knowledge, there is no work on Z-test for two correlation coefficients under neutrosophic statistics. In this paper, we will originally introduce a Z-test for two correlation coefficients under neutrosophic statistics. The statistic of the proposed test will introduce under indeterminacy. The application of the proposed test will be given on the energy data with the expectation that it will be efficient and flexible to apply under uncertainty. It is expected that the proposed test will be helpful for studying the relationship between wind speed and temperature at various places, stations and years. . . .; n N Þ; I 1yN 2 I 1yL ; I 1yU  à be a pair of neutrosophic random of the first sample of sample size n N 2 n L ; n U ½ : Let à be a pair of neutrosophic random of the second sample, where I 1xN 2 I 1xL ; I 1U ½ and I 2yN 2 I 2yL ; I 2yU  à present the corresponding measure of indeterminacy. Note that X i1L ; Y i1L ; X i2L and Y i2L present the determined observations and X i1U I 1N ; Y i1U I 1N ; X i2L and Y i2L are indeterminate part of neutrosophic forms. Based on this information, the neutrosophic correlation for the first sample, say r 1N 2 r 1L ; r 1U ½ can be defined as follows The neutrosophic correlation for the second sample, say r 2N 2 r 2L ;r 2U ½ can be defined as follows The neutrosophic correlation, say r N 2 r 1N ;r 2N ½ is defined as follows The neutrosophic form r N 2 r 1N ; r 2N ½ can be written as In Eq. (4), the first part r 1N 2 r 1L ; r 1U ½ denotes the correlation under classical statistics and r 2N I rN denotes the indeterminate part, where I rN 2 I rL ; I rU ½ presents the indeterminacy associated with r N 2 r 1N ; r 2N ½ . The neutrosophic form reduces to correlation under classical statistics if I rL ¼ 0. The existing Z-test for two correlation coefficients has been applied widely for testing the significant difference between correlation coefficients when the observations in each pair of two samples are precise, exact and determined. The use of the existing test may mislead when the samples have intervals, inexact and indeterminate observations. In this section, the design of the proposed Z-test for two correlation coefficients under neutrosophic statistics will be presented. It is assumed that the two samples are drawn from the neutrosophic normal distributions and the relationship between the neutrosophic independent variables M.Aslam and neutrosophic dependent variables is linear. Suppose that q 1N and q 2N are the corresponding neutrosophic population correlations, see Kanji [34] and Smarandache [28] . The Z-test of a correlation coefficient for the first sample under neutrosophic statistics is calculated as follows In neutrosophic form, the values of Z 1N 2 Z 1L ; Z 1U ½ can be written as The Z-test of a correlation coefficient for the second sample under neutrosophic statistics is calculated as follows In neutrosophic form, the values of Z 2N 2 Z 2L ; Z 2U ½ can be written as The neutrosophic mean of Z 1N 2 Z 1L ; Z 1U ½ and Z 2N 2 Z 2L ; Z 2U ½ are given by The neutrosophic variance of Z 1N 2 Z 1L ; Z 1U ½ and Z 2N 2 Z 2L ; Z 2U ½ are given by The test statistic, say Z N 2 Z L ; Z U ½ under neutrosophic statistics is given by where r N 2 r L ; r U ½ is defined by In neutrosophic form, the statistic Z N 2 Z L ; Z U ½ can be written as Note here that the statistic Z N 2 Z L ; Z U ½ reduces to statistic under classical statistics when I ZL ¼ 0 and I ZN 2 I ZL ; I ZU ½ is indeterminacy interval associated with Z N 2 Z L ; Z U ½ . Step 1: State the null hypothesis that H 0N : q 1N ¼ q 2N vs. the alternative hypothesis H 1N : q 1N 6 ¼ q 2N . Step 2: State the level of significance a. Step 3: Compute the values of the test statistic Z N 2 Z L ; Z U ½ using the neutrosophic sample information. Step 4: Select the critical value from the Z-table corresponding to a and decide about the rejection region according to H 1N . Step 5: Do not reject H 0N : q 1N ¼ q 2N is the calculated value of Z N 2 Z L ; Z U ½ falls within the acceptance region. In this section, the application of the proposed test is given on the weather data. For the study, the two important weather variables, namely temperature and wind speed are selected. The purpose of the application of the proposed test is to show the significant relationship between the temperature and wind speed at various time periods. The minimum and maximum values of two variables for the month of January are recoded for Lahore, Pakistan, and reported in Table 1 for the years 2019 and 2020. The energy experts are interested to see either the relation between temperature and wind speed for the year 2019, and the year 2020 is significant or not. As the data are recorded in indeterminate intervals, therefore the use of the existing test under classical statistics is not appropriate or may mislead the energy experts. In this situation, the proposed test can be applied to see either the correlation between temperature and wind for the two years is significant or not. r N 2 0:2672; 0:2672 ½ and Z N j j 2 1:3642; 1:7226 ½ . In neutrosophic form, the statistic Z N 2 Z L ; Z U ½ is given by The proposed test for the real data sets is stated as follows Step 1: State the null hypothesis that H 0N : q 1N ¼ q 2N vs. the alternative hypothesis H 1N : q 1N 6 ¼ q 2N . Step 2: State the level of significance a ¼ 0:05. Step 3: Compute the values of the test statistic Z N j j 2 1:3642; 1:7226 ½ using the neutrosophic sample information. Step 4: The critical value from the Z-table is 1.96 corresponding to a ¼ 0:05 and H 1N : q 1N 6 ¼ q 2N . Step 5: Do not reject H 0N : q 1N ¼ q 2N as Z N j j\1:96 From the proposed test, it can be concluded that the relationship between the temperature and wind speed of January 2019 and January 2020 is insignificant. Similarly, the relationship between variables can be studied for other months of the years 2019 and 2020. As discussed earlier, the proposed test is a generalization of the test under classical statistics. The proposed test reduces to the existing test when all observations in the data are exact, determined and certain. In this section, the comparison of the proposed test is given over the existing test in terms of the measure of indeterminacy, flexibility and information. For the comparison purpose, the neutrosophic form of the statistic Z N 2 Z L ; Z U ½ is considered only. The other neutrosophic quantities can be explained in the same manner. The neutrosophic form of Z N 2 Z L ; Z U ½ is Z N ¼ 1:3642 þ 1:7226I ZN ; I ZN 2 0; 0:2081 ½ . Note here that this neutrosophic form reduces to statistic under classical statistics when I ZL ¼ 0. Therefore, the first part of the neutrosophic form presents the value of test statistic under classical statistics. Similarly, the second part 1:7226I ZN shows the indeterminate part of the neutrosophic form. In addition, the measure of indeterminacy associated with this test is 0.2081. According to the proposed test, the values of the statistic of statistic Z N 2 Z L ; Z U ½ are flexible and lie in the indeterminate interval that is Z N j j 2 1:3642; 1:7226 ½ . According to the proposed test, under an uncertain environment, the value of Z N 2 Z L ; Z U ½ can be expected from 1.3642 to 1.7226. This range differentiates the proposed test from the existing test under classical statistics which gives the determined value which is not appropriate in uncertainty. Another aspect of the proposed test is that it gives more information about the testing process under indeterminacy. The proposed test gives additional information about the testing procedure which is the measure of indeterminacy. For the energy example, for testing H 0N : q 1N ¼ q 2N , the probability that H 0N : q 1N ¼ q 2N will be accepted is 0.95, the change of rejecting it when it is true is 0.05 and change of uncertainty about H 0N : q 1N ¼ q 2N is 0.2081. For fuzzy statistics, there are lower value of the interval (measure of truth) and the upper value of the interval (measure of falseness). It means that Z N 2 Z L ; Z U ½ can be from 1.3642 to 1.7226. The analysis based on fuzzy statistics does not give information about the parameter ''measure of indeterminacy.'' From this study, it is clear that the proposed test is flexible, informative and reasonable to apply for testing H 0N : q 1N ¼ q 2N under uncertainty. From the study, it is concluded that the proposed test under neutrosophic statistics is better than the test under classical and fuzzy statistics in terms of information and flexibility. This paper introduced a Z-test for two correlation coefficients under neutrosophic statistics. The necessary steps to implement the proposed test were given. The statistic of the proposed test under indeterminacy was introduced the first time. The application of the proposed test was given using temperature and wind speed data. The proposed test was the extension of the existing test under classical statistics. The application of the proposed test on the energy data showed it is efficient in measure of indeterminacy, flexibility and information. The proposed test can be applied for ocean big data as future research. The efficiency of the proposed test using some distributions can be considered a fruitful area of future research. A correlation study between meteorological parameters and COVID-19 pandemic in Mumbai SPSS and SAS programs for comparing Pearson correlations and OLS regression coefficients The effect of nonzero autocorrelation coefficients on the distributions of Durbin-Watson test estimator: three autoregressive models Spatial autocorrelation approaches to testing residuals from least squares regression Analysis of 22 vegetable oils' physico-chemical properties and fatty acid composition on a statistical basis, and correlation with the degree of unsaturation Correlation coefficients for a study with repeated measures Application of time series models in business research: correlation, association, causation Correlation of wind speed between neighboring measuring stations Correlation analysis for wind speed and failure rate of wind turbines using time series approach Study of time and meteorological characteristics of wind speed correlation in flat terrains based on operation data A statistical cognitive model to assess impact of spatially correlated wind production on market behaviors Wind turbine selection method based on the statistical analysis of nominal specifications for estimating the cost of energy An investigation of wind power density distribution at location with low and high wind speeds using statistical model A statistical modeling approach on the performance prediction of indirect evaporative cooling energy recovery systems Forecasting of wind speed using multiple linear regression and artificial neural networks Inter-and intraannual wind speed variabilities in wide valley regions of the middle reaches of the Yarlung Tsangpo River A fuzzy model for wind speed prediction and power generation in wind parks using spatial correlation Two-sample hypothesis tests of means of a fuzzy random variable Adaptive neuro-fuzzy approach for estimation of wind speed distribution The sign test and the signedrank test for interval-valued data Financial time series forecasting with deep learning: a systematic literature review Research on hybrid wind speed prediction system based on artificial intelligence and double prediction scheme Neutrosophic probability, set, and logic, ProQuest information & learning Neutrosophic precalculus and neutrosophic calculus Utilising neutrosophic theory to solve transition difficulties of IoT-based enterprises Neutrosophic set is a generalization of intuitionistic fuzzy set, inconsistent intuitionistic fuzzy set (picture fuzzy set, ternary fuzzy set), pythagorean fuzzy set, spherical fuzzy set, and q-rung orthopair fuzzy set, while neutrosophication is a generalization of regret theory, grey system theory, and three-ways decision (revisited) An integrated neutrosophic-topsis approach and its application to personnel selection: a new trend in brain processing and analysis Introduction to neutrosophic statistics Scale effect and anisotropy analyzed for neutrosophic numbers of rock joint roughness coefficient based on neutrosophic statistics Expressions of rock joint roughness coefficient using neutrosophic interval statistical numbers Forecasting of the wind speed under uncertainty Design of the Bartlett and Hartley tests for homogeneity of variances under indeterminacy environment On detecting outliers in complex data using Dixon's test under neutrosophic statistics Acknowledgements We are thankful to the editor and reviewers for their valuable suggestions to improve the quality of the paper.Data availability The data are given in the paper. Conflict of interest The author declares no conflict of interest.