key: cord-0861384-duycbjvk authors: Almongy, Hisham.M.; Almetwally, Ehab M.; Aljohani, Hassan M.; Alghamdi, Abdulaziz S.; Hafez, E.H. title: A New Extended Rayleigh Distribution with Applications of COVID-19 Data. date: 2021-03-12 journal: Results Phys DOI: 10.1016/j.rinp.2021.104012 sha: 6b66bcc1b27c9bd5accf33d0451797a8fc252e0c doc_id: 861384 cord_uid: duycbjvk This paper aims to model the COVID-19 mortality rates in Italy, Mexico, and the Netherlands, by specifying an optimal statistical model to analyze the mortality rate of COVID-19. A new lifetime distribution with three-parameter is introduced by a combination of Rayleigh distribution and extended odd Weibull family to produce the extended odd Weibull Rayleigh (EOWR) distribution. This new distribution has many excellent properties as simple linear representation, hazard rate function, and moment generating function. Maximum likelihood, maximum product spacing and Bayesian estimation methods are applied to estimate the unknown parameters of EOWR distribution. MCMC method is used for the Bayesian estimation. A numerical result of the Monte Carlo simulation is obtained to assess the use of estimation methods. Also, data analysis for the real data of mortality rate is considered. One of the main tasks of statistics is to be modeling the natural life events in the form of probability distributions. Probability distributions are used to modeling natural life phenomena that are characterized by uncertainty and riskiness. Many of the probability distributions have been derived because the natural life phenomena are complex and diversified. However, known probability distributions remain unable to accurately represent data for some natural phenomena. These lead to the expansion and Modification of generalized probability distributions. Generalized probability distributions have been progressing with the popular character of having adding parameters. The addition of some parameters to the known probability distributions improved the quality of suitability for the natural phenomena data and higher accuracy of describing the shape of the tail of the distribution. Most of extensions of the Rayleigh probability distribution have been derived because of their great importance in describing many natural life phenomena. The Rayleigh probability density function, attributed to Lord Rayleigh (1842 Rayleigh ( -1919 , it concerned with describing skewed data Rayleigh [1] Many researchers consider one scale parameter Rayleigh, like Robert C.P. Diebolt and Robert [2] discussed deviation and distance measure in economic, which can be applied in another real phenomena data. The Extended of probability distribution was originally introduced by Lehmann [3] . Kundu and Raqab [4] provided a generalization of the Rayleigh probability distribution and estimated its unknown parameters using several different methods. In Voda [5] used the conservative technique to derive a new generalization of the Rayleigh probability distribution. In Dey [6] presented Bayesian estimates of the Rayleigh probability distribution parameters using the Linux loss functions and error square. In Merovci [7] used the square ordinal transformation method in developing the Transmuted Rayleigh probability distribution. In Merovci and Elbatal [8] presented a Weibel-Rayleigh probability distribution. Based on type II censored data, in Mahmoud and Ghazal [9] discussed the parameters estimation of exponentiated Rayleigh. In Ateeq et al. [10] RayleighRayleigh derived the distribution (RRD) using Transformed Transformer technique. El-Sherpieny and Almetwally [12] introduced Bivariate generalized Rayleigh distribution based on clayton copula with different applications. In Almetwally et al. [11] maximum likelihood and maximum product spacing estimates for generalized Rayleigh distribution based on the adaptive type-II progressive censoring schemes. In Al-Babtain [13] proposed a new extension of the Rayleigh distribution with two parameter called type I half logistic Rayleigh distribution. Here we study a new model with three parameters, it is called extended odd Weibull Rayleigh (EOWR) distribution. The EOWR distribution is obtained based on the extended odd Weibull-G (EOW-G) family, which introduced by Alizadeh et al. [14] . Let G(x; θ) = 1 − G(x; θ) and g (x; θ) = dG(x;θ) dx denote the survival function (S) and probability density function (PDF) of a baseline model with parameter vector θ respectively, so the CDF of the EOW-G family is given by: The corresponding PDF of (1) is defined by where α and β are positive shape parameters. The random variable with PDF (2) is denoted by X ∼EOW-G(α, β, θ). Afify and Mohamed [15] introduced a new flexible three-parameter exponential distribution called the extended odd Weibull exponential distribution. Alshenawy et al. [16] maximum likelihood and maximum product spacing estimates of the extended odd Weibull exponential distribution have been discussed under progressive type-II censoring scheme with random removal. Our goal is to study point estimation of the unknown parameters of EOWR by using four classical methods of estimation and Bayes estimation method. A statistical comparison between these methods is conducted via simulation to asses the performance of these methods and to study how these estimators behave for several sample sizes and parameter values. The rest of this paper is organized as follows. In Section 2, we define EOWR distribution. EOWR linear representation of its PDF is obtained in Section 3, along with some of its statistical properties. Three methods of point estimation are studied in Section 4. In Section 5, a simulation study is conducted in order to compare the performance of these estimation methods. Three real data sets of COVID-19 from different life applications are used in Section 6 to prove the efficiency of the EOWR distribution with respect to other distributions. Finally, conclusions are given in Section 7. The three-parameter EOWR distribution is a special model of EOW-G family with Rayleigh distribution as a baseline function. The Rayleigh distribution under consideration has PDF and CDF of the form g (x; δ) = 2δxe −δx 2 and G (x; δ) = 1 − e −δx 2 , x > 0, δ > 0. By substituting the CDF and PDF of the Rayleigh model in (1) and (2), we obtain the CDF and PDF of the EOWR distribution respectively as; (3) f (x; α, β, δ) = 2αδxe δx 2 e δx 2 − 1 Therefore, a random variable with PDF (4) is denoted by X ∼EOWR(α, β, δ). The EOWR model reduces to the two parameter Weibull Rayleigh model when β → 0 + . The hazard rate function (HR) of the EOWR distribution are given by Figures 1 and 2 are different shapes of the PDF and HR of the EOWR distribution. These figures show that the PDF of the EOWR distribution can be right-skewed, symmetric or decreasing curves. The HR of the EOWR distribution has some important shapes, including, constant, decreasing, and upside down curve, which are attractive characteristics for any lifetime model. It can be noticed from the application section, that the EOWR distribution possesses great flexibility and can be used to model skewed data, hence widely applied in different areas such as biomedical studies, biology, reliability, physical engineering, and survival analysis. In this section, we observe some statistical properties of the EOWR distribution namely, the linear representation of PDF, which is useful in finding the moments and moment generating function (MGF). Also we obtain the mean residual life and mean inactivity time. Linear representation for the EOWR density using series techniques is useful for finding many statistical values and properties of the needed distribution. Alizadeh et al. [14] showed that EOW-G family has the following mixture representation of its density is the Exponential-G density with positive power parameter αj + k. Now substituting the PDF and CDF of the Pareto distribution, the above equation can be written as Equation (5) can be written as where υ j,k = a j,k (αj+k) , and g (x; αj + k, δ) denotes Generalized Rayleigh density with αj + k, δ as parameters of Generalized Rayleigh distribution. Hence the PDF of EOWR can be expressed as a linear combination of Generalized Rayleigh distribution. Let X be a random variable having Generalized Rayleigh distribution. Then, the ith ordinary moment, and MGF of X are The ith moment of X follows directly from Equations (6) and (7) Referring to Equation (6), the MGF of the EOWR distribution is given by: The quantile function of EOWR distribution is used in the theoretical aspect of probability theory for this model, statistical applications, and simulations. Simulation algorisms used quantile function to produce simulated random variables. The quantile function (Q) of the EOWR distribution are given by In particular, the median of EOWR distribution can be derived from 10 be setting q = 0.5. Then, the median is given by median In this section, we use different point estimation methods to estimate the unknown parameters of the EOWR. We use maximum likelihood estimator (MLE), Least Square (LS), maximum product of spacing estimator (MPS) and Bayesian estimation methods. In the last few years, parameter estimation using different estimation methods got great attention by many authors such as Almetwally and Almongy [17] , Haj Ahmad and Almetwally [18] , Basheer et al. [19] and Afify and Mohamed [15] . Let x 1 , · · · , x n be a random sample from the EOWR distribution with parameters α, β, and δ. The likelihood function can be written as: the log-likelihood function is where H i = (x δ i − 1) and Ω = (α, β, δ) is a vector of the EOWR parameters. The MLE are obtained by solving the following normal equations, These equations cannot be solved explicitly, hence a nonlinear optimization algorithm as Newton Raphson method is used. According to Cheng and Amin [20] , the maximum product spacing method (MPS) is an efficient estimation method that proved to have some advantages with respect to other point estimation methods. So we use MPS in this section to have point estimation of the unknown parameters of EOWR distribution. This can be obtained by solving the normal equations resulted from taking partial derivatives of logarithm of product spacing function G(Θ) which is written as: where ∆(x i ) = e δx 2 i − 1 and the logarithmic function of G(Θ) The MPS estimators of Ω are obtained by differentiating the log-product equation (13) with respect to each parameter separately, then we solve the nonlinear system of equations found by using any iterative procedure techniques such as Newton Raphson algorithms. This developed in last few year to estimation parameter of model under censoring scheme as Ng et al. [21] , Basu et al. [22] , Almetwally and Almongy [23] , Almetwally et al. [17, 24] , and El-Sherpieny et al. [25] . Bayesian methods is a statistical inference that depends on the choice of the prior distribution and the loss function. In this method all parameters are considered as random variables with certain distribution called prior distribution. If prior information is not available which is usually the case, we need to select one. Since the selection of prior distribution plays an important role in estimation of the parameters, our choice for the priors are the independent gamma distributions. On the other hand, the loss function is important in Bayesian methods. Most of the Bayesian inference procedures are developed under the symmetric and asymmetric loss functions. One of the most common symmetric loss function is the squared error loss function. The independent joint prior density function of Ω can be written as follows: The joint posterior density function of Θ is obtained from (11) and (14) π(Θ|x) = ℓ(x|Θ).π(Θ) The Bayes estimators of Θ, say ( α B , β B , δ B ) based on squared error loss function is given bŷ It is noticed that the integrals given by (16) can't be obtained explicitly. Because of that we use the Markov Chain Monte Carlo technique (MCMC) to find an approximate value of integrals in (16) . Many of studies used MCMC technique such as, Almetwally et al. [17, 24] . In this section Monte-Carlo simulation procedure is performed for comparison between the classical estimation methods: MLE, MPS and Bayesian estimation method under square error loss function based on MCMC, for estimating parameters of EOWR distribution in life time by R language. Monte-Carlo experiments are carried out based on data-generated 10000 random samples from EOWR distribution, where x has EOWR life time for different actual values of parameters and different sample sizes n:(50, 100 and 200). We could define the best estimators methods as which minimizes the bias and root mean squared error (RMSE) of estimators. Tables 1, 2 summarizes the simulation results of point estimation methods proposed in this paper. We consider the bias and the RMSE values in order to perform the needed comparison between different point estimation methods. The following remarks can be noted from these tables: In this section, three real data of COVID-19 from Italy, Mexico and Netherlands are given to test the goodness of the EOWR distribution. The EOWR model is compared with other related models such as, Rayleigh, Marshall-Olkin Rayleigh (MOR) [26] , Kumaraswamy exponentiated Rayleigh (KER) Rashwan [27] and extended odd Weibull exponential (EOWE) distribution [15] . Tables 3 4 and 5 provide values of Crammer-von Mises (W*), Anderson-Darling (A*) and Kolmogorov-Smirnov (KS) statistic along with its P-value for all models fitted based on three real data sets. The second data represents a COVID-19 data belong to Mexico of 108 days, that is recorded from 4 March to 20 July 2020. This data formed of rough mortality rate. The data are as follows: 8 The third data represents a COVID-19 data belong to Netherlands of 30 days, that is recorded from 31 March to 30 April 2020. This data formed of rough mortality rate. The data are as follows: 14 From Tables 3 4 and 5 it is obvious that EOWR distribution has minimum values of all information criteria compared with other distributions. Also the P-value for KS has its highest value when the life time is EOWR distribution. This leads us to conclude that EOWR better fit the three real sets of data. Empirical, Q-Q and P-P plots shown in figures 3, 4 and 5, indicate that our distribution is a good choice for modeling the above real data. Xii. on the resultant of a large number of vibrations of the same pitch and of arbitrary phase. The London, Edinburgh, and Dublin Philosophical Magazine Estimation of finite mixture distributions through Bayesian sampling The power of rank tests Generalized Rayleigh distribution: different methods of estimations A new generalization of Rayleigh distribution Comparison of Bayes estimators of the parameter and reliability function for Rayleigh distribution under different loss functions Transmuted rayleigh distribution Weibull-Rayleigh distribution: theory and applications Estimations from the exponentiated rayleigh distribution based on generalized Type-II hybrid censored data An extension of Rayleigh distribution and applications Adaptive type-II progressive censoring schemes based on maximum product spacing with application of generalized Rayleigh distribution Computer Science and Operation research, Faculty of Graduate Studies for Statistical Research A New Extended Rayleigh Distribution The extended odd Weibull-G family: properties and applications A new three-parameter exponential distribution with variable shapes for the hazard rate: estimation and applications Progressive Type-II Censoring Schemes of Extended Odd Weibull Exponential Distribution with Estimation Methods for the New Weibull-Pareto Distribution: Simulation and Application Marshall-Olkin Generalized Pareto Distribution: Bayesian and Non Bayesian Estimation Marshall-Olkin Alpha Power Inverse Weibull Distribution: Non Bayesian and Bayesian Estimations Estimating parameters in continuous univariate distributions with a shifted origin Parameter estimation of three-parameter Weibull distribution based on progressively type-II censored samples Estimation of Inverse Lindley Distribution Using Product of Spacings Function for Hybrid Censored Data Maximum Product Spacing and Bayesian Method for Parameter Estimation for Generalized Power Weibull Distribution under Censoring Scheme Maximum Product Spacing Estimation of Weibull Distribution Under Adaptive Type-II Progressive Censoring Schemes Progressive Type-II hybrid censored schemes based on maximum product spacing with application to Power Lomax distribution The MarshallOlkin extended generalized Rayleigh distribution: Properties and applications A note on Kumaraswamy exponentiated Rayleigh distribution In this paper we formulate a new generalization of Rayleigh and Weibull distributions which is called EOWR distribution. We studied its statistical properties and obtained a linear representation for its pdf which was efficient in finding moments, moment generating function, mean residual and others. Different classical and bayes estimation methods were considered to find point estimation of EOWR unknown parameters α, β and δ. A comparison was conducted via simulation analysis using R package to distinguish the performance of different estimation method. MCMC method was used for that purpose, also real data sets were considered and they showed that EOWR better fit these data compared with other competitive distributions.