key: cord-1004426-6kdmd3to authors: Thamer, Mathil K.; Zine, Raoudha title: Comparison of Five Methods to Estimate the Parameters for the Three-Parameter Lindley Distribution with Application to Life Data date: 2021-12-08 journal: Comput Math Methods Med DOI: 10.1155/2021/2689000 sha: cbee39a60a909dbbf9e598cbe0c37f1b33f876cd doc_id: 1004426 cord_uid: 6kdmd3to We have studied one of the most common distributions, namely, Lindley distribution, which is an important continuous mixed distribution with great ability to represent different systems. We studied this distribution with three parameters because of its high flexibility in modelling life data. The parameters were estimated by five different methods, namely, maximum likelihood estimation, ordinary least squares, weighted least squares, maximum product of spacing, and Cramér-von Mises. Simulation experiments were performed with different sample sizes and different parameter values. The different methods were compared on the generated data by mean square error and mean absolute error. In addition, we compared the methods for real data, which represent COVID-19 data in Iraq/Anbar Province. The Lindley distribution was proposed in 1958 by Lindley [1] ; however, actual interest began in 2008 when Ghitany et al. [2] studied its properties and applications. Since then, this distribution has been developed as generalised Lindley distribution in 2009 by Zakerzadeh & Dolati [3] , a twoparameter Lindley distribution in 2013 by Shanker et al. [4] , another two-parameter Lindley distribution in the same year by Shanker & Mishra [5] , Lindley distribution with a location parameter as a three-parameter distribution in 2016 by Abd El-Monsef [6] , and another three-parameter Lindley distribution in 2017 by Shanker et al. [4] . The Lindley distribution is made up of mixing two continuous distributions with different weights; the first is exponential distribution with θ, and the second is gamma distribution with 2 and θ, that is, where Thus, the resulting function is as follows: Lindley [1] used w = θ/ðθ + 1Þ; hence, the probability density function (p.d.f.) is as follows: Ghitany et al. [2] introduced its cumulative distribution function (c.d.f.) as follows: Parameter estimation of the two-parameter Lindley distribution was conducted by many researchers, such as Al-Bayati [7] , Sharafi [8] , and Demirci Biçer [9] . However, the parameters of the three-parameter Lindley distribution were only estimated in the maximum likelihood method be Shanker et al. [10] . Therefore, in this research, the distribution parameters were estimated using different methods. 2.1. The Three-Parameter Lindley Distribution. The threeparameter Lindley distribution (THPL) was proposed by Shanker et al. [10] ; the weight w = αθ/ðαθ + βÞ was used, as follows: resulting in the following: x, θ, β > 0 and αθ + β > 0, ð7Þ x, θ, β > 0 and αθ + β > 0: The quantile function of the three-parameter Lindley distribution is given by the following: where W −1 ð:Þ denotes the negative branch of the Lambert W function. Estimators of the Parameters of THPL Distribution. We present five well-known methods to estimate the parameters of the three-parameter Lindley distribution, including maximum-likelihood (ML), ordinary least-squares (OLS), weighted least-squares (WLS), maximum product of spacing (MPS), and Cramér-von Mises (CVM). The log-likelihood of the positive vector of observations x = ðx 1 , x 2 , ⋯, x n Þ under the three-parameter Lindley distribution can be written as follows: where x is the sample mean. Shanker et al. [10] derived the maximum likelihood estimates (MLE) b θ, b α, and b β of θ, α, and β, by solving the following nonlinear equations: We can also obtain MLE by maximising (11) via fminunc function in MATLAB. Least-Square Estimators. Suppose that X ð1Þ ≤ X ð2Þ ≤ ⋯≤X ðnÞ are the order statistics of a random sample from any probability distribution. The i th -order statistic has the mean and the variance as follows: OLS and WLS were proposed in 1988 by Swain et al. [11] . We can get OLS estimates for the parameters by 2 Computational and Mathematical Methods in Medicine minimising the following function with respect to the parameters, as follows: where Fðx ðiÞ Þ represents the theoretical c.d.f. of the observation x ðiÞ of the distribution under study and FðiÞ represents the empirical c.d.f. which is usually estimated by F̂ðiÞ = i/ ðn + 1Þ; then, we obtain the following: This function can be obtained for the three-parameter Lindley distribution after substituting for Fðx ðiÞ Þ in the We can determine the OLS estimates by minimising (16) with respect to the parameters via fminunc function or by solving the following equations: Criteria Computational and Mathematical Methods in Medicine We can obtain WLS estimates for the parameters by minimising the following function with respect to the parameters: This function can be obtained for the three-parameter Lindley distribution after substituting for Fðx ðiÞ Þ in the previous equation by its c.d.f. defined in equation (8), as follows: WLS estimates can be obtained by minimising (19) with respect to the parameters via fminunc function or by solving the following equations: [12] ; the idea of this method is to maximise the following function: where D i = Fðx ðiÞ Þ − Fðx ði−1Þ Þ and Fðx ð0Þ Þ = 0 ; Fðx ðn+1Þ Þ = 1. This function can be obtained for the three-parameter Lindley distribution after substituting for Fðx ðiÞ Þ in the previous equation by its c.d.f. defined in equation (8), as follows: where We can identify the MPS estimates by maximising (22) via fminunc function or by solving the following equations: where [13] . The idea of this method is to minimise the following function: This function can be obtained for the three-parameter Lindley distribution after substituting for Fðx ðiÞ Þ in the previous equation by its c.d.f., which was defined in equation (8) , as follows: We can determine the CVM estimates by maximising (27) via fminunc function or by solving the following equations: 3.1. Simulation. To compare the five estimation methods, data were generated from the three-parameter Lindley distribution on the basis of the quantile function defined in equation (10) . Data were generated for four different cases, as shown in Table 1 . For each case, different sizes of samples were used (10, 30, 60, 80, 150, and 250). The experiment was repeated 10,000 times for each of combinations. Then, the parameters were estimated by the five estimation methods; the methods were compared using mean square error (MSE) and mean absolute error (MAE). Table 2 shows the formulas of these criteria. All operations were conducted in MATLAB 2020a (see Code 1). Tables 3-6 illustrate our simulation study. The different methods were compared based on their ranks. These results show that all estimators have the property of consistency and for all methods because MSEs and the MAEs for them decrease with an increasing sample size. The preference of the methods can be summarised in Table 7 , which shows that MPS and WLS are best for small sample sizes (10, 30) . MPS, MLE, and WLS are best in medium sample sizes (60, 80), and MPS and MLE are best for large sample sizes (150, 250). Application. The survival time to death for 83 COVID-19 patients was recorded by the researchers from the medical sector in Iraq/Al Anbar Province. Table 8 contains these data. The parameters of the three-parameter Lindley distribution were estimated by the five methods. Furthermore, the Kolmogorov-Smirnov ðKSÞ values, KS = max jFðx ðiÞ Þ − ði/ðn + 1ÞÞj and their associated p values were calculated to ensure that these data follow the three-parameter Lindley distribution. Table 9 shows that the data were distributed as the three-parameter Lindley distribution for all methods. We note that all p values are greater than 0.05, indicating that the data follow the three-parameter Lindley distribution. We draw p.d.f. and c.d.f. of the three-parameter Lindley distribution based on the following: where b θ, b α, and b β are the parameter estimates. According to equations (29) and (30), the p.d.f. and c.d.f. can be drawn in Figures 3 and 4 . We note that the behaviour of the estimated functions is relatively close to that of the empirical functions. This finding is a good indication that the estimated models can represent the COVID-19 data. The parameters of the three-parameter Lindley distribution (THPLD) were estimated by five different methods. A simulation study was performed, and these methods were compared using MSE and MAE. All estimators were consistent because their MSE and MAE values decrease as the sample size increases. The MPS and WLS methods were good in small samples. MPS, MLE, and WLS were good in medium samples. MLE and MPS were good in large samples. On the practical side, the results indicated that the COVID-19 data follow the three-parameter Lindley distribution. The p.d.f. and c.d.f. were estimated based on the five methods, and then, these functions were drawn. The graphics indicated that the behaviour of the estimated functions is close to the empirical functions. Data are available upon request from the authors. The authors declare no conflict of interest. Fiducial distributions and Bayes' theorem Lindley distribution and its application Generalized Lindley distribution A two-parameter Lindley distribution for modeling waiting and survival times data A two-parameter Lindley distribution A new Lindley distribution with location parameter Some Estimation Methods for Lindley Distribution M Inference of the two-parameter Lindley distribution based on progressive type II censored data with random removals Statistical inference for geometric process with the two-parameter Lindley distribution A threeparameter Lindley distribution Least-squares estimation of distribution functions in Johnson's translation system Maximum product of spacings estimation with application to the lognormal distribution An estimation procedure for mixtures of distributions The authors are grateful to Adnan M. Hussein for his programming contribution.