key: cord-0663570-cp4c8o1x authors: Dutta, Subhankar; Kayal, Suchandan title: Estimation of parameters of the Gumbel type-II distribution under AT-II PHCS with an application of Covid-19 data date: 2021-03-15 journal: nan DOI: nan sha: 0d5a1dbf501f4db66fec12e14a49a47dca345ee5 doc_id: 663570 cord_uid: cp4c8o1x In this paper, we investigate the classical and Bayesian estimation of unknown parameters of the Gumbel type-II distribution based on adaptive type-II progressive hybrid censored sample (AT-II PHCS). The maximum likelihood estimates (MLEs) and maximum product spacing estimates (MPSEs) are developed and computed numerically using Newton-Raphson method. Bayesian approaches are employed to estimate parameters under symmetric and asymmetric loss functions. Bayesian estimates are not in explicit forms. Thus, Bayesian estimates are obtained by using Markov chain Monte Carlo (MCMC) method along with the Metropolis-Hastings (MH) algorithm. Based on the normality property of MLEs the asymptotic confidence intervals are constructed. Also, bootstrap intervals and highest posterior density (HPD) credible intervals are constructed. Further a Monte Carlo simulation study is carried out. Finally, the data set based on the death rate due to Covid-19 in India is analyzed for illustration of the purpose. In recent times, life testing experiments and reliability studies have achieved more acceptance. In such experiments, various situations appear where experimental units are eliminated before the time of failure. In these cases, the investigator may not have full information about the failure times of the whole experimental unit. Such kind of data collected from all these experiments are called censored data. It is worth mentioning that the censoring was introduced in practice to save time and reduce the number of failed units in a life-testing experiment. Censoring can be done with respect to a prefixed time or prefixed number of failures or sometimes a combination of both prefixed time and number of failures. Depending upon these criteria, there are different types of censoring schemes. Let us consider a life testing experiment with n number of experimental units and X i:n denotes the time of i-th failure of the experimental units, where i = 1, . . . , n. Let T be a pre-specified time. If all the experiments fail before that pre-specified time T , it is known as a type-I censoring scheme. In the case of the type-II censoring scheme, the experiment will terminate after m-th number of failures observed, where m ≤ n. Both type-I and type-II censoring schemes have some disadvantages. In the case of type-I censoring, the number of failures may be zero, and in the case of type-II censoring, experimental time may be very large. For further information about these schemes, we refer to Lawless (2011) , Sirvanci and Yang (1984) and Balakrishnan and Chan (1992) . Epstein (1954) introduced a censoring scheme which is a mixture of type-I and type-II censoring schemes, known as the type-I hybrid censoring scheme. In this scheme, the experiment is terminated at time T * = min{T, X m:n }, where X m:n denotes the m-th failure and T is pre-specified time. Childs et al. (2003) proposed type-II hybrid censoring scheme where experiment will be terminated at time T * = max{X m:n , T }. For further studies on these hybrid censoring schemes, one may read Balakrishnan and Kundu (2013) , Kundu (2007) , Kundu and Pradhan (2009) and Dey and Pradhan (2014) . In practical life testing experiments, there are many cases when the experimental units are removed from experiments before failure for other reasons. To overcome such scenarios, a type-II right censoring scheme is introduced, known as a progressive type-II censoring scheme. It has more flexibility in allowing the removal of units at time points other than the terminal point of the experiments. For progressive type-II censoring scheme, let us consider that n experimental units are placed on a life testing experiment. Let X i:m:n be the failure time of the i-th experimental unit, and m be the number of failures that is fixed before the experiment starts. After first failure at X 1:m:n , R 1 number of units are removed randomly from n − 1 surviving units. After second failure at X 2:m:n , R 2 number of units are removed randomly from n − R 1 − 2 surviving units. The test continues until m-th failure occurs and all remaining R m = n − m i=1 R i − m surviving units are removed. In the progressive type-II censoring scheme, the values of R i 's are pre-fixed. Many researchers have studied statistical inferences of different distributions under this censoring scheme. For example, see Balakrishnan et al. (2003) , Rastogi and Tripathi (2012) and Maiti and Kayal (2019) . In recent years, progressive type-II censoring scheme gained more popularity in life testing experiments and reliability studies. But there is a major drawback of this censoring scheme that it may take a large time to complete an experiment. To overcome this drawback, Kundu and Joarder (2006) introduced progressive hybrid censoring scheme. Basically, there are two types of progressive hybrid censoring schemes. Let us consider progressively censored ordered statistics X 1:m:n , . . . , X m:m:n from a life testing experiments of n units. Further, (R 1 , . . . , R m ) are assumed to be the corresponding removals. If the experiment terminates at a time T * = min{T, X m:m:n }, where T ∈ (0, ∞) is prefixed, then it is called type-I progressive hybrid censoring scheme (PHCS). In case of type-I PHCS, at the time of first failure X 1:m:n , R 1 surviving units are removed randomly from (n − 1) units. At time of second failure X 2:m:n , R 2 number of surviving units are removed randomly from (n − R 1 − 2) units. In similar way if m-th failure occurs before time T , i.e. X m:m:n < T , then the observed failures are X 1:m:n , . . . , X m:m:n and all remaining units R m = n − m − m−1 i=1 R i are removed at termination point X m:m:n . If X m:m:n > T, then the observed failures are X 1:m:n , . . . , X j:m:n , where X j:m:n < T < X j+1:m:n and all remaining units R * j = n − j − j i=1 R i at the time of termination T . In recent years, statistical inference under type-I PHC has been studied by several authors. One may refer to Lin and Huang (2012) , Tomer and Panwar (2015) and Kayal et al. (2017) . There is a drawback of type-I progressive hybrid censoring scheme, since the effective sample size may turn out to a small number. Thus, it has lower efficiency in computing statistical inferences for some problems. To increase such efficiency, Ng et al. (2009) proposed adaptive type-II progressive hybrid censoring scheme (AT-II PHCS) where the effective sample size m and time T are prespecified. This censoring scheme is similar to the type-I progressive hybrid censoring scheme. It is different when T < X m:m:n . If X j:m:n < T < X j+1:m:n , where j + 1 < m, all remaining units will be terminated by setting R j+1 = . . . = R m−1 = 0 and R m = n−m− j i=1 R j . Thus the experiment will be terminated after getting the expected sample size. In recent years many authors studied inferences of various lifetime models under AT-II PHCS. Lin et al. (2009) obtained classical estimates on the Weibull lifetime model based on AT-II PHCS. Hemmati and Khorram (2013) obtained maximum likelihood and approximated maximum likelihood estimates of model parameters of two parameters log-normal distribution based on AT-II PHCS. Nassar and Abo-Kasem (2016) discussed maximum likelihood estimates and Bayesian estimates of parameters of Burr XII distribution model based on AT-II PHCS. Panahi and Moradi (2020) discussed classical and Bayesian estimation of parameters of inverted exponential Rayleigh distribution. Panahi and Asadi (2021) discussed the maximum likelihood estimates and Bayesian estimates of parameters of Burr III distribution based on AT-II PHCS. Gumbel (2004) introduced a two-parameter distribution, known as Gumbel type-II distribution, which is very useful to model meteorological phenomena such as floods, earthquakes, and natural disasters. Also it can used in life expectancy tables, hydrology and rainfall. The cumulative distribution function (CDF) of Gumbel type-II distribution is where α is the shape parameter, β is the scale parameter. The corresponding probability density function (PDF) is The hazard rate function of Gumbel type-II distribution is given by (1.3) Figure 1 represents graphs of the PDFs and hazard rate functions of the Gumbel type-II distribution based on different values of the parameters α and β. It is noticed that the shape of the hazard rate function of Gumbel type-II distribution is decreasing and upsidedown bathtub (UTB). Due to these shapes of the hazard rate function, the Gumbel type-II distribution is very flexible to use in different research areas such as clinical, reliability and survival studies. Recently, many authors have studied statistical properties of the estimators of the model parameters of Gumbel type-II distribution. As example, Abbas et al. (2013) discussed Bayesian estimation of the model parameters. Reyad and Ahmed (2015) studied E-Bayesian estimation of the unknown model shape parameter. Sindhu et al. (2016) obtained Bayes estimates and corresponding risk of the unknown model parameters based on left-censored data. Abbas et al. (2020) proposed Bayesian estimation of the model parameters under type-II censored sample with some medical applications. Due to importance of the Gumbel type-II distribution and the AT-II PHCS, in this paper, we have considered the problem of estimation of the parameters of Gumbel type-II distribution under AT-II PHCS. To the best of our information, this problem has not been studied yet. The rest of the article is organized as follows. In Section 2, MLEs and MPSEs are computed by using the Newton-Raphson method. The Bayesian estimates are obtained under symmetric and asymmetric loss functions using the Markov chain Monte Carlo (MCMC) technique in Section 3. In Section 4, the asymptotic confidence intervals using the normality property of the MLEs and bootstrap confidence intervals are constructed. Also, MCMC samples are used to build HPD credible intervals. A Monte Carlo simulation study is carried out to compare the performance of different estimates in Section 5. Section 6 deals with a real data set based on the death rates due to Covid-19 in India. Finally, conclusions of this paper have been drawn in Section 7. In this section, classical estimation of unknown model parameters of Gumbel type-II distribution is obtained by using two methods: (a) maximum likelihood estimation and (b) maximum product spacing. Let x 1:m:n < . . . < x j:m:n < T < x j+1:m:n < . . . < x m:m:n be an adaptive type-II progressive censored ordered sample from (1.2) along with a censoring scheme (R 1 , . . . , R j , 0, . . . , 0, R * j ), where R * j = n − m − j i=1 R i and T is pre-specified. Then, the likelihood function can be written as (2.1) Whenever x m:m:n < T , then R * j = 0 and the likelihood function becomes where x i represents the i-th failure time x i:m:n . The log-likelihood function is given as After differentiating (2.4) partially with respect to α and β respectively, and equating to zero we get The maximum likelihood estimates of unknown parameters α and β can be obtained from Equations (2.5) and (2.6), which are not in closed form, thus need to be solved numerically. The maximum likelihood estimates of the unknown parameters are denoted by α and β. Further, when computing MLEs numerically, it is always of interest to study the existence and uniqueness of the MLEs of the parameters. In order to achieve this, one requires to show two conditions proposed by Mäkeläinen et al. (1981) . These are difficult to establish due to the complicated nature of the expressions of the second order partial derivatives of the log-likelihood function. To have a rough idea of that, the profile log-likelihood functions for the parameters α and β are presented in Figure 2 . These figures represent that the MLEs may exist uniquely. Cheng and Amin (1983) proposed maximum product spacing estimation (MPSE) method as an alternative to the MLE. Almetwally et al. (2019) discussed MPSE method under AT-II PHCS for generalized Rayleigh distribution. The product spacing under AT-II PHCS can be written as such that D i = 1. Then, the product spacing function under AT-II PHCS based on Gumbel type-II model can be written as (2.9) Now, the logarithm of space function can be written as To obtain the normal equations from the above Equation (2.10), differentiate partially with respect to unknown parameters and equating to zero, we get Above these two equations can not be solved analytically, so numerical method will be applied to obtain the MPSEs of the unknown parameters α and β, as α M P S and β M P S , respectively. In this section, Bayes estimates of unknown parameters of Gumbel type-II distribution are obtained with respect to symmetric and asymmetric loss functions under AT-II PHCS. To obtain the Bayes estimates of α and β, squared error loss function (SELF), which is symmetric and general entropy loss function (GELF) and LINEX loss function (LLF), which are asymmetric, are considered. Letθ be an estimator of θ. Then, SELF, LLF and GELF are respectively given by Under the loss functions given by (3.1), (3.2) and (3.3), the Bayes estimates of θ can be respectively written as In Bayesian estimation, choosing priors for the unknown model parameters is an important as well as challenging problem. There is no clear methodology to choose best priors for Bayesian estimation problem. Here, we assume the prior distributions for the parameters as α ∼ Gamma(a, b) and β ∼ Gamma(c, d), where Gamma(a, b) denotes gamma distribution with shape parameter a and scale parameter b. The joint prior density of α and β can be written as Based on the likelihood function (2.3) and the joint prior density function (3.7), the posterior density function of α and β can be written as and all the hyper-parameters a, b, c, d are non-negative and known. Now, consider a function of parameters α and β, say ψ(α, β). Then, from (3.4), (3.5) and (3.6), the Bayes estimates of ψ(α, β) with respect to SELF, LLF and GELF are given by ( 3.12) respectively. To derive Bayes estimates of α and β in respect of loss functions given by (3.1), (3.2) and (3.3), ψ(α, β) is replaced by α and β respectively in Equations (3.10), (3.11) and (3.12). Then, the Bayes estimates of α are given by and In a similar way, by replacing ψ(α, β) as β in Equations (3.10), (3.11) and (3.12) Bayes estimates β SE , β LI and β GE can be obtained. There are ratios of two integrals given in Equations (3.13), (3.14) and (3.15) which can not be obtained in a closed form. To overcome such situations, MCMC technique will be used to obtain desired Bayes estimates. In this section, the Bayes estimates of the unknown parameters α and β are computed. To generate samples from (3.8), MCMC method has been used. From posterior probability density function given by (3.8), the conditional posterior densities of the parameters can be written as Note that the density functions π 1 (α|β, x) and π 2 (β|α, x) can not be brought into some well known classes of distributions. Thus, the MCMC samples for α and β can not be generated directly. In this context, the Metropolis-Hastings algorithm (see Chen et al. (2012) ) is used to generate samples from (3.16) and (3.17). This method has been discussed as follows: Step 1 Set i = 1 and choose initial guesses as α (1) = α and β (1) = β. Step 2 Generate new samples and β (i) with proposal distribution α (i) ∼ N(α (i−1) , var( α)) and β (i) ∼ N(β (i−1) , var( β)). Step Step 4 Generate a sample u from U(0, 1). Step 5 Set (α, β) = (α (i) , β (i) ), if u ≤ h; otherwise (α, β) = (α (i−1) , β (i−1) ). Step 6 Set i = i + 1. Step 7 Repeat Steps (2-6) N number of times to get α (1) , . . . , α (N ) and β (1) , . . . , β (N ) . Then, the Bayes estimates of the parameters under SELF are given by The Bayes estimates of the parameters under LLF are obtained as Further, the Bayes estimates of the parameters under GELF are proposed as (3.20) In this section, three types of confidence intervals for α and β, namely asymptotic confidence intervals, bootstrap confidence intervals and highest posterior density intervals are constructed. The 100(1 − γ)% asymptotic confidence intervals for α and β can be constructed by using asymptotic normality property of the MLEs α and β. In doing so, variance of α and β are required. These can be obtained from main diagonal elements of the inverse of the observed Fisher information matrix I −1 ( α, β), where I( α, β) = −l 20 −l 11 −l 11 −l 02 (α,β)=( α, β) (4.1) and l ij = ∂ 2 logL ∂θ i ∂θ j , Θ = (θ 1 , θ 2 ) = (α, β). Therefore, the 100(1 − γ)% approximate confidence intervals for α and β are respectively given by where z γ 2 is the upper γ 2 -th percentile point of a standard normal distribution. The approximate confidence intervals are acceptable when the effective sample size m is large. When m is small, efficiency of construction of the asymptotic confidence intervals may not work properly. Bootstrap re-sampling may produce approximate confidence intervals with more accuracy. Here, two types of parametric bootstrap confidence intervals are constructed. Step 1 Compute α and β. Step 2 Use these α, β and same T , R i 's and m to generate a bootstrap re-sample. Step 3 Obtain bootstrap estimates α B and β B from these bootstrap sample. Step 4 Step 5 Rearrange these bootstrap estimates in ascending order as α B(1) , . . . , α B(N ) and β B(1) , . . . , β B(N ) . The 100(1 − γ)% percentile bootstrap confidence intervals for α and β are respectively given by Steps 1-2 are similar to above discussed Boot-p method. Thus, we only state Steps 3 and 4 below: Step 3 Set t-statistics as and then compute. Step 4 Repeat Steps 2-3 up to N times to get and then rearranging these to get T 1 (1) , . . . , T 1 (N ) and T 2 (1) , . . . , T 2 (N ) . Then, 100(1 − γ)% bootstrap-t confidence intervals are obtained as and T 2 (N γ/2) , T 2 (N (1−γ/2)) , respectively. In this section, using MCMC samples α (1) , . . . , α (N ) and β (1) , . . . , β (N ) and the method given . Then, put away these intervals for which l 1k and l 2k become the smallest to obtain the HPD credible intervals. In this section, a Monte Carlo simulation study is carried out to compare the performance of different estimates of parameters for Gumbel type-II distribution based on adaptive type-II progressive hybrid censoring schemes. The performance of all estimates has been compared in terms of their absolute bias (AB) and mean squared errors (MSEs). For this purpose, 10000 AT-II progressive hybrid censored samples are generated by using different values of n, m, T along with three following censoring schemes: The average bias and MSEs of the MLEs, MPSEs and Bayesian estimates are computed for α = 1.5 and β = 0.75. These are presented in Table 1 , Table 2 , Table 4 and Table 5 . The Bayesian estimates are computed by using MCMC method along with 5000 MCMC samples. The hyper parameters in gamma prior are considered as a = 3, b = 2, c = 3, and d = 4. Further, the 95% confidence interval lengths and coverage probabilities for asymptotic confidence intervals, bootstrap intervals and HPD credible intervals are computed and tabulated in Table 3 and Table 6 . From the tables, the following conclusions are made : (v) It have been noticed that the Bayes estimates perform better than the classical estimates in terms of the absolute bias and MSEs. In classical estimates, MPSEs perform better than MLEs in terms of the absolute bias and MSEs. (vi) Bayes estimates with respect to the LLF when p = 0.25 provide superior performance than other Bayes estimates. (vii) The performance of the HPD credible intervals is better than other confidence intervals in the sense of the average lengths and coverage probabilities. In this section, a real data set is considered to illustrate the established estimates. Gumbel type-II distribution can be used as an alternative to some well known two parameter distri- Table 7 . If we get value of C * and A * smaller but p-values greater, then the distribution fits better than other one. Data : Death rates due to Covid-19 in India. This following data set is taken from https://www.worldometers.info/coronavirus/country/india/, which represents the death rates due to Covid-19 in India from March 16 to May 13, 2020. The data set is provided in the following: -13.33, 17.65, 17.65, 16.67, 17.86, 17.86, 22.58, 22.73, 20, 21.82, 30.77, 21.51, 22.22, 22.13, 23.88, 22.15, 28.16, 27.38, 30.94, 30.18, 26.46, 26.16, 25.48, 26.02, 26.33, 24.34, 22.91, 23.46, 23.26, 22.43, 21.87, 20.22, 19.23, 17.46, 16.38, 15.32, 13.96, 13.48, 12.58, 12.43, 12.20, 11.90, 11.63, 11.51, 11.34, 11.29, 10.89, 10.90, 10.57, 10.87, 10.69, 10.43, 10.12, 9.99, 9.82, 9.54, 9.23, 9, 8.81, 8.65, 8.34, 7.74, 7.60, 7.45, 7.24, 7.03, 6.87, 6.71, 6.64, 6.52, 6.43, 6.33, 6.27,6.23,5.68,5.63,5.56, 5.53, 5.49, 5.53, 5.54, 5.55, 5.53, 5.50, 5.47, 5.44, 5.44, 5.47, 5.45, 5.36 . For the purpose of goodness of fit test, we consider different plots in Figure 3 , Figure 4 and Figure 5 . In Figure 3 , the theoretical CDFs of the distributions are compared with the empirical CDF of the given real data set. In Figure 4 , the QQ-plots are used to compare the fitted theoretical models. In Figure 5 (a), the box plot of the real data set is displayed, which represents that the distribution is right skewed. The TTT plot for given real data set is shown by Figure 5 (b). From Table 7 , it is concluded that Gumbel type-II distribution is fitted to that data better than NH, Burr III and IKum distributions. From the real data set, different AT-II progressive hybrid censored samples are considered by using different values of n, m and T . The computed values of the MLEs, MPSEs and Bayes estimates based on the real data set are tabulated in Table 8 . Further, the confidence intervals are constructed and tabulated in Table 9 . From Table 8 , it is observed that the Bayes estimates perform better than the other estimates. From Table 9 , it is seen that the HPD credible interval performs better than asymptotic confidence intervals. Table 9 : Confidence intervals of parameters for real data set when T = 15. In this article, different estimates of the unknown model parameters of the Gumbel type-II distribution based on AT-II PHCS have been developed. Both classical and Bayesian estimation methods are used to obtain the estimates. It is observed that the MLEs and MPSEs can not be obtained explicitly. So, Newton Raphson iterative method is employed to compute these estimates. Bayes estimates are obtained based on the symmetric and asymmetric loss functions under the assumption of independent gamma priors using MCMC method. Three types of confidence intervals for unknown parameters are constructed. Then, Monte carlo simulation study is performed to compare the performance of the estimates in terms of the absolute bias and MSEs. It is observed that the Bayes estimates under LINEX loss function perform better than the other estimates. Further, a real data set is considered for illustrative purposes. 10*5) (1.45,2.42) (1.91,2.18) (3.37,140.12) (85.96,86.12) 15 (1.52,2.53) (1.90,2.12) (0.78,165.33) (85.87,86.06) 10 (0*30 Bayesian estimation of Gumbel type-II distribution Bayesian estimation of Gumbel type-II distribution under type-II censoring with medical applications Adaptive type-II progressive censoring schemes based on maximum product spacing with application of generalized Rayleigh distribution Estimation for the scaled half logistic distribution under type-II censoring Point and interval estimation for Gaussian distribution, based on progressively type-II censored samples Hybrid censoring: Models, inferential results and applications Monte Carlo methods in Bayesian computation Estimating parameters in continuous univariate distributions with a shifted origin Exact likelihood inference based on type-I and type-II hybrid censored samples from the exponential distribution Generalized inverted exponential distribution under hybrid censoring, Statistical methodology Truncated life tests in the exponential case Statistics of extremes Statistical analysis of the log-normal distribution under type-II progressive hybrid censoring schemes Inference for Burr XII distribution under type-I progressive hybrid censoring On hybrid censored Weibull distribution Analysis of type-II progressively hybrid censored data Estimating the parameters of the generalized exponential distribution in presence of hybrid censoring Statistical models and methods for lifetime data On progressive hybrid censored exponential distribution Statistical inference of type-II progressively hybrid censored data with Weibull lifetimes Estimation of parameters and reliability characteristics for a generalized Rayleigh distribution under progressive type-II censored sample, Communications in Statistics-Simulation and Computation On the existence and uniqueness of the maximum likelihood estimate of a vector-valued parameter in fixed-size samples Estimation of Burr type XII parameters under adaptive type-II progressive hybrid censoring scheme Statistical analysis of exponential lifetimes under an adaptive type-II progressive censoring scheme On adaptive progressive hybrid censored burr type III distribution: application to the nano droplet dispersion data, Quality Technology & Quantitative Management Estimation of the inverted exponentiated Rayleigh distribution based on adaptive type II progressive hybrid censored sample Estimating the parameters of a Burr distribution under progressive type-II censoring E-bayesian analysis of the Gumbel type-II distribution under type-II censored scheme Study of the left censored data from the Gumbel type-II distribution under a Bayesian approach Estimation of the Weibull parameters under type-I censoring Estimation procedures for Maxwell distribution under type-I progressive hybrid censoring scheme