key: cord-0705747-2buug2oy authors: Shanmugam, Ramalingam; Ledlow, Gerald; Singh, Karan P title: Stochasticity among Victims of COVID-19 Pandemic date: 2022-01-04 journal: J Multidiscip Healthc DOI: 10.2147/jmdh.s322637 sha: a6d5a2abeb848df6a5a8f7cdfa379da4553e56dd doc_id: 705747 cord_uid: 2buug2oy This article provides a thorough explanation of methods and theoretical concepts to detect infectivity of COVID-19. The concept of heterogeneity is discussed and its impacts on COVID-19 pandemics are explored. Observable heterogeneity is distinguished from non-observable heterogeneity. The data support the concepts of heterogeneity and the methods to extract and interpret the data evidence for the conclusions in this paper. Heterogeneity among the vulnerable to COVID-19 is a significant factor in the contagion of COVID-19, as demonstrated with incidence rates using data of a Diamond Princess cruise ship. Given the nature of the pandemic, its heterogeneity with different social norms, pre- and post-voyage quick testing procedures ought to become the new standard for cruise ship passengers and crew. With quick testing, identification of those infected and thus, not allowing to embark on a cruise or quarantine those disembarking, and other mitigation strategies, the popular cruise adventure could become norm for safe voyage. The novel method used in this article adds valuable insight in the modeling of disease and specifically, the COVID-19 virus. In the literature, the term heterogeneity echoes differently in one context versus another. Its root word lies in Greek "heterogenes" meaning different. In scientific disciplines, the word heterogeneity is popularly mentioned to have existed when the variance is large. In insurance applications, for example, the premium is assessed more if an insurer is in a heterogeneous group with high hazard proneness. 5 The large (small) variance is indicative of heterogeneity (homogeneity). Ecochard 6 has interesting discussions for heterogeneity. In healthcare disciplines, heterogeneity is referred to as different outcomes among patients. The utilized mathematical expressions are eased in this section with added descriptions. The main topic is all about detecting reasons for complex infectivity of COVID-19 and they squarely connect to their heterogeneity. There are two types of heterogeneity. One is the observable heterogeneity, and the other is non-observable heterogeneity. We have devised a novel method to identify and estimate the level of each type in this article. The heterogeneity is linked with a nonobservable hidden trait as done in genetics. The heterogeneity refers dissimilar attributes across the subgroups of the population itself even before sampling. In a sense, the heterogeneity really points to the non-identical nature in a random sample or population. Sometimes, the heterogeneity implies a shifting stochasticity. In genetic studies, several authors refer to genetic heterogeneity as rather too difficult to ascertain. They really mean that because the alleles in more than one locus exhibit susceptibility to any disease including COVID-19, there is a need to track the loci to infer their heterogeneity. So, in a sense, the application of heterogeneity is really a discussion of an opposite to similarity across loci. The reader is referred to Elston et al (2002, pages 3404-344) for details. 2 Hope and Norris 7 attempted to determine how heterogeneity played a role in judgements in the context of crime victimization. A formal definition of heterogeneity is examined later in the article and its properties are explored and itemized. We use the notations y, θ, and H 0 to describe the observation, the unknown parameter, and the posed conjecture (in other words, hypothesis) pertinent to the chance-oriented mechanism behind the COVID-19 pandemic. However, in the literature, using a random sample y 1 ; y 2 ; . . . ; y n from a population whose main parameter is θ, when the null hypothesis H 0 : θ 1 ¼ θ 2 ¼ . . . : ¼ θ n is tested, it is named the homogeneity test. This suggests that heterogeneity is really all about a shifting population. This creates more confusion. The source of such confusion with respect to heterogeneity emanates from its ill-communication. It is evident that there is a lack of a clear definition of heterogeneity given by Hunink et al (Chapter 12), 8 for details. Neither the Encyclopedia of Statistical Sciences nor the Encyclopedia of Biostatistics has even an entry as if it is not pertinent in statistical disciplines. One comes across different types of data in scientific studies. Drawing data from a binomial population is one of them and the data should possess an under dispersion (ie, variance of the binomial distribution is smaller than its mean). 1 From a Poisson population, the drawn random sample ought to reflect equality between the mean and variance. When the main (incidence rate) parameter of a Poisson chance mechanism is stochastically transient, the unconditional observation of the random variable convolutes to an inverse binomial model. 9 The inverse binomial distribution is known to attest that the variance is larger than its mean (Stuart and Ord 10 for details). Consequently, a comparison between the mean and variance characterizes only which type of binomial, Poisson, or inverse binomial possesses the underlying chance mechanism we are sampling from but does not inform anything about heterogeneity. Recently, Hassen et al 11 employed a statistical concept behind a stochastic hybrid Susceptible-Infectious-Removed (SIR) framework in a Poisson chance mechanism for COVID-19 evolution and transmission in the Maghreb Central Regions (which consists of Tunisia, Algeria, and Morocco) in Western Africa. The COVID-19 pandemic is virulent and rapidly spreading in Maghreb Central Regions as much as elsewhere in other parts of the world. Their version of the SIR-Poisson model successfully predicted the range of the future infected cases since its emergence until the end of the confinement period April 2020. They estimated an average number of two contacts in Tunisia, while it was three contacts in Algeria and Morocco by an infected individual. According to them, the pandemics declined in each of the three countries but did not end. Furthermore, they detected an evolutionary change in the sense that the pandemics spreaded more rapidly in Morocco and Algeria than in Tunisia, though the most affected country was Algeria with more deaths despite a high number of cures. With details about the probabilistic patterns among coronavirus confirmed, recovered or cured people and those that succumb as fatalities/deaths in the 32 states/ territories of India are given by Shanmugam. 13 To track the confusion with respect to heterogeneity, let us consider the data given in Table 1 , 12 describing the spread of COVID-19 among the voyagers in a Diamond Princess cruise ship, during the month of February 2020. The random variables Y 1 , Y 2 , and Y 3 denote, respectively, the number of COVID-19 cases, the number of asymptomatic cases and the number of symptomatic cases among them in time (date). Under a given COVID-19's prevalence rate, λ>0, the number Y 1 perhaps follows a Poisson probability pattern. For a given number of COVID-19 cases in a date, the number Y 2 perhaps follows a binomial probability pattern with parameters ðy 1 ; pÞ, where 00. That is, the conditional probability of observing y 1 number of COVID-19 cases under a prevalence rate λ>0 is The reader is referred to Rajan and Shanmugam 14 for detailed derivations of the Poisson mean and variance. The prevalence parameter λ itself is crucial in our discussions. The Poisson variability cannot be heterogeneity because the expected value also changes when the variability changes due to their inter-relatedness. Realize that no two individuals on the ship are assumed to have the same level of susceptibility to the COVID-19 virus. It is reasonable to imagine that the prevalence levels follow a conjugate, stochastic gamma distribution. The socalled conjugate prior knowledge in the Bayesian framework smooths the statistical analytic process. It is known that the conjugate prior for the Poisson distribution is gamma, whose pdf is cðλ α; β j Þdλ ¼ e À ðαλÞ ðαλÞ βÀ 1 dðαλÞ=ΓðβÞ; α>0; β>0 (1) with an average. Eðλ α; β j Þ ¼ β α and variability Varðλ α; β j Þ ¼ Eðλ α; β j Þ=α, where the parameters α and β are recognized as hyper-parameters. 14 Note that the hyperparameter α>0 causes the variability in the COVID-19's prevalence rate to fluctuate up or down and hence, you would anticipate the heterogeneity to involve the hyperparameter α. But the question is how? We assume that the probability of observing a nonnegative COVID-19 cases, y 1 is a Poisson under a stable sampling population PrðY 1 λ j Þ with an expected number EðY 1 λ j Þ ¼ λ and a variability VarðY 1 λ j Þ ¼ EðY 1 λ j Þ. With replications, the observable heterogeneity should become estimable. That is to mention, the maximum likelihood estimate (MLE) of the COVID-19 prevalence rate is the average number, � y 1 , of the observations. To discuss the non-observable heterogeneity, we need to integrate its conjugate prior cðλ α; β j Þ for the non-observable λ with the likelihood PrðY 1 λ j Þ and it results in an update and it is called posterior pdf for λ. The expressions for nonobservable heterogeneity, observable heterogeneity and other expressions are derived and used. In this section, we explore heterogeneity for two subbinomial processes emanating from a Poisson process. The asymptomatic number, Y 2 and symptomatic number, Y 3 of COVID-19 cases are two branching binomial random numbers out of the Poisson random number, Y 1 ¼ 0; 1; 2; . . . ; of COVID-19 cases. These two split random variables are complementary of each other in the sense Then, what are the underlying model for Y 2 and for Y 3 ? Are they correlated random variables? If so, what is their correlation? These are pursued in this section. Let an indicator random variable, I i ¼ 1 for a COVID-19 person to show asymptomatic symptom with a probability, 0