key: cord-1038282-9bxpsgm4 authors: Huang, Fengfeng; Guo, Pengfei; Wang, Yulan title: Optimal Group Testing Strategy for the Mass Screening of SARS-CoV-2 date: 2022-05-22 journal: Omega DOI: 10.1016/j.omega.2022.102689 sha: 5c16b84e880bc555b3dc9903d9c71a294206513e doc_id: 1038282 cord_uid: 9bxpsgm4 We analyze the group testing strategy that maximizes the efficiency of the SARS-CoV-2 screening test while ensuring its effectiveness, where the effectiveness of group testing guarantees that negative results from pooled samples can be considered presumptive negative. Two aspects of test efficiency are considered, one concerning the maximization of the welfare throughput and the other concerning the maximization of the identification rate (namely, identifying as many infected individuals as possible). We show that compared with individual testing, group testing leads to a higher probability of false negative results but a lower probability of false positive results. To ensure the test effectiveness, both the group size and the prevalence of SARS-CoV-2 must be below certain respective thresholds. To achieve test efficiency that concerns either the welfare throughput maximization or the identification rate maximization, the optimal group size is jointly determined by the test accuracy parameters, the infection prevalence rate, and the relative importance of identifying infected subjects. We also show that the optimal group size that maximizes the welfare throughput is weakly smaller than the one that maximizes the identification rate. Mass screening is an essential means to curb the spread of the coronavirus disease 2019 due to the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection. However, due to the limited test capacity, the extent of screening work usually falls short of our expectation. In order to reconcile the tight testing capacity and the urgent need for large-scale tests, "group testing" protocols have been put forward (see, e.g., Ackerman et al. (2020) ; Lohse et al. (2020) ; Shenta et al. (2020) ) and into practice in several places, including Wuhan city of China (Walden and Zhao, 2020) and Germany (Healthcare in Europe, 2020). With group testing, samples from multiple subjects are pooled together and tested as one to check the presence of the virus. The pooled sample shall be tested negative if none of the sampled individuals is infected and positive otherwise. When a pooled sample is tested positive, each sample of the pool is then tested individually. Group testing was developed by Dorfman (1943) as a cost efficient technique to eliminate syphilis men in army recruits and has been used afterwards in screening a variety of infections, including HIV, H1N1 influenza, malaria, Zika virus (see, e.g., Pilcher et al. (2002) ; Cowling et al. (2010) ; Hsiang et al. (2010) ; Kuehnert (2016) , respectively). In these studies, the group testing strategy aims to minimize the expected number of tests. However, they usually overlook how group testing affects the overall test accuracy. Virtually, no test is 100% accurate, because the distribution of the true positive result and that of the true negative result usually overlap and there does not exist a single decision criterion that could separate the samples cleanly (Metz, 1978; Wilson et al., 1968) . When choosing the decision criteria for a test (which determine test specificity and sensitivity), several clinical factors shall be taken into consideration, including the disease prevalence and the relative importance of identifying infected subjects (Metz, 1978) . Due to that pooling samples together alters the disease prevalence (among the pooled samples), the frequency of errors and the overall test accuracy of group testing are different from those of individual testing. In particular, for communicable diseases, the consequences associated with false positive results are more severe than those associated with false negative results, as false positives only lead to further testing whereas false negatives lead to further spread of the disease. Thus, it is needed to take into account the impacts of false positives and false negatives when developing a group testing strategy. We develop a stylized model to investigate the optimal group testing strategy, i.e., the number of individual samples to be mixed together as one, so that the efficiency of the test is maximized. Two aspects of test efficiency are considered, one concerning the maximization of the identification rate (namely, identifying as many infected individuals as possible) and the other concerning the maximization of the welfare throughput. The first aspect is in essence the same as minimizing the expected number of tests, which, as has been stated above, is the primary concern of the group testing strategy in the extant literature. The second aspect takes account of the consequences associated with false positive and false negative results, which, despite not having drawn enough attention in the group testing literature, have been widely recognized by the medical community; see, e.g., Wilson et al. (1968) ; Metz (1978) . When deciding the optimal group size, both aspects of test efficiency shall be subject to the group testing effectiveness requirement. For instance, the Food and Drug Administration of the United State suggests that group testing is effective only if each individual subject in a pooled sample can be considered presumptive negative whenever the pooled sample is tested negative (United State Food and Drug Administration, 2020). We show that compared with individual testing, group testing leads to an increased probability of false negative results and a decreased probability of false positive results. The effectiveness of group testing requires both the infection prevalence rate and the group size of individual samples below certain thresholds. These thresholds are jointly determined by the test accuracy, which includes test specificity (the probability of correctly identifying an uninfected subject) and test sensitivity (the probability of correctly identifying an infected subject), and the relative importance of identifying infected subjects. We also show that the optimal group size hinges on the test accuracy, the infection prevalence rate, and the relative importance of identifying an infected subject. Moreover, the optimal group size under the welfare throughput maximization is weakly smaller than that under the identification rate maximization. When the agent aims to maximize the welfare throughput and test specificity is 100% (i.e., no false positive results), the optimal group size is the same as the one when the agent aims to maximize the identification rate, and it equals the minimum of two values with one corresponding to the group size that ensures the effectiveness of group testing and the other corresponding to the group size that minimizes the required testing times per sample. We further conduct the extensive numerical studies by taking into account that (1) test outcomes are usually not dichotomous and (2) it is possible to alter the decision criteria to align with the purpose of the test in practice; see, e.g., Metz (1978) ; Wilson et al. (1968) , where decreasing (increasing) test specificity is coupled with increasing (decreasing) test sensitivity. Our numerical experiments reveal that a decision criterion associated with a slightly lower test specificity and a higher test sensitivity can improve the test efficiency of group-testing. Moreover, the relative advantage of group testing over individual testing in improving test efficiency diminishes when the infection prevalence rate is high. The remainder of this paper is organized as follows. The related literature is reviewed in Section 2. Section 3 discusses the model setup. We analyze the optimal group sizes in Section 4. Concluding remarks are provided in Section 5. Ever since group testing was developed by Dorfman in 1943 in eliminating syphilis man in army (Dorfman, 1943) , the literature on group testing has being flourishing. Researchers have investigated the feasibility of group testing in a wide range of infections through experimental studies and clinic practice, including HIV, H1N1 influenza, malaria, and Zika virus; see Pilcher et al. (2002) ; Cowling et al. (2010) ; Hsiang et al. (2010) ; Kuehnert (2016) . In response to the recent COVID-19 outbreak, group testing protocols for SARS-CoV-2 has surged (Ackerman et al., 2020; Lohse et al., 2020; Shenta et al., 2020) . Group testing pools samples collected from multiple subjects into a single pool and tests the pooled sample for the presence of the virus; the pool shall be tested negative if none of the sampled individuals is infected and positive otherwise. Based on the group testing method, a protocol called the square array method was proposed by Phatarfod and Sudbury (1994) to further improve test efficiency. Under the square array method, samples are placed into a square array where each row and column forms a pool, and samples whose row and column pools both tested positive shall be deemed positive. The group testing strategy and its square-array derivative increase testing throughput, and have been analyzed by researchers from multiple disciplines by considering various perspectives, including search theories, learning theories, information theories, and control theories; see, e.g., Malyutov (2013) The aforementioned studies overlook the following two facts: One, the group testing strategy affects the frequencies of false positive and false negative results. Two, the consequences associated with false positive results are more severe than those associated with false negative results for communicable diseases (Wilson et al., 1968; Aprahamian et al., 2019) . Our work is also related to the studies on balancing the system performance (such as social/patient welfare and system congestion) and diagnosis accuracy, including Alizamir Other related studies include Tu et al. (1995) ; Zenios and Wein (1998) ; Lin et al. (2020) , which evaluate the effectiveness of group testing by using statistical or stylized models. Both Tu et al. (1995) and Zenios and Wein (1998) estimate the prevalence of HIV by adopting the group testing method. Tu et al. (1995) show that group testing not only reduces the probability of false positive results but also improves the accuracy of the estimator. Zenios and Wein (1998) take into account that HIV test outcomes are not dichotomous but continuous. They develop a parametric procedure and a hierarchical model to derive the decision criteria that yield accurate estimates. Lin et al. (2020) propose a group testing strategy to reduce the prevalence rate of COVID-19 in a closed community. Consider a representative testing center (the agent) that provides SARS-CoV-2 (which causes COVID-19) mass screening tests to the public. The test capacity of the agent (namely, the number of tests the agent can perform at a time) is fixed, and his objective is to maximize the efficiency of group testing by choosing the appropriate group size. We consider two aspects of test efficiency, one concerning maximizing of the welfare throughput, and the other con-cerning maximizing the identification rate, that is, identifying as many infected individuals as possible. Let p 0 denote the infection prevalence rate; it measures the probability of an asymptomatic subject being infected. As of March 2022, the percentage of tested individuals being tested positive is no more than 20% in most countries (Ritchie et al., 2022) , and such percentage shall be lower when mass screening is considered. Given such evidence, we consider that p 0 < 20% in this study. Denote a subject's true type by t, where t = 1 indicates the subject (she) is infected and t = 0 uninfected. The screening test categorizes subjects as positive (s = 1) and negative (s = 0). The screening test is not perfect. Depending on the subject's true type t (t ∈ {0, 1}), the probability of s = t is as follows: α := P (s = 0|t = 0) and β := P (s = 1|t = 1). The parameters α and β are known as test specificity (the probability of correctly identifying an uninfected subject) and test sensitivity (the probability of correctly identifying an infected subject), respectively. It is reported that the accuracy of SARS-CoV-2 screening testing usually has an average sensitivity of 84.3% and a specificity of between 96.6% and 99.7%; see, e.g., the systematic review of Bastos et al. (2020) . Based on Bastos et al. (2020) and for the sake of analytical tractability, we restrict the parameter values to the case 1.75 < α + β < 2. Given a subject's true type t = i (i ∈ {0, 1}), she obtains a reward V i if she is correctly identified (s = i) and suffers a loss L i if misidentified (s = 1 − i). V i is the potential benefit associated with either being assured of the health status when i = 0 or receiving the proper treatment when i = 1, and L i is the disutility/loss incurred from either not taking the proper treatment (or the potential pandemic containment failure) when i = 1 or the unnecessary stress suffered by the concerned individual (or the health care system) when i = 0. Following Huang et al. (2022) , we assume that which indicates that even though the screening information is imperfect, it is valuable to all individuals. Define which is an indicator of the relative importance of identifying an infected subject. As far as the communicable diseases are concerned, identifying positive cases are extremely important (Wilson et al., 1968) , and hence, the value of θ shall be large. The group test is composed of two stages. In Stage 1, the agent pools samples into equalsized groups and performs tests on the groups. If a group is tested negative, every sample in the group is considered to be uninfected; otherwise, it goes to Stage 2, in which every sample in the group is tested individually. Let x K denote the prevalence of SARS-CoV-2 among the groups when the group size is K (K = 2, 3, ...). Then, we have Test Effectiveness. When maximizing the efficiency of group testing, the agent is often subject to the effectiveness requirement. For instance, one important concern of the Food and Drug Administration of the United State with group testing is whether negative results from the pooled samples can be presumptive negative (United State Food and Drug Administration, 2020). Group testing can be regarded effective only when the expected reward of considering negative pooling test results as presumptive negative is greater than that as presumptive positive; that is, where and g(x K ) is the posterior probability that a group of size K contains at least one infected subject given that the test result is negative. When the condition stated in (1) is violated, the groups with negative results still need to proceed to Stage 2, and hence, group testing cannot reduce the required number of tests. Test Efficiency. Given the group size K, let P ij (K) denote the probability of identifying a As the agent's test capacity is fixed, maximizing the welfare throughput (the identification rate, respectively) shall be equivalent to maximizing the welfare throughput (the identification rate, respectively) per test. Denote the welfare throughput per test and the identification rate per test as W t and W r , respectively. Then, both of which shall be subject to the effectiveness requirement indicated by (1). In the welfare throughput maximization objective function (2), the numerator is the expected reward of a representative subject, in which P 00 (K)V 0 − P 01 (K)L 0 and P 11 (K)V 1 − P 10 (K)L 1 are her expected rewards if she is uninfected (t = 0) and infected (t = 1), respectively. In the identification rate maximization objective function (3), P 11 (K)p 0 K is the expected number of infected individuals found in each group as p 0 K is the expected number of infected individuals in a group. In this section, we first examine how group testing affects test outcomes and the average number of tests required per individual sample. Next, we derive the condition that ensures the test effectiveness. After that, we derive the optimal group sizes that ensure test efficiency in two aspects, welfare throughput maximization and identification rate maximization. Last, we conduct numerical experiments to strengthen our understanding of the implications of group testing. First, we provide the following result. Lemma 1. Suppose that the group size is K(> 1). Then, 1. the average number of tests required per individual sample is λ(K 2. the false negative probability, i.e., the probability that an infected subject is tested negative, is P 10 (K) = 1 − β 2 ; 3. the false positive probability, i.e., the probability that an uninfected subject is tested Lemma 1 shows that given test specificity α and test sensitivity β, group testing leads to a higher false negative probability (because 1 − β 2 ≥ 1 − β) and a lower false positive testing. Moreover, the false negative probability is independent of the group size whereas the false positive probability increases with the group size. With regard to the effectiveness of the test, we obtain the following result. Lemma 2. To ensure the effectiveness of group testing, the infection prevalence rate shall The group size K shall not exceed an upper bound K, where Both p 0 and K decrease with θ (the relative importance of identifying an infected subject) and increase with test specificity α and test sensitivity β. Lemma 2 shows that to ensure the effectiveness of group testing, the infection prevalence rate p 0 shall not exceed an upper threshold p 0 that is jointly determined by the relative importance of identifying an infected subject θ and the test accuracy parameters α and β. It also suggests that government health agencies shall provide operational guidelines on the implementation of group testing such as the maximum pool size that depends on p 0 , θ, α, and β. The results stated in Lemma 2 are also consistent with the observed practice. For example, regarding SARS-CoV-2 screening tests, the U.S. Food and Drug Administration authorized several public health laboratories to conduct group testing with 5 to 8 samples, depending on their respective test technology and accuracy (Schneider, 2020) , and a group size of 10 to 25 is allowed in New York State (New York State, 2020). We further investigate how the infection prevalence rate p 0 affects the group size upper bound K via a numerical experiment; see Figure 1 . It shows that K is decreasing with p 0 . The relative importance of identifying infected subjects θ also plays a critical role, where a higher θ leads to a smaller K. Figure 1 shows that given α = 100% and β = 82%, if θ = 99.0%, group testing is effective only when the infection prevalence rate p 0 is below 3.5%, while if θ = 98.0%, it is effective as long as p 0 is no higher than 6.8%. Based on Lemmas 1 and 2, we can obtain the following result regarding the optimal group size. (i). When the agent aims to maximize the welfare throughput, the optimal group size is K * = min{ K, K} if α = 1, min{ K, K} if 1/2 < α < 1; (ii). When the agent aims to maximize the identification rate, the optimal group size is where K < K, K takes one of the following two values, − γ ln(1−p 0 ) or − γ ln(1−p 0 ) + 1, whichever yields a larger value of the objective function, and γ (0 < γ < 1) satisfies ln(1 − p 0 ) + (α + β − 1)γ 2 e −γ = 0. Proposition 1 shows that under both test efficiency objectives-the welfare throughput maximization and the identification rate maximization-the magnitude of the optimal group size K * highly depends on the prevalence of virus infection p 0 , test accuracy parameters α and β, and the relative importance of identifying an infected subject θ. We can easily conclude from Proposition 1 that the optimal group size under the welfare throughput maximization is weakly smaller than that under the identification rate maximization. Interestingly, the optimal group size that maximizes the welfare throughput when α = 100% is the same as that maximizes the identification rate, and it equals the minimum of K, the largest allowable group size that ensures the effectiveness of group testing (see Lemma 2), and K, the group size that minimizes the required testing times per sample. Metz (1978) illustrates this gist. According to Metz (1978) , in Figure 2 , any vertical line across the overlapped area of the two distributions can be a decision criterion that specifies a unique combination of test specificity and test sensitivity (α, β). Take the red line in Figure 2 As an example. The dark gray area formed by the red line and the distribution curve of true negative cases represents the false positive fraction and determines a unique value of 1 − α, while the light gray area formed by the red line and the distribution curve of true positive cases represents the false negative fraction and determines a unique value of 1 − β. If we move the red line towards the left, test specificity α decreases while test sensitivity β increases; vice versa. By varying the decision criteria via moving the red line, multiple combinations of test specificity and test sensitivity are obtained, based on which the medical community usually graphically plot a receiver operating characteristic (ROC) curve (Metz, 1978) . Note that the ROC curve plots the (1 − α, β) pair rather than the (α, β) pair. By adopting the theoretical guidelines of Metz (1978) along with the observed accuracy of SARS-CoV-2 tests in practice, we set the distributions of true negative and true positive results to be normally distributed with mean 5 and 10 and variance 1 and 1.2, respectively. We then explore a series of possible decision criteria and obtain the corresponding ROC curve as shown in Figure 3 . We then select the following four test specificity and sensitivity combinations from the ROC curve-(100%, 82%), (99.9%, 93.8%), (99.5%, 97.7%), and (99.1%, 98.5%)-to conduct our numerical experiments, which are depicted as the red dots in Figure 3 . We now investigate how test specificity α and test sensitivity β affect the optimal group size K * and test efficiency. Our numerical experiments reveal that due to the high importance of identifying infected subjects for COVID-19 (i.e., θ is close to 1), the test effectiveness requirement characterized by the threshold group size K (given in Lemma 2) is usually the binding factor, leading to that the two aspects of test efficiency-maximizing the welfare throughput and maximizing the identification rate-yield the same optimal group size. Hence, for illustration purpose, we only present the numerical outcomes under the welfare throughput maximization; see Figure 4 . Figure 4(a) shows that the optimal group size K * decreases with the infection prevalence rate p 0 , and that K * is weakly larger when test specificity α decreases and test sensitivity β increases. Figure 4 (b) depicts the relative efficiency improvement of group testing over individual testing. It shows that when 0 < p 0 < 1%, the test specificity and sensitivity pair (α = 100%, β = 82.0%) leads to the highest test efficiency; when 1% ≤ p 0 < 10%, the test specificity and sensitivity pair (α = 99.5%, β = 97.7%) leads to the highest test efficiency; and when 10% ≤ p 0 < 20%, the test specificity and sensitivity pair (α = 99.1%, β = 98.5%) leads to the highest test efficiency. This indicates that as the infection prevalence rate p 0 increases, to facilitate group testing, a decision criterion associated with a slightly lower test specificity α and a higher test sensitivity β can improve test efficiency. Moreover, Figure 4 (b) shows that for any combination of α and β, the relative advantage of group testing over individual testing diminishes as p 0 increases. We consider an agent that provides SARS-CoV-2 screening tests to the public. The agent's objective is to maximize the efficiency of group testing. Two aspects of test efficiency are considered with one concerning the welfare throughput maximization and the other concerning the identification rate maximization. Apart from test efficiency, a group testing strategy must be "effective" in the sense that whenever a negative test result is obtained for a sample group, the group can be considered as presumptive negative. We show that compared with individual testing, group testing leads to a higher probability of false negative results but a lower probability of false positive results. To ensure the effectiveness of group testing, it requires that both the group size and the prevalence of SARS-CoV-2 are below certain thresholds. Moreover, the optimal group size that maximizes the welfare throughput is weakly smaller than the one that maximizes the identification rate, and the optimal group sizes under both objectives-the welfare throughput maximization and the identification rate maximization-are jointly determined by test accuracy parameters, the infection prevalence rate, and the importance of identifying an infected subject. Our numerical study shows that the efficiency of group testing can be improved if we increase test sensitivity (the probability of correctly identifying infected subjects) by sacrificing test specificity (the probability of correctly identifying uninfected subjects). Moreover, the relative advantage of group testing over individual testing in improving test efficiency diminishes-or even vanishes-when the infection prevalence rate is high for all combinations of test specificity and test sensitivity. Proof of Lemma 1. For a group test with size K > 1, if the test result is negative, the testing stops at Stage 1 with a total number of tests N = 1. Otherwise, the testing continues and proceeds to Stage 2, where additional K tests are needed. In this situation, the total number of tests N = 1 + K. The probability that the group is tested negative and that the group is tested positive can be respectively derived as Then, the average number of tests required for the group can be written as Thus, the average number of tests needed for each individual sample in the group is At Stage 1, a pooled sample is tested false negative with probability (1 − β)x K and tested true positive with probability βx K when the group has at least one infected subject. If the group test result is true positive in Stage 1, each individual sample is re-tested in Stage 2. Recall that each individual sample is tested false negative with probability (1 − β). Combining the above together, we can show that under the group testing, the expected number of false negative subjects can be derived as where Kp 0 x K is the expected number of infected subjects in a group that has at least one infected subject. Similarly, at Stage 1, a pooled sample is tested false positive with probability (1 − α)(1 − x K ) when the group has no infected subject. Then, at Stage 2, these K uninfected individual samples are re-tested and the result is false positive with probability (1 − α). When at Stage 1 a pooled sample is tested true positive (with probability βx K ), the expected number of uninfected samples in the group (that has at least one infected subject) is K − K p 0 x K . When the test proceeds to Stage 2, those unaffected samples are re-tested and the result is false positive with probability (1 − α). Combining the above together, we can show that under the group testing, the expected number of false positive subjects can be derived as The total numbers of true positive and true negative subjects in the group are p 0 K and (1 − p 0 )K, respectively. Thus, the probability of false negative and false positive test results for the group are, respectively, Proof of Lemma 2. Recall that the group testing strategy is effective only if the condition . We can show that this effectiveness condition is equivalent to Recall that x K = 1 − (1 − p 0 ) K . We can show that this requires . Since the group size is an integer, we can obtain that Considering that group testing requires K ≥ 2 (otherwise, it shall be tested individually), we further obtain that this requires It can be easily shown that (1−β)θ+α(1−θ) increases with θ and decreases with α and β. It can be also easily shown that both log 1−p 0 x and 1 − √ x decrease with x. By using the chain rule, we can show that K and p 0 decrease with θ and increase with α and β. Proof of Proposition 1. It is easy to show that when 0 < x < 1, By utilizing 3/4 < α + β − 1 < 1 and p 0 < 20%, we can show that Define the following function over the range x ∈ [2, ∞): Taking the first order derivative, we then have It can be easily shown that −1/x 2 and ln(1 − p 0 )(1 − p 0 ) x are both increasing and concave in x. This implies that −1/x 2 and ln(1 − p 0 )(1 − p 0 ) x cross each other at most twice; in other words, h (x) crosses 0 at most twice. We can show that where the inequality < can be obtained from (4) and 3/4 < α + β − 1 < 1, and the inequality ">" can be obtained based on (5). This implies that when 2 crosses 0 once at a local minimum. Thus, for x > −1/ ln(1 − p 0 ), h (x) crosses 0 at most once. Since h (−1/ ln(1 − p 0 )) > 0, the minimum on the interval shall be located at one of the two interval endpoints. We can also show that where the inequality follows from (5). Therefore, the global minimum of h(x) shall fall into the interval x ∈ [2, −1/ ln(1 − p 0 )] and is determined by x = −γ/ ln(1 − p 0 ) with γ being the unique solution that satisfies Welfare throughput maximization. First, consider the case α = 1. We can simplify the objective function (2) as follows: As the numerator is now a constant, the welfare throughput is maximized once the effective number of tests per individual sample λ(K) is minimized. We note that the function λ(K) is equivalent to the function h(x) except that λ(K) is defined on positive integers. The group size that minimizes λ(K), K, shall be one of the two integers around x = −γ/ ln(1 − p 0 ), namely, whichever yields a larger value of the welfare throughput. Also, recall that the group size shall not exceed K, and thus K * = min{K, K}. Next, consider 1/2 < α < 1. Define the following functions on the interval x ≥ 2: We can show that Recall that h (x) = 0 when x = −γ/ ln(1 − p 0 ), in which γ is uniquely determined by (7). Then, we have F (−γ/ ln(1 − p 0 )) = f (−γ/ ln(1 − p 0 )) h (−γ/ ln(1 − p 0 )) < 0. We have shown that h (x) crosses 0 at most once when x > −γ/ ln(1 − p 0 ). We further have the following two scenarios: 1. If h (x) does not cross 0, then h (x) > 0 for all x > −γ/ ln(1 − p 0 ), from which we can show that F (x) < 0. Hence, the maximum value of F (x) on this interval shall be obtained at x = −γ/ ln(1 − p 0 ); 2. If h (x) indeed crosses 0 once, by using f (x) < 0, we can show that F (x) crosses 0 at most once. As lim x→∞ F (x) = 0, together with (9), we can show that F (x) decreases on this interval. Hence, the maximum value of F (x) shall be obtained at x = −γ/ ln(1 − p 0 ). Thus, the maximum value of F (x) is obtained at x = −γ/ ln(1 − p 0 ) for x > −γ/ ln(1 − p 0 ). This indicates that the global maximum of F (x) shall be found in the interval 2 ≤ x ≤ −γ/ ln(1 − p 0 ). Let x 0 denote the global maximum of F (x). By using (9), we can show that 2 ≤ x 0 < −γ/ ln(1 − p 0 ). Note that the objective function (2) is equivalent to the function F (x) except that (2) is defined on positive integers. Thus, the optimal group size, denoted as K 0 , shall be one of the two integers around x 0 , i.e, K ∈ { x 0 , x 0 + 1}, whichever yields a larger value of welfare throughput. As the group size shall not exceed K, K * = min{ K, K} ≤ min{ K, K}, where the inequality is obtained because 2 ≤ x 0 < −γ/ ln(1 − p 0 ) . Identification rate maximization. From Lemma 1, we have P 11 (K) = 1 − P 10 (K) = β 2 . By using (3), we can rewrite the agent's optimization problem as follows: Then, W r is maximized whenever λ(K) is maximized. Obviously, λ(K) = h(K) for any integer K ≥ 2. We have shown that h(x) is minimized at x = −γ/ ln(1 − p 0 ), where γ is determined by (7). By considering the effectiveness constraint K ≤ K, it is easy to show that W r is maximized at K * = min{K, K}, where K is given by (8). Massively multiplexed nucleic acid detection with Cas13 Group testing: An information theory perspective. Foundations and Trends in Communications and Information Theory Diagnosis accuracy under congestion. Management Science Optimal risk-based group testing Diagnostic accuracy of serological tests for covid-19: systematic review and meta-analysis Entry screening to delay local transmission of 2009 pandemic influenza A (H1N1) The detection of defective members of large populations Corona 'pool testing' increases worldwide capacities many times over Modeling Patients' Illness Perception and Equilibrium Analysis of Their Doctor Shopping Behavior. Production and Operations Management PCR-based pooling of dried blood spots for detection of malaria parasites: optimization and application to a cohort of Ugandan children Screening of blood donations for Zika virus infection-Puerto Rico Group testing during the COVID-19 pandemic: optimal group size selection and prevalence control Pooling of samples for testing for SARS-CoV-2 in asymptomatic people Search for sparse active inputs: A review Analysis and applications of adaptive group testing methods for COVID-19 Basic principles of ROC analysis Governor Cuomo Announces New Testing Initiatives to Improve COVID-19 Detection & Control Across New York State The use of a square array scheme in blood testing Real-time, universal screening for acute HIV infection in a routine HIV counseling and testing population COVID-19 single-stage pooled testing method approved in Israel On the Informativeness and accuracy of pooled testing in estimating prevalence of a rare disease: application to HIV screening Does FDA have validation or other recommendations regarding SARS-CoV-2 diagnostic/screening tests for use with sample pooling? How China managed to test almost 1.5m people for coronavirus in a single day Design and analysis of diagnostic service centers Principles and practice of screening for disease Efficient highthroughput SARS-CoV-2 testing to detect asymptomatic carriers Pooled testing for hiv prevalence estimation: exploiting the dilution effect We are grateful to the department editor (Professor Joe Zhu), an anonymous associate editor, and two anonymous referees for their very helpful comments and suggestions. Fengfeng All authors contributed equally to the work.