key: cord-0929788-mxkfuya4
authors: Lin, D.; Gu, Y.; Zeng, D.; Janes, H.; Gilbert, P.
title: Evaluating Vaccine Efficacy Against SARS-CoV-2 Infection
date: 2021-04-17
journal: medRxiv : the preprint server for health sciences
DOI: 10.1101/2021.04.16.21255614
sha: fc85baf40e1fd7d9700f4d233b15d95179a97755
doc_id: 929788
cord_uid: mxkfuya4

Although interim results from several large placebo-controlled phase 3 trials demonstrated high vaccine efficacy (VE) against symptomatic COVID-19, it is unknown how effective the vaccines are in preventing people from becoming asymptomatically infected and potentially spreading the virus unwittingly. It is more difficult to evaluate VE against SARS-CoV-2 infection than against symptomatic COVID-19 because infection is not observed directly but rather is known to occur between two antibody or RT-PCR tests. Additional challenges arise as community transmission changes over time and as participants are vaccinated on different dates because of staggered enrollment or crossover before the end of the study. Here, we provide valid and efficient statistical methods for estimating potentially waning VE against SARS-CoV-2 infection with blood or nasal samples under time-varying community transmission, staggered enrollment, and blinded or unblinded crossover. We demonstrate the usefulness of the proposed methods through numerical studies mimicking the BNT162b2 phase 3 trial and the Prevent COVID U study. In addition, we assess how crossover and the frequency of diagnostic tests affect the precision of VE estimates.

Enormous progress has been made in the development of vaccines against severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Within 1 year after the emergence of this novel infection that caused a global pandemic, vaccine targets were identified, vaccine constructs were created, and phase 1 through phase 3 testing was conducted. Interim results from several large-scale phase 3 randomized, placebo-controlled clinical trials have demonstrated high vaccine efficacy (VE) against symptomatic COVID-19 [1] [2] [3] [4] . However, very little is known about VE against possibly asymptomatic SARS-CoV-2 infection.

It is critically important to assess VE against SARS-CoV-2 infection because reducing infection and community transmission is the key to halting the pandemic. Fortunately, most phase 3 trials have collected blood samples that can be used to identify SARS-CoV-2 seroconversion [1] [2] [3] [4] . For economic and logistical reasons, however, blood samples can only be drawn infrequently, such that seroconversion is only known to occur between two clinic visits that are weeks or months apart. It is more difficult to analyze such interval-censored seroconversion data than potentially right-censored symptomatic disease data, especially when VE changes over time. (An event time is said to be interval-censored if it is only known to lie in a time interval; an event time is said to be potentially right-censored if it is either observed exactly or known to be longer than the duration of follow-up [5] .) Additional challenges arise when community transmission varies over time and when participants are vaccinated on different dates because of either staggered enrollment of participants or crossover of placebo participants to the vaccine arm before the end of the trial. SARS-CoV-2 infection is commonly diagnosed by reverse transcription polymerase chain reaction (RT-PCR) on nasal swabs. Most phase 3 trials have collected nasal swabs at the enrollment and crossover visits [1] [2] 4] . However, such infrequent swab samples will miss many infections, because a person may be RT-PCR positive for only a few days or weeks after infection [6] . Some phase 3 trials have taken nasal swabs more frequently (e.g., twice a week) on a subset of participants, and the newly launched Prevent COVID U study takes nasal swabs every day; however, frequent RT-PCR testing increases trial cost. How does the RT-PCR testing schedule affect the estimation of VE against infection (defined as viral RNA above a minimum threshold)?

In this article, we show how to evaluate potentially waning VE against SARS-CoV-2 infection -defined by seroconversion or detectable viral RNA -using blood or nasal samples taken at varying levels of frequency under the conditions of time-varying community transmission, staggered enrollment of participants, and possible crossover of placebo volunteers to the vaccine arm before the end of the study. We demonstrate the usefulness of the proposed methods through extensive simulation studies mimicking the BNT162b2 phase 3 trial [1] and the Prevent COVID U study. In addition, we investigate how the frequency of diagnostic tests and the characteristics (blinded versus unblinded, priority-dependent versus priority-independent) of crossover affect the precision of VE estimation. Figure 1 shows the blood sampling schedules for several phase 3 vaccine trials. For the three vaccines that have received FDA's Emergency Use Authorization (EUA), blood samples are also taken at the crossover visits [1] [2] 4] .

We are interested in time to SARS-CoV-2 infection assessed by seroconversion, which is only known to occur between two blood draws and is thus interval-censored. We allow the risk of infection to vary over the calendar time and to depend on baseline risk factors, such as age, sex, ethnicity, race, occupation, and health conditions; we allow the effect of vaccine on infection to depend on the time elapsed since vaccination.

We consider three measures of VE as a function of time elapsed since vaccination: (1) VE h (t) is the percentage reduction in the hazard rate or instantaneous risk of infection at time t; (2) VE a (t) is the percentage reduction in the attack rate or cumulative incidence of infection over the time period (0, t]; and (3) VE a (t 1 , t 2 ) is the percentage reduction in the attack rate over the time period (t 1 , t 2 ]. Note that VE h (t) and VE a (t) pertain to instantaneous and cumulative vaccine effects, respectively, and that VE a (t) is a special case of 4 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted April 17, 2021. ; https://doi.org/10.1101/2021.04.16.21255614 doi: medRxiv preprint VE a (t 1 , t 2 ) with t 1 = 0 and t 2 = t. If the vaccine effect is constant over time, then the three VE measures are equivalent (provided that the infection rate is low) [7] .

In Supplementary Appendix 1, we formulate the above ideas through an adaptation of the well-known Cox [8] regression model, in which each participant's time to infection is measured from the start of the clinical trial, and the hazard ratio of infection for vaccine versus placebo depends on the time elapsed since vaccination. Because of staggered enrollment and staggered crossover, the serum sampling time points are scattered randomly over time, providing valuable information about the distribution of the infection time. We express VE h and VE a as appropriate functions of the time-varying hazard ratio. We derive the maximum likelihood estimator for the time-varying hazard ratio based on the interval-censored infection time data and provide the corresponding estimators of VE h and VE a .

The above framework also applies to RT-PCR tests of nasal swabs. Because an infected person is RT-PCR positive for a shorter period of time than they are seropositive (days/weeks versus months) [6, [9] [10] , nasal swabbing needs to be done more frequently than serum sampling in order to capture the infections defined by detectable viral RNA. With very frequent RT-PCR, time to detectable viral RNA may be treated as a potentially right-censored event time. In our framework, potentially right-censored data is a special case of interval-censored data, with the exactly observed event time lying within an interval of one day.

We conducted a series of simulation studies mimicking the BNT162b2 phase 3 trial [1] (Supplementary Appendix 2.1). We used a total of 40,000 participants who enter the trial at a constant rate over a 4-month period and are randomly assigned to vaccine or placebo in a 1:1 ratio. To reflect the increase of COVID-19 cases since last summer and the downward trend this spring, we let the risk of infection increase over the first 7 months and decrease afterward. In addition, we let the risk of infection depend strongly on priority tier.

As in the BNT162b2 phase 3 trial [1] , the vaccine in our simulation received an EUA from FDA at the 5th month, after which placebo participants are sequentially crossed over 5 to the vaccine arm. We considered:

Priority-dependent crossover. Crossover starts at month 6, 7, 8, 9, or 10 of the study for participants with priority tier of 1, 2, 3, 4, or 5, respectively, each participant's waiting time for the clinic visit following the exponential distribution with mean of 0.5 month.

Priority-independent crossover. Crossover starts at month 6 of the study for all participants, with the waiting time following the exponential distribution with mean of 0.5 month.

Note that crossover spreads over a longer time period under priority-dependent crossover than under priority-independent crossover.

We considered both blinded and unblinded crossover. At blinded crossover, placebo participants receive the vaccine and vaccine participants receive the placebo; none of them are aware of the order of their treatments. At unblinded crossover, participants are notified of their original treatment assignments, and placebo recipients are vaccinated. In both types of crossover, all participants are followed until the time of analysis, which is 10.5 months since trial initiation. To avoid bias due to behavioral confounding, we discarded the data collected after unblinded crossover.

As shown in Figure 1 , blood samples were scheduled to be drawn on Day 1, Day 22, Day 52 and Day 209 (during the first year) in the BNT162b2 phase 3 trial [1] . In our simulation, we allowed for small random deviations from the schedule. Blood samples were also drawn at the crossover visits. Because of staggered enrollment and staggered crossover, serum sampling points were scattered randomly over the study period, making it possible to estimate time-varying VE.

We also simulated a design under which there is no crossover before the time of analysis.

Without crossover, placebo participants stay on placebo longer than with crossover, providing more information about long-term placebo-controlled VE. However, because crossover is one of the serum sampling points, there are fewer sampling points and thus fewer antibody tests under no crossover than under crossover.

Naturally, VE h equals 0 at the first injection. We let VE h increase from 0 to 80% at 4 6 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted April 17, 2021. ; https://doi.org/10.1101/2021.04.16.21255614 doi: medRxiv preprint weeks and then either stay constant or decrease gradually over time. We refer to these two situations as constant VE and waning VE, respectively. (Note that constant versus waning VE pertains only to the period after the first 4 weeks, when VE is ramping up.)

In our first simulation scenario, we let VE h stay at 80% after 4 weeks, and we analyzed the resulting interval-censored data using the proposed method with a log hazard ratio that decreases linearly between weeks 0 and 4 and stays constant after week 4. For comparison, we fit the same model by treating the time of the first positive antibody test as a potentially right-censored event time and performing maximum partial likelihood estimation [5, 8] ; we refer to this approach as naive Cox regression. (Naive Cox regression estimates the same VE parameter using the same data as the proposed method, the only difference being that it converts interval-censored event times to potentially right-censored event times.) For further comparison, we implemented logistic regression by treating the seroconversion status at the last blood test before the 10.5 month mark (excluding the blood samples drawn after unblinded crossover) as a binary outcome and estimating the odds ratio of seroconversion between the vaccine and the placebo groups by the maximum likelihood estimator. Table 1 summarizes the results of these simulation studies. Using the proposed method, the VE estimates are unbiased, the standard errors are accurately estimated, and the confidence intervals have proper coverage probabilities. The standard error is lower under blinded than unblinded crossover, and lower under priority-dependent than priority-independent crossover. The standard error is slightly higher under no crossover than under blinded priority-dependent crossover. (Note that there are fewer sampling points under no crossover than under crossover.) In comparison, naive Cox regression may under-estimate or overestimate the true VE, and logistic regression always under-estimates the true VE.

In our second simulation scenario, we let VE h stay at 80% after 4 weeks or let it decrease to 0 at 1 year. We implemented the proposed method (for interval-censored data) using a piecewise linear function for the log hazard ratio, with a change point placed at 4 weeks and with the two slopes estimated from the data. For comparison, we also implemented naive Cox regression with the same piecewise linear function for the log hazard ratio. 7 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. Tables 2-3 show the simulation results on the estimation of VE a over successive time periods under constant VE and waning VE, respectively. The proposed method yields unbiased VE a estimates, with accurate standard error estimates and proper confidence intervals in virtually all cases. Naive Cox regression yields severely biased VE a estimates. Figure 2 displays the estimation results produced by the proposed method in one of the trials simulated under waning VE. The estimated VE h and VE a curves are close to the truth, and the 95% confidence intervals nearly cover the entire true curves. As expected, the confidence intervals are the narrowest under blinded priority-dependent crossover and the widest under unblinded priority-independent crossover. In addition, the confidence intervals for VE h are wider than the confidence intervals for VE a at the right tail.

We also conducted a series of simulation studies mimicking the Prevent COVID U study (Supplementary Appendix 2.2). A total of 12,000 participants enter the study at a constant rate over one month. Half of them are randomly selected to receive the Moderna vaccine at enrollment, and the other half get their first injection with a 4 month delay. We assumed a downward trend of infection over time; we adopted the VE patterns from the first series of simulation studies but placed the change point at 6 weeks instead of 4 weeks.

We explored various swabbing/RT-PCR testing schedules, ranging from every day to every 2 weeks. Each participant is followed for 4 months, and the study ends at month 5, when the last enrolled participant has been followed for 4 months. We also considered a scenario where 50% of the delayed-arm participants receive outside vaccines before the end of follow-up; in the analysis, we discarded the data collected after outside vaccination. Table 4 summarizes the simulation results on the estimation of constant VE. The proposed method yields unbiased VE estimates, with accurate standard error estimates and proper confidence intervals in all cases. The standard error of the VE estimate tends to increase a little bit as RT-PCR testing becomes less frequent. There is a slight loss of precision in the VE estimates when the delayed-arm participants are allowed to receive outside vaccines. In comparison, naive Cox regression shows highly similar results to the proposed method when RT-PCR testing is performed every day; however, as RT-PCR testing becomes 8 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted April 17, 2021. ; https://doi.org/10.1101/2021.04.16.21255614 doi: medRxiv preprint less frequent, naive Cox regression becomes more biased, with increasingly larger standard error than the proposed method. Excluding the events within the first 6 weeks substantially reduces the precision of VE estimates. (The substantial loss of precision is due to relatively high incidence in the first 6 weeks under decreasing background incidence over time and high VE. In the setting of constant background incidence with VE of 0.6 after 4 weeks, excluding the events within the first 4 weeks incurs about 13% loss of statistical efficiency.) Table 5 presents the results on the estimation of VE a over successive time periods when VE h decreases to 0 at 1 year. The proposed method provides unbiased VE a estimates, along with proper confidence intervals. Naive Cox regression performs well when RT-PCR testing is done daily but performs poorly when RT-PCR testing is infrequent.

We have assumed that VE ramps to an unknown peak level 4 weeks (or 6 weeks) after the first injection of the Pfizer/BioNTech vaccine (or the Moderna vaccine). We can allow uncertainty in this change point by including several change points in the analysis or by selecting the change point through the Akaike information criterion (AIC). We evaluated these strategies by extending the simulation studies reported in Table 1 Table 6 , the first method performs very well, although the standard error is higher than using a single change point. The second method correctly selects the change point with high probability.

We have demonstrated that it is possible to evaluate time-varying VE against SARS-CoV-2 infection using the blood samples collected in the ongoing phase 3 vaccine trials [1] [2] [3] [4] or using the nasal samples collected in studies like Prevent COVID U. We found that when 9 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted April 17, 2021. ; https://doi.org/10.1101/2021.04.16.21255614 doi: medRxiv preprint antibody or RT-PCR tests are performed infrequently, the use of standard Cox regression for potentially right-censored data yields biased and imprecise VE estimates. The new methods provide valid and efficient estimation of three useful VE measures.

The form of the model considered in this article is essentially the same as that used in our previous work on evaluating VE against symptomatic COVID-19 [7] . However, here the estimation approach is different because infection times are interval-censored rather than potentially right-censored. The proposed methodology is general enough to include potentially right-censored data as a special case and thus offers an alternative way to assess VE against symptomatic COVID-19. A major advantage of this new approach is that it provides a unified framework for studying constant versus waning VE.

Another important contribution of this work is a careful treatment of the ramping VE after initial vaccination. The prevailing approach is not to count the events that occur within 4-6 weeks of the first injection [1] [2] . Discarding the first 4-6 weeks of follow-up data causes considerable loss of statistical efficiency, as shown in Table 4 . In the case of blinded crossover, excluding the events that occur within 4-6 weeks of crossover will further reduce statistical efficiency, whereas including all the events will result in biased VE estimates.

We have not accounted for the measurement errors of antibody or RT-PCR tests in the analysis. The false-positive rate is negligible for RT-PCR testing and small for antibody

testing. An infected person is seropositive for a longer period of time than they are RT-PCR positive (several months versus several days or weeks) [6, [9] [10] . Thus, infrequent serology will capture more infections than infrequent RT-PCR. Some asymptomatic infections never seroconvert or have transient seroconversion that may be missed by infrequent serology [9] [10] . However, those who do not seroconvert tend to be less infectious than those who do, such that missed seronegative infections may be clinically less important. Likewise, an asymptomatic infection that is RT-PCR positive for just a day or two is difficult to detect but may have little public-health relevance.

For the Prevent COVID U study, the main reason for daily swabbing and testing is not to determine the timing of infection but rather to measure the full course of viral load for 10 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted April 17, 2021. ; https://doi.org/10.1101/2021.04.16.21255614 doi: medRxiv preprint all infected participants. In particular, investigators wish to capture potential infectiousness by measuring the peak viral load, the duration of viral shedding, and the area under the viral load curve. If detecting the presence of viral RNA were the study's only goal, then less frequent testing would be needed. The proposed methods (for interval-censored data) may be warranted in the case that a substantial number of swabs are not collected or are not usable (due to improper collection or storage).

Blood samples and nasal swabs provide complementary information about SARS-CoV-2 infection. Viral RNA can be detected sooner after infection than seroconversion, but antibody lasts longer than viral shedding [6, [9] [10] . There is considerable heterogeneity in the duration of both seropositivity and RT-PCR positivity, with the biggest driving factor being symptomatic versus asymptomatic infection [6, [9] [10] . Many studies collect both blood and nasal samples. For example, the Moderna phase 3 trial [2] performs RT-PCR testing at month 1 and at crossover in addition periodic serology. The Prevent COVID U study performs periodic (every 2 months) serology and frequent RT-PCR testing. The proposed methods can be applied to the two types of infection data separately or as a combined endpoint, depending on the objective of the analysis and the frequency of each type of test.

The monitoring times are assumed to be independent of the infection time (conditional on covariates). This assumption is satisfied for planned diagnostic tests but is unlikely to hold if SARS-CoV-2 infection is detected through symptom-prompted testing. We can apply the proposed methods (for interval-censored data) to planned tests and standard Cox regression with potentially right-censored data to symptom-prompted tests. If the planned RT-PCR testing is frequent, then the data from planned and symptom-prompted RT-PCR tests can be combined and standard Cox regression for potentially right-censored data can be adopted.

We have implemented the methods described in this article in an R package, which is available at https://dlin.web.unc.edu/software/idove/. 13 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. 14 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. 16 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. the mean of the standard error estimator, and CP denotes the coverage probability of the 95% confidence interval, when VE may reach its peak level at one of three change points.

Correct denotes the probability of correctly selecting the change point by the AIC.

18 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. 19 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted April 17, 2021. priority-independent (C') crossover: the black curve pertains to the true value, the red curve to the proposed estimate, and the green curves to the 95% confidence intervals. 20 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted April 17, 2021. ;

Let S denote the time when the participant is vaccinated, and T denote the time when the participant acquires SARS-CoV-2 infection (as defined by seroconversion or detectable viral RNA), with both times measured in days from the start of the clinical trial. In addition, let X denote baseline risk factors (e.g., age, occupation, race, health conditions). We specify that the hazard function of T is related to S and X through the Cox [1] regression model We can define the vaccine efficacy at day t as the proportionate reduction in the hazard rate of infection at day t for individuals who were vaccinated t days ago compared with those who have not been vaccinated, i.e., V E h (t) = 1 − e η(t) . In addition, we can define the t-day vaccine efficacy as the proportionate reduction in the attack rate or cumulative incidence of infection by day t for individuals who were vaccinated t days ago compared with the non-vaccinated individuals:

which is approximately 1 − t 0 e η(u) λ 0 (u)du/ t 0 λ 0 (u)du when the infection rate is low. If λ 0 (·) is approximately constant, then V E a (t) = 1 − V (t)/t, where V (t) = t 0 e η(u) du. Finally, we consider the vaccine efficacy in reducing the attack rate over a certain time period, say (t 1 , t 2 ]:

21 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. Clearly, all three VE measures are simple functions of the log hazard ratio η(·). For a clinical trial with a total of n participants, the data consist of (E i , L i , R i , S i , X i ) (i = 1, . . . , n). The likelihood takes the form

where Λ 0 (t) = t 0 λ 0 (s)ds. This likelihood involves two infinite-dimensional functions Λ 0 (·) and η(·), which are not identifiable if both are unrestricted. We let Λ 0 (·) be completely nonparametric and estimate it by a step function with non-negative jumps at the unique values of L i > 0 and R i < ∞ (i = 1, . . . , n). We approximate η(·) by a sequence of B-splines functions, denoted by

This is the interval-censored data likelihood for the standard Cox model with timeindependent covariates X and time-dependent covariates Z [2] , except that the event time is measured from the start of the study rather than the participant's entry time. We compute the nonparametric maximum likelihood estimator for (β, γ, Λ 0 ), denoted by ( β, γ, Λ 0 ), 22 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted April 17, 2021.

through an EM algorithm based on latent Poisson random variables [2] . We then estimate η(t) and V (t) by η(t) = K k=1 γ k B k (t) and V (t) = t 0 e η(u) du, respectively. By appealing to modern empirical process theory [3] , we can show that β, γ, and Λ 0 (·) are consistent. In addition, β and γ are asymptotically normal and their covariance matrix can be consistently estimated by the Hessian matrix [2, 4] of the profile log-likelihood for (β, γ), where the log-likelihood is maximized with respect to Λ 0 for fixed β and γ via the EM algorithm. These results allow us to estimate V E h (t), V E a (t), and V E a (t 1 , t 2 ), construct confidence intervals, and perform hypothesis testing.

Remark. In our previous work on (potentially right-censored) symptomatic disease, we approximate log λ 0 (·) by B-spline functions while letting η(·) be completely nonparametric [5] . With interval-censored data, a completely nonparametric function cannot be estimated at the parametric rate, making it difficult to construct confidence intervals. Thus, we let λ 0 (·) be completely nonparametric and approximate η(·), which is the parameter of main interest, by B-spline functions. One benefit of this approach is that it provides a unified framework to study constant versus time-varying VE (by choosing appropriate B-spline functions).

This framework also unifies the analysis of symptomatic disease and asymptomatic infection because potentially right-censored data can be treated as a special case of interval-censored data. For potentially right-censored data, we can adopt very flexible B-spline functions for η(·); for truly interval-censored data, we have to be more rigid unless the sample size is very large, the infection rate is high, or antibody/RT-PCR tests are performed frequently.

We designed the first series of simulation studies to mimic the BNT162b2 phase 3 trial [6] .

We assumed that 40,000 participants entered the study at a constant rate over four months, i.e., E ∼ Uniform(0, 4) months. (In the actual trial, the number of participants was slightly below 40,000, after exclusion of those who were seropositive at baseline.) We created a 23 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted April 17, 2021. ; https://doi.org/10.1101/2021.04.16.21255614 doi: medRxiv preprint composite baseline risk score X, which takes values 1, 2, 3, 4, and 5 with equal probability.

We randomly assigned half of the participants at study entry to vaccine and half to placebo.

We generated the infection time T from model (1) with β = 0.2 and

where t + = t if t > 0 and 0 otherwise. We assumed that V E h starts at 0 at t = 0, increases to some maximum value at t = t m , and then stays constant or decreases gradually over time. Serum samples were also drawn at the crossover visits. We considered:

Priority-dependent crossover: Crossover occurs at month (11 − X + G) of the study, where G follows the exponential distribution with mean of 0.5 month.

Priority-independent crossover: Crossover occurs at month 6 + G of the study, where G follows the exponential distribution with mean of 0.5 month.

We assumed that the analysis is performed at 10.5 months after the start of the study, such that only the blood samples that were drawn before the 10.5 month mark can be included. We considered both blinded and unblinded crossover designs. Under blinded crossover, participants receive the opposite of their original assignments, and we used all the data that are collected before the time of analysis. At the point of unblinded crossover, participants are notified of their original assignments, and placebo participants receive the vaccine; we disregarded any data collected after unblinded crossover in order to avoid bias due to behavioral confounding. 24 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. We applied the proposed methods to each simulated dataset by setting η(t) in model (1) to be piecewise linear with a change point placed at t m and with the slope after t m fixed at 0 or estimated from data. For comparison, we performed maximum partial likelihood estimation of the same model with the same data by treating the time of the first positive antibody test as a potentially right-censored event time. We also performed the logistic regression of the infection status at the last antibody test before the 10.5 month mark (excluding the blood samples drawn after unblinded crossover) on the randomization assignment and baseline covariates X, and we estimated VE by 1 minus the odds ratio of infection for vaccine versus placebo.

We conducted a second series of simulation studies to mimic the Prevent COVID U study.

In our simulation, a total of 12,000 participants enter the study at a constant rate over one month; half of them receive the Moderna vaccine at enrollment and the other half 4 months later. We generated the infection time T from model (1) without X and with log λ 0 (t) = −4.0 − 0.2t. We assumed the same VE patterns as in the first series of simulation studies, but the change point was set at 6 weeks instead of 4 weeks.

We investigated various swabbing/RT-PCR testing schedules, ranging from every day to every 2 weeks. All participants are followed for 4 months, and the study ends at month 5. In addition, we considered a scenario in which placebo participants may receive vaccines outside of the study 1 month after enrollment. We assumed that the time to outside vaccination follows the Weibull distribution with shape parameter of 3 and scale parameter of 4, such that the cumulative probability of outside vaccination is approximately 50%.

For each simulated dataset, we analyzed the data in the same way as in the first series of simulation studies. Specifically, we implemented both the proposed method and its rightcensored data counterpart, to be referred to as naive Cox regression. We discarded the data collected on placebo participants after they received outside vaccines. 25 All rights reserved. No reuse allowed without permission.

(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted April 17, 2021. ; https://doi.org/10.1101/2021.04.16.21255614 doi: medRxiv preprint

Safety and efficacy of the BNT162b2 mRNA Covid-19 vaccine

Efficacy and safety of the mRNA-1273 SARS-CoV-2 vaccine

Safety and efficacy of the ChA-dOx1 nCoV-19 vaccine (AZD1222) against SARS-CoV-2: an interim analysis of four randomised controlled trials in Brazil, South Africa, and the UK

FDA Briefing Document: Janssen Ad26.COV2.S vaccine for the prevention of COVID-19

The Statistical Analysis of Failure Time Data

Characteristics of viral shedding time in SARS-CoV-2 infections: A systematic review and meta-analysis

Evaluating the long-term efficacy of COVID-19 vaccines

Regression models and life-tables

Clinical and immunological assessment of asymptomatic SARS-CoV-2 infections

The duration, dynamics and determinants of SARS-CoV-2 antibody responses in individual healthcare workers

Regression models and life-tables

Maximum likelihood estimation for semiparametric transformation models with interval-censored data

Weak Convergence and Empirical Processes

Maximum likelihood estimation for semiparametric regression models with multivariate interval-censored data

Evaluating the long-term efficacy of COVID-19 vaccines

Safety and efficacy of the BNT162b2 mRNA Covid-19 vaccine

Acknowledgments: This work was supported by the NIH grants R01 AI029168, R01 GM124104, P01 CA142538, and UM1 AI068635.