key: cord-0001700-k53e8r3h authors: Walter, Roland B.; Othus, Megan; Burnett, Alan K.; Löwenberg, Bob; Kantarjian, Hagop M.; Ossenkoppele, Gert J.; Hills, Robert K.; Ravandi, Farhad; Pabst, Thomas; Evans, Anna; Pierce, Sherry R.; Vekemans, Marie-Christiane; Appelbaum, Frederick R.; Estey, Elihu H. title: Resistance Prediction in AML: Analysis of 4,601 Patients from MRC/NCRI, HOVON/SAKK, SWOG, and MD Anderson Cancer Center date: 2014-08-12 journal: Leukemia DOI: 10.1038/leu.2014.242 sha: 2e6b763befef8fc1df4868231c49171d57793e8a doc_id: 1700 cord_uid: k53e8r3h Therapeutic resistance remains the principal problem in acute myeloid leukemia (AML). We used area under receiver operator characteristic curves (AUC) to quantify our ability to predict therapeutic resistance in individual patients where AUC=1.0 denotes perfect prediction and AUC=0.5 denotes a coin flip, using data from 4,601 patients with newly diagnosed AML given induction therapy with 3+7 or more intense standard regimens in MRC/NCRI, HOVON, SWOG, and MD Anderson Cancer Center studies. Age, performance status, white blood cell count, secondary disease, cytogenetic risk, and FLT3-ITD/NPM1 mutation status were each independently associated with failure to achieve complete remission despite no early death (“primary refractoriness”). However, the AUC of a bootstrap-corrected multivariable model predicting this outcome was only 0.78, indicating only fair predictive ability. Removal of FLT3-ITD and NPM1 information only slightly decreased the AUC (0.76). Prediction of resistance, defined as primary refractoriness or short relapse-free survival (RFS), was even more difficult. Our ability to forecast resistance based on routinely available pre-treatment covariates provides a rationale for continued randomization between standard and new therapies and supports further examination of genetic and post-treatment data to optimize resistance prediction in AML. "Resistance", defined as either failure to achieve initial complete remission (CR) or as relapse from CR, remains the principal problem in adult acute myeloid leukemia (AML). [1] [2] [3] It is widely appreciated that the likelihood of resistance to therapy differs significantly between individuals; e.g., the complete remission (CR) rate following initial chemotherapy is 80-90% in some patients but below 30% in others. 4, 5 The ability to accurately forecast resistance would have considerable significance for the management of AML and the evaluation of new drugs. Specifically, the better our ability to predict resistance to standard therapy, the less need there might be for randomization of patients between such therapy and an investigational therapy. The lack of a need to randomize would address a seldom discussed but very real ethical conundrum: physicians caring for AML patients generally are not enthusiastic about randomizing patients with features associated with resistance (older age, complex cytogenetics, secondary AML, etc.) to standard therapy but are willing to do so because they believe that our ability to predict resistance to standard therapy is only fair and, thus, that randomization is necessary to properly evaluate a new therapy. We therefore set out to test this belief by quantifying our current ability to predict resistance. Here, it may be important to distinguish between association and prediction. It is well known that clinical, cytogenetic, and molecular characteristics are strongly associated with resistance, 1-3 as quantified by odds ratios, hazard ratios, and P-values, the standard measurements of association. 6 While valuable, these traditional measurements do not give insight into the degree to which given prognostic factors improve our ability to predict outcome in individual patients. This is the province of measures of prediction such as the area under receiver operating curve (AUC). 7 Conceptually, the closer we are to AUCs of 1.0, the less need there might be to randomize patients between standard therapy (i.e. therapy for which the outcome can be predicted) and a novel therapy. Herein, we use AUCs in conjunction with clinical, cytogenetic, and molecular data from adults with newly diagnosed AML treated on trials conducted by the U.K. Medical Research Council/National Cancer Research Institute (MRC/NCRI), the Dutch-Belgian Cooperative Trial Group for Hematology/Oncology and the Swiss Group for Clinical Cancer Research (HOVON/ SAKK), the U.S. cooperative group SWOG, and MD Anderson Cancer Center (MDA) to quantify our ability to predict therapeutic resistance using uni-and multiparameter models. The trials we analyzed primarily enrolled adults with newly diagnosed AML other than acute promyelocytic leukemia, as based on WHO 2008 classification criteria. 8 Specifically, we used data from patients who received curative-intent treatment on 6 MRC/NCRI trials from 1988-2010 (AML10, AML11, AML12, AML14, AML15, and AML16), 6 HOVON/ SAKK trials from 1987-2008 (HO04, HO04A, HO29, HO42A, HO42, and HO43), or 4 SWOG trials from 1992-2009 (S8931, S93333, S9500, and S0106), or received treatment on various protocols at MDA from 2000-2013. As expected, there were intergroup differences with regard to the type of therapeutic regimens used: for example, while induction treatments generally contained cytarabine and daunorubicin or idarubicin, therapy at MDA more often included cytarabine at >1 g/m 2 /dose, while in MRC/NCRI typically 10 rather than 7 days of cytarabine at 100-200 mg/m 2 /dose were used; moreover, HOVON/ SAKK post remission therapy often included amsacrine. Institutional review boards of participating institutions approved all protocols, and patients were treated according to the Declaration of Helsinki. Early death ("treatment-related mortality" [TRM]) was defined as death within 28 days after initiating therapy 9 or study registration, if exact date of initiation of therapy was unknown. Complete remission (CR) was conventionally defined as achievement of a morphologic leukemia-free state (bone marrow blasts <5%, absence of extramedullary disease) and recovery of peripheral blood counts (absolute neutrophil count >1,000/μL and platelet count >100,000/μL). 2, 10 Overall survival (OS) was defined as time from initiation of therapy or study registration (if date of initiation of therapy was unknown) to death, with censoring on the day patients were last known to be alive. For patients who achieved CR, relapse-free survival (RFS) was defined as time from achievement of remission until relapse or death from any cause, with patients not known to have relapsed or died being censored at last follow-up. Rather than settling on a single, arbitrary criterion to define therapeutic resistance, we used several criteria for resistance: (a) failure to attain CR despite surviving at least 28 days from beginning induction therapy ("primary refractory"); (b) primary refractory or RFS ≤3 months; (c) primary refractory or RFS ≤6 months; and (d) primary refractory or RFS ≤12 months. OS and RFS were estimated using the Chi-squared tests and the Kruskal-Wallis test were used to assess differences between categorical variables and median values of numeric variables across categories, respectively. We used logistic regression analyses to assess the relationship between individual covariates and various working definitions of therapeutic resistance, and then used the AUC to quantify a model's ability to predict therapeutic resistance; in this approach, an AUC of 1 indicates perfect prediction while an AUC of 0.5 indicates no prediction. It is natural to wonder whether some of the changes in AUC observed, for example from 0.77 to 0.78, are "statistically significant"; however, there is no rule permitting one to say whether a given change in AUC is statistically significant. Rather, this depends on effect size and patient numbers. Given the large number of models we evaluated and the even larger number of pairwise comparisons, we consider it more important to focus on the magnitude of AUC values and changes in these values (rather than on their statistical significance) using the commonly accepted criteria that AUCs of 0. 6-0.7, 0.7-0.8, and 0.8-0.9 indicate poor, fair, and good predictive ability, respectively. [12] [13] [14] An alternative way of interpreting AUC is as the proportion of patients who are correctly ranked. For example, a model with an AUC of 0.85 correctly ranks patients with higher risk has having a larger risk score 85% of the time, so 15% of the time the model incorrectly gives a person with higher risk a lower risk score than a person with lower risk. In this interpretation, the AUC is linear: a model with an AUC of 0.75 is a 25% improvement over a model with an AUC of 0.50. The following pre-treatment covariates were used in the regression modeling: age at study registration, performance status, gender, white blood cell (WBC) count, platelet count, bone marrow blast percentage, disease type (primary vs. secondary), cytogenetic risk, and FLT3-ITD as well as NPM1 mutational status. Missing cytogenetic risk was accounted for as separate category. The relative importance of predictors in the multivariable regression models was evaluated by the value of the partial Wald Chi-squared statistic minus the predictor's degrees of freedom. Bootstrapping, which has been demonstrated to be a more efficient method for assessing internal model accuracy than cross-validation or splitting data into two groups, 15, 16 was used to estimate bias-corrected values of AUC, and all reported AUCs are bootstrap-bias corrected. All analyses were performed using R (http://www.r-project.org). Our dataset included 4,955 predominantly adults with newly diagnosed AML treated on MRC/NCRI, HOVON/SAKK, or SWOG protocols or given treatment at MDA and in whom pre-treatment information (age, gender, performance status, WBC and platelet count, bone marrow blast percentage, secondary AML, FLT3-ITD and NPM1 mutational status) and outcome data were available. To better assess resistance following initial chemotherapy, we excluded patients who died within 28 days of initiation of induction chemotherapy (n=335) or were lost to follow-up before this time (n=19), leaving 4,601 patients (median age: 52 years) from our 4 treatment sites (MRC/NCRI [n=2, 615] , HOVON/SAKK n=1,134], SWOG [n=443], and MDA [n=409]). Patients' baseline characteristics are presented in Table 1 . As summarized in Table 2 , a CR to the initial 1-2 courses of induction chemotherapy was achieved in 3,613 (79%) of patients, whereas 988 (21%) were "primary refractory". 1,318/4,552 patients (29%) with sufficient follow-up time were either primary refractory or had a RFS of 3 months or less after CR achievement, 1,784/4,503 patients (40%) with sufficient follow-up time were either primary refractory or had a RFS of 6 months or less after CR achievement, and 2,547/4,457 patients (57%) with sufficient follow-up time were primary refractory or had a RFS of 12 months or less after CR achievement. Patients with primary refractory AML are widely considered those exhibiting the highest degree of resistance. Not surprisingly, but serving as a data reliability check, age (P<0.001), performance status (P=0.006), WBC (P<0.001), secondary disease (P<0.001), cytogenetic risk (favorable or intermediate vs. adverse; P<0.001 for each comparison), and NPM1 mutation as well as FLT3-ITD status (NPM1 neg /FLT3-ITD neg or NPM1 neg /FLT3-ITD pos vs. NPM1 pos /FLT3-ITD pos ; P<0.005 for each comparison) were independently associated with being primary refractory to induction chemotherapy when considering all patients ( Table 3) . As shown in Figure 1A , age was the most important individual covariate in predicting failure to achieve CR followed by cytogenetic risk and NPM1 and FLT3/ITD status, as indicated by values of the partial Wald Chi-squared statistic. Despite the strong associations between such covariates and primary refractory AML, AUCs validated by bootstrapadjustment suggested that the predictive ability was less impressive. Specifically, in univariate models, AUC values were 0.68, 0.66, and 0.60 for age, cytogenetic risk, and NPM1 and FLT3-ITD mutational status, A bootstrap-corrected multivariable model incorporating all covariates (age, performance status, WBC, platelet count, bone marrow blast percentage, gender, disease type [primary vs. secondary], cytogenetic risk, and FLT3-ITD as well as NPM1 mutational status) yielded an AUC of 0.78. Despite their strong associations with primary refractory AML, removal of FLT3-ITD and NPM1 from the model, while having a statistically significant effect on AUC, only decreased the AUC slightly (from 0.78 to 0.76; see Supplemental Figure 1 ). AUCs varied considerably among the 4 treatment sites: from 0.82/0.81 (with/without FLT3-ITD and NPM1 mutation information) to 0.69/0.66. We performed several additional analyses of prediction of primary refractory AML. First, because information on the mutational status of FLT3 and NPM1 is currently thought to be clinically most helpful to guide decision-making in patients with cytogenetically normal AML, we restricted our analysis to patients with normal karyotypes at diagnosis. Here, the multivariable models with or without NPM1 and FLT3-ITD information for the combined dataset yielded AUCs of 0.75 and 0.72, respectively, with AUCs for the individual treatment sites of 0.72-0.81 (with inclusion of FLT3-ITD and NPM1) and 0.63-0.80 (without inclusion of FLT3-ITD and NPM1). Thus, these AUC values were not higher than the values in the unrestricted dataset presented above, and only in 1 of the 4 individual cohorts did the inclusion of FLT3-ITD and NPM1 mutational information improve the prediction performance of the models compared to models incorporating all cytogenetic groups. In the U.S., selection of induction chemotherapy for newly diagnosed AML is commonly made before cytogenetic and mutational information is available. For a second additional analysis aimed at evaluating AUCs under these circumstances, we built multivariable models without inclusion of molecular or cytogenetic information. The accuracy of such models was significantly lower, as indicated by the AUC obtained in the entire patient cohort of 0.70, with AUCs for the individual 4 treatment sites ranging from 0.59 to 0.78. Third, we examined whether predictive accuracy was improved if we focused on relatively homogeneous subsets. However, an analysis limited to our 3,942 patients with de novo AML yielded bootstrap-corrected AUCs of only 0.75 and 0.73 for multivariable models with or without NPM1 and FLT3-ITD information, i.e. lower than in our entire study cohort (0.78 and 0.76). Similarly restricting attention separately to patients aged 60 years or younger (n=3,425) and those older than 60 years (n=1,176), AUCs again were not higher than those in the entire study cohort (0.76/0.72 [younger patients] and 0.73/0.73 [older patients] for multivariable models with or without NPM1 and FLT3-ITD information, respectively). Finally, because recent studies indicated that many patients can achieve CR with an identical second course of chemotherapy after a first course failed, 17 we restricted our analysis to patients declared primary refractory only after 2 courses of chemotherapy. Information on receipt of 1 vs. 2 cycles of induction therapy was only available in the SWOG cohort, in which 62 patients failed the first course of induction chemotherapy and did not receive reinduction on protocol. In a subset analysis on the remaining 381 patients, bootstrapcorrected AUCs were 0.66 and 0.64 for multivariable models with or without NPM1 and FLT3-ITD information, i.e. values that were lower than in the entire SWOG cohort (0.69 and 0.66, respectively). To assess whether the results were similar when we used other criteria for resistance, we performed additional analyses in which we considered resistance not only as primary refractory AML but also as an RFS of no more than 3, 6, or 12 months from date of first CR. As shown in Figures 1A-D, the relative importance of individual covariates changed slightly when the criterion for resistance was altered, with FLT3-ITD and NPM1 mutational status and cytogenetic risk gradually replacing age as the most important individual covariates. As the time over which prediction was attempted increased, predictive ability decreased. Specifically, when analyzing the entire study cohort, the AUCs for models predicting primary refractory disease or to have RFS of ≤3, ≤6, or ≤12 months were 0.75/0.74 (with/ without inclusion of FLT3-ITD and NPM1 mutational data; Table 4 and Supplemental Table 2 ), 0.76/0.73 (Table 5 and Supplemental Table 3 ), and 0.75/0.71 (Table 6 and Supplemental Table 4 ), respectively, vs. 0.78/0.76 when considering primary refractory AML alone. Although our previous data suggest that 28 days is a reasonable criterion for distinguishing patients who fail therapy because of TRM, 9 it is plausible that patients who die between days 14 and 28 had sufficient AML present at death that they would have been considered resistant had they lived beyond day 28. Hence, exclusion of patients who died before day 28 might affect conclusions. We therefore performed additional analyses in which we included the 7% of our patients (354/4955) who died before day 28 or were lost to follow-up before that time. As summarized in Table 7 , the conclusions were essentially unaffected by inclusion of these patients, with AUCs for models predicting primary refractory disease or to have RFS of ≤3, ≤6, or ≤12 months being 0.77, 0.75, 0.76, and 0.75, respectively, when mutational data on FLT3-ITD and NPM1 were included. One might suspect that the ability to predict resistance is higher in patients with high-risk or, conversely, favorable cytogenetics. To test this possibility, we performed separate analyses of resistance in favorable-risk, intermediate-risk, and unfavorable-risk cytogenetic groups. However, there was no evidence that AUCs differed in different cytogenetic risk groups regardless of whether FLT3-ITD and NPM1 data were included. Even in the unfavorablerisk group AUCs remained relatively low for models predicting primary refractory disease or to have RFS of ≤3, ≤6, or ≤12 months (0.64-0.65). AML patients fail to be cured primarily because of resistance to therapy. The dominant role of resistance has been noted previously for patients during induction 18 as well as while in remission. 19 Analyses from our patient cohort are consistent with these previous data in that only 7% of patients experienced TRM and 16% died while in CR. Accurate methods to predict resistance to standard therapy are needed if physicians are to reliably advise AML patients. Specifically, if the ability to predict resistance was sufficiently high, it would be difficult to recommend standard therapy to some patients and similarly difficult to recommend investigational therapies to others. Experience suggests that these recommendations are commonly made; however, they are made in the absence of quantitative measurements of predictive ability. While perhaps not surprisingly confirming that age, performance status, WBC, secondary disease, cytogenetic risk, and NPM1/FLT3-ITD mutational status are each strongly and independently associated (P<0.001) with resistance, our results suggest that our ability to predict resistance is much less impressive. Indeed, using standard criteria relating AUC to predictive ability, this ability is only "fair" and is little improved by inclusion of data on aberrations in FLT3 and NPM1; the same was true if attention was restricted to more homogeneous subsets such as younger patients or those with de novo AML. This finding, denoting only a fair ability to predict CR, is reminiscent of a recent study by Krug et al., who observed similar AUCs (0.72 and 0.68) with multivariable models in their study cohort, although their definition of primary refractory AML were slightly different. 20 While some clinicians might understandably feel that even fair predictive ability justifies a recommendation that some patients receive standard therapy and others investigational therapy, others might feel that the relatively low AUCs for resistance support randomization between standard and investigational therapies. Thus this study provides a rationale for continued randomization as the best means to accurately evaluate the comparative efficacies of new and standard therapies. Resistance is undoubtedly difficult to predict because AML is an extraordinarily complex and heterogeneous disease, whose diversity is only very partially captured by clinical parameters, cytogenetic data, and information about aberrations in NPM1 and FLT3, the covariates we examined. Mutations in many additional genes may carry prognostic significance, 21-25 the degree of which may depend on the co-existence of other alterations; 26 it would therefore be interesting to assess the degree to which AUCs could be improved by incorporating data from additional genetic or molecular analyses or pretreatment assays that test AML signaling pathways functionally. Moreover, different AML subclones may exhibit differential chemotherapy sensitivities, with resistance reflecting emergence of subclones whose importance cannot be determined at initial presentation. 27 Emergence of these clones will likely make it more difficult to forecast resistance as the period over which the forecast is attempted increases. Such increasing difficulty was seen as we moved from predicting primary resistance to predicting primary resistance and RFS. In contrast, our previous efforts to predict death within 28 days of beginning chemotherapy were more successful. 9 Integration of post-treatment data, e.g. early disease clearance in the case of primary refractory AML or achievement of CR vs. CRp/CRi and assessment of minimal residual disease, 28-33 in the case of RFS may improve prediction of resistance. While our data are derived from well-annotated patients treated with contemporary AML regimens, several limitations need to be acknowledged. First, our results may only apply to newly diagnosed AML treated with "3+7" or more intense regimens and may be influenced by allogeneic transplantation, about which we have only limited information. Second, rather than "harmonizing" cytogenetic risk categories across our study cohort, we used the cooperative study group's separate classification; however, recoding of the SWOG patient cohort to the refined MRC/NCRI classification 34 only slightly changed the AUC of a multivariable prediction model (change from 0.68 to 0.70) and additional categorization of patients with monosomal karyotype 35 as separate risk entity did not further improve the AUC (AUC=0.70). Third, we did not distinguish between relapse and death in CR as events contributing to short RFS, a criterion for resistance in some of our analyses. While patients who die in CR are not necessarily "resistant", previous studies indicated that patients who die in CR are much more similar to patients who relapse than those who remain alive in CR. 19 And fourth, we did not use an independent dataset to obtain an unbiased estimate of the AUC of any of the models presented in this paper. However, the goal of this paper was not to generate one predictive model; rather, our aim was to investigate how the predictive ability of a model changes as cytogenetics and molecular information is incorporated into the model, and to assess whether there were similar trends across separate cohorts. For this purpose, validation via bootstrapping 15, 16 provides a useful approach to assess the internal model accuracy. Of note, experience suggests that models do not perform as well in independent datasets as in the dataset used to derive them. Thus, the use of another dataset is unlikely to change our fundamental conclusion (i.e. that the usefulness of current prognostic factors for guiding treatment decisions is very likely overestimated), which was supported by separate analyses of data from SWOG, MDA, HOVON, and MRC. In conclusion, our ability to predict therapeutic resistance based on routinely available clinical covariates, even with inclusion of commonly used molecular data on FLT3 and NPM1, is relatively limited. This finding appears to have significant clinical consequences. Not infrequently, multivariate analysis is used to examine whether, after accounting for covariates such as cytogenetics, NPM1 and FLT3 status, a new treatment is superior to an older one. However if the predictive ability of these covariates is limited as our results indicate, reliance on historical controls to assess the new treatment is problematic. Thus, our results emphasize the continued importance of randomized treatment assignment for the testing of new drugs in AML. Importance of individual covariates to predict (A) failure to achieve CR with initial 1-2 courses of induction chemotherapy, (B) failure to achieve CR with initial 1-2 courses of induction chemotherapy or RFS ≤3 months, (C) failure to achieve CR with initial 1-2 courses of induction chemotherapy or RFS ≤6 months, and (D) failure to achieve CR with initial 1-2 courses of induction chemotherapy or RFS ≤12 months, using χ 2 values. 981 (38) 95 (8) 90 (21) 108 (26) 1,274 (28) Male Gender, n (%) 1,407 (54) 588 (52) 234 (53) 213 (52) 2,442 (53) 332 (13) 100 (9) 19 (4) 208 (51) 659 (14) Karyotype 1 , n (%) Favorable 259 (10) 142 (13) 66 (15) 50 (12) 517 (11) Intermediate 1,447 (55) 682 (60) 242 (55) 197 (48) 2,568 (56) Unfavorable 275 (11) 275 (24) 90 (20) 152 (37) 792 (17) Missing 634 (24) 35 (3) 45 (10) 10 (2) 724 (16) NPM1 Mutation 1 Yes 828 (32) 325 (29) 125 (28) 89 (22) 1,367 (30) No 1,787 (68) 809 (71) 318 (72) 320 (78) 3,234 (70) FLT3-ITD Mutation 1 Yes 582 (22) 279 (25) 98 (22) 77 (19) 1,036 (23) No 2,033 (78) 855 (75) 345 (78) 332 (81) 3,565 (77 (59) 692 (61) 268 (60) 279 (68) 2,792 (61) NPM1−, FLT3-ITD+ 234 (9) 117 (10) 50 (11) 41 (10) 442 (10) NPM1+, FLT3-ITD− 480 (18) 163 (14) 77 (17) 53 (13) 773 (17) NPM1+, FLT3-ITD+ 348 (13) 162 (14) 48 (11) 36 (9) 594 ( (7) 76 (17) 35 (9) 727 (16) 1 At diagnosis. Abbreviations: WBC, white blood cells. Acute myeloid leukaemia Diagnosis and management of acute myeloid leukemia in adults: recommendations from an international expert panel, on behalf of the European LeukemiaNet Acute myeloid leukaemia in adults Optimization of chemotherapy for younger patients with acute myeloid leukemia: results of the medical research council AML15 trial Monosomal karyotype in adult acute myeloid leukemia: prognostic impact and outcome after different treatment strategies Regression modeling strategies: with applications to linear models, logistic regression, and survival analysis The statistical evaluation of medical tests for classification and prediction WHO Classification of Tumours of Haematopoietic and Lymphoid Tissue. 4. IARC Prediction of early death after induction therapy for newly diagnosed acute myeloid leukemia with pretreatment risk scores: a novel paradigm for treatment assignment Revised recommendations of the International Working Group for Diagnosis, Standardization of Response Criteria, Treatment Outcomes, and Reporting Standards for Therapeutic Trials in Acute Myeloid Leukemia Nonparametric estimation from incomplete observations Can routine laboratory tests discriminate between severe acute respiratory syndrome and other causes of communityacquired pneumonia? Quantitative analysis of the color change after iodine staining for diagnosing esophageal high-grade intraepithelial neoplasia and invasive cancer The development and validation of two prediction models to identify employees at risk of high sickness absence Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors Development of a clinical prediction model for an ordinal outcome: the World Health Organization Multicentre Study of Clinical Signs and Etiological agents of Pneumonia Abbreviations: BM, bone marrow; CR, complete remission; WBC, white blood cells BM blasts Type of AML De novo (primary) 86-2.19) Author manuscript; available in PMC Abbreviations: BM, bone marrow; CR, complete remission; WBC, white blood cells BM blasts Type of AML De novo (primary) Unfavorable Author manuscript; available in PMC Research reported in this publication was supported by grants from the National Cancer Institute/National Institutes of Health (NCI/NIH; R21-CA182010 to R.B.W. and M.O., and R01-CA090998-09 to M.O.). SWOG trials were supported in part by the following PHS Cooperative Agreement grant numbers awarded by the NCI/NIH: U10-CA032102, U10-CA038926, and U10-CA105409. R.B.W. is a Leukemia & Lymphoma Society Scholar in Clinical Research. Walter et al.Page 19 Data is presented as odds ratio (95% CI). The model also contained a covariate "missing cytogenetics". Abbreviations: BM, bone marrow; CR, complete remission; WBC, white blood cells. 2,074 (79) 954 (84) 303 (68) 282 (69) 3,613 (79) Failure to achieve CR, n (%) 541 (21) 180 (16) 140 (32) 127 (31) 988 (21) 18 (18) (19) (20) 24 (20-33) 27 (23-42) 18 (16-24) 20 (19-22) 13 (13) (14) (15) 18 (15) (16) (17) (18) (19) (20) (21) (22) (23) (24) (25) (26) 17 (14) (15) (16) (17) (18) (19) (20) (21) (22) (23) (24) (25) (26) (27) 16 (13-26) 15 (14) (15) (16) Patients evaluable at 3 months, n (%) 2,573 (98) 1,134 (100) 443 (100) 402 (98) 4,552 (99) Failure to achieve CR or to have RFS ≤3 months, n (%) 720 (28) 283 (25) 159 (36) 156 (39) 1,318 (29) Patients evaluable at 6 months, n (%) 2,544 (97) 1,130 (99) 441 (100) 388 (95) 4,503 (98) Failure to achieve CR or to have RFS ≤6 months, n (%) 1,004 (39) 396 (35) 192 (44) 192 (49) 1,784 (40) Patients evaluable at 12 months, n (%) 2,523 (96) 1,124 (99) 439 (99) 371 (91) 4,457 (97) Failure to achieve CR or to have RFS ≤12 months, n (%) 1,471 (58) 580 (52) 259 (59) 237 (64) 2,547 (57) Abbreviations: CR, complete remission; OS, overall survival; RFS, relapse-free survival. Data is presented as odds ratio (95% CI). The model also contained a covariate "missing cytogenetics". Abbreviations: BM, bone marrow; CR, complete remission; WBC, white blood cells.