key: cord-0044957-ejk9wjr1 authors: Crilly, Colin J.; Haneuse, Sebastien; Litt, Jonathan S. title: Predicting the outcomes of preterm neonates beyond the neonatal intensive care unit: What are we missing? date: 2020-05-19 journal: Pediatr Res DOI: 10.1038/s41390-020-0968-5 sha: 1327e3a53273b298098892f88ce6388793aa703e doc_id: 44957 cord_uid: ejk9wjr1 ABSTRACT: Preterm infants are a population at high risk for mortality and adverse health outcomes. With recent improvements in survival to childhood, increasing attention is being paid to risk of long-term morbidity, specifically during childhood and young-adulthood. Although numerous tools for predicting the functional outcomes of preterm neonates have been developed in the past three decades, no studies have provided a comprehensive overview of these tools, along with their strengths and weaknesses. The purpose of this article is to provide an in-depth, narrative review of the current risk models available for predicting the functional outcomes of preterm neonates. A total of 32 studies describing 43 separate models were considered. We found that most studies used similar physiologic variables and standard regression techniques to develop models that primarily predict the risk of poor neurodevelopmental outcomes. With a recently expanded knowledge regarding the many factors that affect neurodevelopment and other important outcomes, as well as a better understanding of the limitations of traditional analytic methods, we argue that there is great room for improvement in creating risk prediction tools for preterm neonates. We also consider the ethical implications of utilizing these tools for clinical decision-making. IMPACT: Based on a literature review of risk prediction models for preterm neonates predicting functional outcomes, future models should aim for more consistent outcomes definitions, standardized assessment schedules and measurement tools, and consideration of risk beyond physiologic antecedents. 
Our review provides a comprehensive analysis and critique of risk prediction models developed for preterm neonates, specifically predicting functional outcomes instead of mortality, to reveal areas of improvement for future studies aiming to develop risk prediction tools for this population. To our knowledge, this is the first literature review and narrative analysis of risk prediction models for preterm neonates regarding their functional outcomes. Preterm infants have long been recognized as a population at high risk for mortality and adverse functional outcomes, including cerebral palsy and intellectual impairment. 1 As mortality rates for preterm neonates decline and more survive to childhood, 2,3 attention has increasingly turned towards measuring longer-term morbidities and related functional impairments during childhood and young-adulthood, as well as identifying risk factors related to these complications. 4, 5 While child-specific characteristics, such as gestational age, birth weight, and sex, are well established as predictors of adverse neurodevelopmental outcomes, 6-8 recent work has identified additional factors, including bronchopulmonary dysplasia and family socioeconomic status, that are correlated with relevant outcomes, such as poor neuromotor performance and low intelligence quotient at school age. 9 In clinical settings, the assessment of prognosis can vary widely across neonatologists, 10 making a valid and reliable predictive model for long-term outcomes a highly sought-after clinical tool. Moreover, predicting outcomes is vital when making decisions regarding which therapeutic interventions to apply, when providing critical data to parents for informed decision-making, and when matching infants with outpatient services to best meet their needs. 
In addition, prediction models are useful in evaluating Neonatal Intensive Care Unit (NICU) performance and allowing for between-center comparisons with proper adjustment for the severity of cases being treated. 11 Numerous prediction tools have been developed to quantify the risk of death for preterm neonates in the NICU setting, including the Score for Neonatal Acute Physiology (SNAP) and the Clinical Risk Index for Babies (CRIB). 12 The National Institute of Child Health and Human Development (NICHD) risk calculator, predicting survival with and without neurosensory impairment, is widely used to counsel families in the setting of threatened delivery at the edges of viability. 13 Furthermore, there are numerous other models that use clinical data from the NICU stay to predict risk for poor functional outcomes in infancy and school age. 14, 15 While several studies have categorized and evaluated the risk prediction models developed and validated in recent decades for mortality, 12, 16 no studies have compared and contrasted risk prediction models for non-mortality outcomes. Recently, Linsell et al. 17 published a systematic review of risk factor models for neurodevelopmental outcomes in children born very preterm or very low birth weight (VLBW). However, this review focused primarily on overall trends in model development and validation rather than a detailed consideration of individual models. In this article, we conduct an in-depth, narrative review of the current risk models available for predicting the functional outcomes of preterm neonates, evaluating their relative strengths and weaknesses in variable and outcome selection, and considering how risk model development and validation can be improved in the future. Towards this, we first provide an overview of the different risk models developed since 1990. 
We then frame our review of these models in terms of the outcomes predicted, the range of predictors considered, and the statistical methods used to select the variables included in the final model, as well as to assess the predictive performance of the model. Finally, the ethical implications of integrating risk stratification into standard clinical care for preterm neonates are considered.
We conducted a manual search for relevant literature via PubMed, entering combinations of key terms synonymous with "prediction tool," "preterm," and "functional outcome" and reading the abstracts of resulting studies (Table 1). Studies with abstracts that appeared related to our review were then read in full to identify prediction models that were eligible for inclusion. Reference lists of included studies were also reviewed, as were articles that later cited these original studies. Prediction tools were defined as multivariable risk factor analyses (>2 variables) aiming to predict the probability of developing functional outcomes beyond 6 months corrected age. Models that solely investigated associations between individual risk factors and outcomes were excluded, as were models that were not evaluated for predictive ability in terms of either a validation study or an assessment for performance, discrimination, or calibration. Tests used to evaluate a model's overall performance were R², adjusted R², and the Brier score. The use of a receiver operating characteristic (ROC) curve or a C-index evaluated a model's discrimination, and the Hosmer-Lemeshow test was considered to evaluate a model's calibration. 18 Preterm neonates were defined as <37 weeks of completed gestational age. Models with VLBW neonates <1500 g were also included, since in the past birth weight served as a substitute for measuring prematurity when gestational age could not be accurately determined.
Models were excluded if they used a cohort entirely composed of infants born prior to 1 January 1990; those born after 1990 were likely to have had surfactant therapy available in the event of respiratory distress syndrome, which significantly reduced the morbidity and mortality rates among preterm neonates nationwide. 19, 20 Models were also excluded if they limited their prediction to the outcome of survival, if they incorporated variables measured after initial NICU discharge, or if they included subjects who were not necessarily transferred to a NICU for further care following delivery. Finally, we excluded tools that only predicted outcomes to an age of <6 months corrected age, as well as case reports, narrative reviews, and tools reported in languages other than English.
Overview of risk prediction models
Table 2 lists all 32 studies with risk prediction models that meet the inclusion and exclusion criteria. [13] [14] [15] From these, a total of 43 distinct models were reported.
From mortality to neurodevelopmental impairment
Since 1990, several mortality prediction tools have been evaluated with regard to their ability to predict the likelihood of neurodevelopmental impairment (NDI) among neonates surviving to NICU discharge. One such model is the CRIB, which incorporates six physiologic variables collected within the first 12 h of the preterm infant's life: birth weight, gestational age, presence of congenital malformations, maximum base excess, and minimum and maximum FiO 2 requirement. 50 Fowlie et al. 24 evaluated how CRIB models obtained at differing time periods over the first 7 days of life predicted severe disability among a group of infants born >31 weeks gestational age or VLBW. In another study, Fowlie et al. 25 incorporated cranial ultrasound findings on day of life 3 along with CRIB scores between 48 and 72 h of life into their prediction model.
Subsequent studies analyzed the CRIB in its original 12-h form and, with only one exception, 23 determined that it was not a useful tool for predicting long-term NDI or other morbidities. [26] [27] [28] [29] A second example is the SNAP score. 51 SNAP uses 28 physiologic parameters collected over the first 24 h of life to predict survival to NICU discharge, and was modified to predict NDI at 1 year and 2-3 years of age. A subsequent assessment of both the SNAP and the SNAP with Perinatal Extension 42 showed a poor predictive value for morbidity at 4 years of age for children born VLBW and/or with gestational age ≤31 weeks. 28 Finally, the Neonatal Therapeutic Intervention Scoring System, a comprehensive exam-based prediction tool for mortality, 52 was found to have a poor predictive value for adverse outcomes at 4 years of age in children born very preterm or VLBW. 28 Shortened forms of the early physiology-based scoring systems were developed and assessed for their ability to predict outcomes in childhood. Application of the CRIB-II on a small cohort (n = 107) of infants born <1250 g predicted significant NDI at 3 years of age. 39 However, a subsequent evaluation in a much larger cohort (n = 1328) of preterm infants <29 weeks gestational age concluded that the CRIB-II did no better than gestational age or birth weight alone in predicting moderate to severe functional disability at 2-3 years of age. 40 Studies have supported an association between the SNAP-II and SNAPPE-II scores and neurodevelopmental outcomes and small head circumference at 24 months corrected age. High SNAP-II scores were shown to correlate with adverse neurological, cognitive, and behavioral outcomes up to 10 years of age within a large cohort (n = 874) of children born very preterm. 43
Antenatal risk factors
Several groups have used data from the NICHD's Neonatal Research Network (NRN) to design and test various risk prediction models for extremely low birth weight (ELBW) newborns.
One of the most widely used risk prediction tools developed from this cohort was by Tyson et al., 13
Postnatal morbidity
A large cohort study (n = 910) from Schmidt et al. 15, 32 used data from ELBW neonates 500-999 g enrolled in the international Trial of Indomethacin Prophylaxis in Preterms (TIPP). They found that the presence of three morbidities at 36 weeks post-menstrual age (bronchopulmonary dysplasia, serious brain injury, and severe retinopathy of prematurity) had a significant and additive effect on the risk for death or poor neurologic outcome at 18 months corrected age. They developed a model from this relationship that has been corroborated in two studies with smaller samples and by Schmidt et al. 15 in a separate, large cohort in which the definition of poor outcome was expanded from solely NDI to "poor general health." 33, 34
Letting the machines decide
Some innovative work has been recently performed by Ambalavanan et al. 14, 35 in creating several risk prediction models. 45 Along with studies developing risk prediction tools with data from the NRN and the TIPP to predict the outcomes of death and NDI or solely NDI, the group made the only risk prediction tool for the outcome of rehospitalization, both general and specifically for respiratory complications, using a combination of physiologic and socioeconomic variables incorporated into a decision tree approach. They have also been the only group to create neural network-trained models, using the same small cohort to predict major handicap, low mental development index (MDI), or low psychomotor development index (PDI). The advantage of using neural networks, algorithms that can "learn" mathematical relationships between a series of independent variables and a set of outcomes, is the ability to model complex or nonlinear relationships that can be elucidated by the model without having to consider these relationships a priori (as is typically required when using multiple regression models).
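The contrast between regression models that need relationships specified a priori and more flexible learners can be illustrated with a small sketch. The cell risks below are hypothetical numbers chosen for illustration, not data from any study in this review, and the factors `a` and `b` are placeholders for any two binary risk factors. The sketch works on the risk scale with a balanced 2x2 table, which is an assumption made to keep the arithmetic in closed form; a logistic model would behave similarly but would need iterative fitting.

```python
from itertools import product

# Hypothetical risks of a poor outcome for two binary factors (a, b).
# Illustrative numbers only: risk is elevated only when both are present.
cell_risk = {(0, 0): 0.10, (1, 0): 0.10, (0, 1): 0.10, (1, 1): 0.50}
cells = list(product((0, 1), repeat=2))


def additive_fit(risk):
    """Least-squares main-effects fit for a balanced 2x2 table.

    With equal weight per cell, the fitted value is
    (mean over b for this a) + (mean over a for this b) - (grand mean).
    """
    grand = sum(risk.values()) / 4
    mean_a = {a: (risk[(a, 0)] + risk[(a, 1)]) / 2 for a in (0, 1)}
    mean_b = {b: (risk[(0, b)] + risk[(1, b)]) / 2 for b in (0, 1)}
    return {(a, b): mean_a[a] + mean_b[b] - grand for a, b in cells}


def interaction_fit(risk):
    """Same model plus an a*b interaction term (a saturated 2x2 model)."""
    base = risk[(0, 0)]
    eff_a = risk[(1, 0)] - base
    eff_b = risk[(0, 1)] - base
    eff_ab = risk[(1, 1)] - base - eff_a - eff_b
    return {(a, b): base + eff_a * a + eff_b * b + eff_ab * a * b
            for a, b in cells}


def max_abs_error(fit, risk):
    """Largest absolute gap between fitted and true cell risk."""
    return max(abs(fit[c] - risk[c]) for c in risk)


additive_error = max_abs_error(additive_fit(cell_risk), cell_risk)
interaction_error = max_abs_error(interaction_fit(cell_risk), cell_risk)
print(f"additive model, worst cell error: {additive_error:.2f}")
print(f"with interaction, worst cell error: {interaction_error:.2f}")
```

In this toy setting the main-effects model misses every cell because the joint effect was never specified, while the model with the interaction term reproduces the table. A classification tree or neural network is one way to find such a term without the analyst naming it in advance, although that flexibility does not by itself guarantee better prediction in real cohorts.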
Despite the use of innovative approaches, however, none of these models differed from other studies in predictive strength or even had high predictive efficacy. 31
Limitations of prior approaches
The above literature review highlights the substantial interest in developing a clinically useful risk prediction model and the limits of efforts to date. Notwithstanding their differing inclusion and exclusion criteria, existing risk prediction models are relatively similar in terms of variables selected, outcomes analyzed, and statistical strategies employed. With few exceptions, the limitations of existing risk prediction models are especially apparent in their reliance on solely biologic variables and traditional analytic methods ill-equipped to handle the statistical complexity necessary for risk modeling.
Identifying important outcomes. The majority of risk prediction models defined NDI as their primary outcome of interest. Making a determination of impairment often relies on standardized measures of cognition in concert with neurosensory deficits. Yet, researchers often define NDI in different ways, making between-study comparisons difficult. NDI is a construct relating to global abilities encompassing cognition, language, motor function, and vision and hearing. While the tools used to identify NDI are often also used to make diagnoses of developmental delay, NDI is not a clinical term or diagnosis in and of itself. Many of the remaining studies also predicted functional outcomes, such as academic performance, executive function, language ability, and autism spectrum disorder (ASD). These outcomes may be more meaningful to parents and providers than NDI. 54 To date, only four studies have considered outcomes unrelated to neurodevelopment, such as impaired pulmonary function, "poor general health," and rehospitalization rates. 
15, 28, 45, 49 While the emphasis on NDI is unsurprising given the high-risk population, moderate to severe NDI only affects a minority of the preterm population. 55, 56 Studies have revealed numerous additional adverse outcomes that preterm individuals are more likely to experience compared to their full-term counterparts, such as impaired respiratory, cardiovascular, and metabolic function. [57] [58] [59] [60] [61] [62] [63] [64] [65] [66] Neurodevelopment has been linked to chronic health problems in later childhood. 67 Limiting risk prediction to moderate to severe NDI therefore ignores other, more common complications that preterm infants are likely to face that have an impact on neurodevelopment. This represents a missed opportunity for researchers to better understand what variables influence the likelihood that these problems occur. The impact of developmental disability on the child and family is completely absent from current risk models. Health-related quality of life (HRQL), which distinguishes itself as a personal rather than third-party valuation of a patient's physical and emotional well-being, is being increasingly appreciated as an important metric necessary to fully understand the impact of prematurity. 68 In a French national survey, the majority of neonatologists, obstetricians, and pediatric neurologists stated that predicting HRQL in the long term for preterm infants would be beneficial for consulting parents about what additional responsibilities they can anticipate in caring for their child. 69 The trajectory of HRQL from childhood to young-adulthood appears to improve in both VLBW and extremely low gestational age populations. 70 Prediction modeling might aid in determining which factors could positively or negatively impact HRQL in this vulnerable population. Finally, we must consider the age at which outcomes are being predicted. 
It is evident that lower gestational age is associated with higher rates of NDI and lower academic achievement in adolescence. 71, 72 However, the vast majority of risk prediction models assessed outcomes at the age of 3 years or less, with only three studies doing so at 10 years of age or above. Although early childhood outcomes may give clues about later development, many problems do not manifest until later in childhood, such as learning disabilities and certain psychiatric disorders. Developmental disability severity can fluctuate throughout childhood, with catch-up occurring in early preterm children and worsening delay in some moderate and late preterm children. 73, 74 Although cohorts of preterm infants are not usually followed for more than several years, likely due to lack of resources and expense, recent studies have used data from national registries to link neonatal clinical data to sampled adults, providing evidence of increased rates of adverse neurodevelopmental, behavioral, and educational outcomes among adults born preterm. 75, 76 Opportunities are therefore available to use long-term data to extend risk prediction models beyond the first few years of life.
Variable selection. Most of the risk models reviewed relied primarily on physiologic and clinical measures obtained during the NICU stay. While an emphasis on biologic risk factors is clearly reasonable given the known associations between perinatal morbidities and long-term outcomes, there is strong evidence in the literature suggesting associations between sociodemographic factors like parental race, education, and age, and outcomes such as cognitive impairment, cerebral palsy, and mental health disorders in children born preterm. 
More specific socioeconomic variables such as lower parental education, maternal income, insurance status, foreign country of birth by a parent, and socioeconomic status as defined by the Elley-Irving Socioeconomic Index have been repeatedly correlated with reduced mental development index, psychomotor development index, intelligence quotient, and social competence throughout childhood. 71, 72, [77] [78] [79] [80] [81] [82] The geographic area in which preterm neonates are raised could also have a profound influence on their development. Neighborhood poverty rate, high school dropout rate, and place of residence (metropolitan vs. non-metropolitan) have all been correlated with academic skills and rate of mental health disorders among low birth weight children. 83, 84 Only 12 of the 43 models reviewed included socioeconomic variables. This may be due, at least in part, to the difficulty in obtaining social, economic, and demographic data; these variables are often not collected upon hospital admission. Additionally, socioeconomic information is often poorly, inaccurately, and variably recorded or is largely missing. 85 Some risk prediction models collected socioeconomic variables at the follow-up visit when outcomes were assessed. This is an imperfect method given that factors such as household setting and family income may change substantially in the years following NICU discharge and affect children's health. 86, 87 In some models, socioeconomic variables were not included because they did not significantly improve the model's predictive ability. 45 Testing the effects of social factors on infant and child outcomes requires samples that are socially and economically diverse. Even large, diverse study populations may become more homogeneous over time, as subjects of lower socioeconomic status and non-white race are more likely to drop out of studies dependent on long-term follow-up. 
41 Moreover, treating socioeconomic variables as statistically independent factors rather than as interrelated ones might understate the impact of contextual information on neurodevelopmental outcomes.
Model development. Of the 32 papers included in the review, 12 reported on de novo risk prediction tools. The other 20 studies either evaluated a previous model or adjusted a prior model by changing the times at which data were collected or by adding variables. The approach to prediction tool development was almost uniform among the studies, with nine of the models solely using regression techniques to select variables. Ambalavanan et al. deviated from this method in three separate studies: two using classification tree analysis, 35, 45 and one using a four-layer back-propagation neural network. 31 Each new model, with the exception of the neural network-based model by Ambalavanan et al., 35, 45 depended on an approach in which individual variables were selected and treated as independent of one another as they were analyzed in their ability to predict the outcome of interest. Yet, variables may, in fact, not act independently. While parsing the roles of potential interrelationships may be computationally onerous and treating them independently may lead to a more parsimonious model, this may be at the expense of accuracy. Alternative computational approaches are needed to account for the differential likelihoods of certain outcomes on the causal pathway from preterm birth to later childhood outcome. Nonlinear statistical tools should be further utilized in risk prediction model development to examine the relationships between variables and outcomes of interest. Machine learning, for instance, is a method of inputting a group of variables and generating a predictive model without making assumptions of independence between the factors or that specific factors would contribute the most to the model. 
88 Different forms of machine learning have already been employed in NICUs to extract the most important variables for predicting outcomes such as days to discharge. 89
The non-independence of risk factors is also complicated by the role of time in models of human health and development. The lifecourse framework describes how an accumulation or "chains" of risk experienced over time and at certain critical periods impact later health outcomes. 90 In the context of preterm birth, the risk of being born early is not uniform across populations and is dependent on a given set of maternal risks. In turn, the degree of prematurity imparts differential risk for developing complications such as bronchopulmonary dysplasia, necrotizing enterocolitis, or retinopathy of prematurity. These morbidities then, in turn, increase risks for further medical and developmental impairment. These time-varying probabilities can be modeled and incorporated into prediction tools to more accurately capture the longitudinal and varying relationships between exposures and outcomes and thereby improve estimates of risk. [91] [92] [93]
A final methodological concern regarding model development is whether and how the competing risk of death is considered when the outcome being predicted is non-terminal. Consider, for example, the task of developing a model for the risk of NDI at 10 years of age. How one handles death can have a dramatic effect on the model, especially since mortality is relatively high among preterm infants. Moreover, if death is treated simply as a censoring mechanism, as it is often done in time-to-event analyses such as those based on the Cox model, then the overall risk of NDI will be artificially reduced; those children who die before being diagnosed with NDI will be viewed as remaining at risk even though they cannot possibly be subsequently diagnosed with NDI. 
While an alternative to this would be to use a composite outcome of the time to the first of NDI or death, doing so may result in a model that is unable to predict either event well. Instead, one promising avenue is to frame the development of a prediction model for NDI within the semi-competing risks paradigm. 94, 95 Briefly, semi-competing risks refer to settings where one event is a competing risk for the other, but not vice versa. This is distinct from standard competing risks, where each event is a competing risk for the other (e.g., death due to one cause or another). To the best of our knowledge, however, semi-competing risks have not been applied to the study of long-term outcomes among preterm infants.
Model evaluation. Waljee et al. 18 provide a summary of methods for assessing the performance of a predictive model, categorizing them into three types: overall model performance, which focuses on the extent of variation in risk explained by the model; calibration, which assesses differences between observed and predicted event rates; and discrimination, which assesses the ability to distinguish between patients who do and do not experience the outcome of interest. The majority of studies in our review assessed their models with ROC curve analysis, a method of assessing discrimination. Although ROC-based assessment is widely used, there is some debate about it, specifically regarding its lack of sensitivity in assessing differences between good predictive models. 96 Although several novel performance measures for comparing discrimination among models have been proposed, none have been employed in the context of comparing risk prediction tools for preterm neonates. 97, 98 Few studies employed analyses other than ROC. Only six in our review assessed overall performance with R² or partial R², and five evaluated calibration using the Hosmer-Lemeshow test. 
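The three categories of assessment described above (overall performance, discrimination, and calibration) can be made concrete with a short sketch. The function names and the toy outcome and risk vectors below are illustrative assumptions, not taken from any reviewed study. The Hosmer-Lemeshow function returns only the chi-square statistic, which would conventionally be compared against a chi-square distribution with (groups - 2) degrees of freedom.

```python
from math import fsum


def brier_score(y, p):
    """Overall performance: mean squared gap between predicted risk and outcome."""
    return fsum((pi - yi) ** 2 for yi, pi in zip(y, p)) / len(y)


def c_statistic(y, p):
    """Discrimination: share of case/non-case pairs in which the case has the
    higher predicted risk (ties count one half). Equals the area under the ROC curve."""
    cases = [pi for yi, pi in zip(y, p) if yi == 1]
    controls = [pi for yi, pi in zip(y, p) if yi == 0]
    concordant = 0.0
    for pc in cases:
        for pn in controls:
            if pc > pn:
                concordant += 1.0
            elif pc == pn:
                concordant += 0.5
    return concordant / (len(cases) * len(controls))


def hosmer_lemeshow(y, p, groups=10):
    """Calibration: Hosmer-Lemeshow chi-square statistic over risk-ordered groups."""
    order = sorted(range(len(y)), key=lambda i: p[i])
    n = len(order)
    stat = 0.0
    for g in range(groups):
        idx = order[g * n // groups:(g + 1) * n // groups]
        if not idx:
            continue
        observed = sum(y[i] for i in idx)       # events seen in this group
        expected = fsum(p[i] for i in idx)      # events the model predicts
        denom = expected * (1 - expected / len(idx))
        if denom > 0:
            stat += (observed - expected) ** 2 / denom
    return stat


# Toy data (hypothetical): 1 = outcome occurred, p = model-predicted risk.
y = [0, 0, 1, 0, 1, 1, 0, 1]
p = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]

print("Brier score:", round(brier_score(y, p), 3))
print("C-statistic:", round(c_statistic(y, p), 3))
print("Hosmer-Lemeshow statistic (2 groups):", round(hosmer_lemeshow(y, p, groups=2), 3))
```

The three numbers answer different questions, which is why reporting discrimination alone leaves a model only partly evaluated: a model can rank infants well and still predict risks that are systematically too high or too low.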
Another four studies assessed internal validation with either an internal validation set or bootstrapping techniques. 99 There were nine studies meeting inclusion criteria solely because they had models that were externally validated via other studies. Schmidt et al. 32 reported odds ratio associations for their 3-morbidity model, which are not a reliable method of determining the strength of risk prediction tools. 100 Future risk model assessments for preterm neonates should at minimum include an ROC curve analysis, although assessments of overall performance and calibration would also be helpful. Validation with a different sample from the development set is also advised, ideally with a population outside the original cohort. 18
CONCLUSION
Risk assessment and outcomes prediction are valuable tools in medical decision-making. Fortunately, infants born prematurely enjoy an ever-increasing likelihood of survival. Research over the past several decades has highlighted the many influences, physiologic and psychosocial, affecting neurodevelopment, HRQL, and health services utilization. Yet, the wealth of knowledge gained from longitudinal studies of growth and development is not reflected in current risk prediction models. Moreover, some of the most well-known and widely used tools today, such as Tyson et al.'s 13 five-factor model, were developed nearly two decades ago. As advances in neonatal intensive care progressively reduce the risk of certain outcomes, it is clear that these older models require updating if they are to be of continued clinical use. It should be recognized that there are potential ethical ramifications to incorporating more psychosocial factors and outcomes into risk prediction models, such as crossing the line from risk stratification to "profiling" patients and offering different treatment decisions based on race or class. 
101 However, physician predictions without the aid of prediction tools are highly inconsistent during counseling at the margins of viability, and further research is needed regarding the level of influence that physicians actually have on caregiver decision-making during counseling, as well as the extent to which risk prediction tools would change their approach to counseling. 10 In addition, despite recent innovation in statistical approaches to risk modeling, such as machine learning, most prediction tools rely on standard regression techniques. Insofar as risk prediction models will continue to be developed for preterm neonatal care, making use of the clinical data available in most modern electronic health records and taking into consideration the analytic challenges related to unequal prior probabilities of exposures, non-independence of variables, and semi-competing risk can only strengthen our approach to predicting outcomes. We therefore recommend taking a broader view of risk, incorporating these concepts in creating stronger risk prediction tools that can ultimately serve to benefit the long-term care of preterm neonates.
C.J.C. and J.S.L. designed and carried out this literature review. C.J.C., J.S.L., and S.H. worked jointly in the analysis and interpretation of the literature review results, as well as the drafting and revision of this article. All three authors gave final approval of the version to be published.
On the influence of abnormal parturition, difficult labours, premature birth, and asphyxia neonatorum, on the mental and physical condition of the child, especially in relation to deformities Trends in care practices, morbidity, and mortality of extremely preterm neonates Survival of infants born at periviable gestational ages Outcomes of preterm infants: morbidity replaces mortality Institute of Medicine Committee on Understanding Premature Birth and Assuring Healthy Outcomes. 
Preterm Birth: Causes, Consequences, and Prevention Influence of birth weight, sex, and plurality on neonatal loss in United States Preterm neonatal morbidity and mortality by gestational age: a contemporary cohort Gestational age and birthweight for risk assessment of neurodevelopmental impairment or death in extremely preterm infants Neurodevelopmental outcome at 5 years of age of a national cohort of extremely low birth weight infants who were born in 1996-1997 Comparing neonatal morbidity and mortality estimates across specialty in periviable counseling Prognosis and prognostic research: what, why, and how? Neonatal disease severity scoring systems Intensive care for extreme prematurity-moving beyond gestational age Outcome trajectories in extremely preterm infants Prediction of late death or disability at age 5 years using a count of 3 neonatal morbidities in very low birth weight infants Prediction of mortality in very premature infants: a systematic review of prediction models Risk factor models for neurodevelopmental outcomes in children born very preterm or with very low birth weight: a systematic review of methodology and reporting A primer on predictive models Pulmonary surfactant therapy The future of exogenous surfactant therapy Nursery Neurobiologic Risk Score and outcome at 18 months Evaluation of the ability of neurobiological, neurodevelopmental and socio-economic variables to predict cognitive outcome in premature infants. 
Child Care Health Dev Increased survival and deteriorating developmental outcome in 23 to 25 week old gestation infants, 1990-4 compared with 1984-9 Measurement properties of the Clinical Risk Index for Babies-reliabilty, validity beyond the first 12 hours, and responsiveness over 7 days Predicting outcome in very low birthweight infants using an objective measure of illness severity and cranial ultrasound scanning Is the CRIB score (Clinical Risk Index for babies) a valid tool in predicting neurodevelopmental outcome in extremely low birth weight infants? The CRIB (Clinical Risk Index for Babies) score and neurodevelopmental impairment at one year corrected age in very low birth weight infants Can severity-of-illness indices for neonatal intensive care predict outcome at 4 years of age? Neurodevelopment of children born very preterm and free of severe disabilities: the Nord-Pas de Calais Epipage cohort study Chronic physiologic instability is associated with neurodevelopmental morbidity at one and two years in extremely premature infants Prediction of neurologic morbidity in extremely low birth weight infants Impact of bronchopulmonary dysplasia, brain injury, and severe retinopathy on the outcome of extremely low-birth-weight infants at 18 months: results from the trial of indomethacin prophylaxis in preterms Impact at age 11 years of major neonatal morbidities in children born extremely preterm Effect of severe neonatal morbidities on long term outcome in extremely low birthweight infants Early prediction of poor outcome in extremely low birth weight infants by classification tree analysis Consequences and risks of <1000-g birth weight for neuropsychological skills, achievement, and adaptive functioning Clinical data predict neurodevelopmental outcome better than head ultrasound in extremely low birth weight infants Infant outcomes after periviable birth; external validation of the 
neonatal research network estimator with the BEAM Trial Clinical Risk Index for Babies score for the prediction of neurodevelopmental outcomes at 3 years of age in infants of very low birthweight NSW and ACT Neonatal Intensive Care Units Audit Group. Can the early condition at admission of a high-risk infant aid in the prediction of mortality and poor neurodevelopmental outcome? A population study in Australia Autism spectrum disorders in extremely preterm children SNAP-II and SNAPPE-II and the risk of structural and functional brain disorders in extremely low gestational age newborns: the ELGAN study Early postnatal illness severity scores predict neurodevelopmental impairments at 10 years of age in children born extremely preterm High prevalence/low severity language delay in preschool children born very preterm Identification of extremely premature infants at high risk of rehospitalization Screening for autism spectrum disorders in extremely preterm infants Perinatal risk factors for neurocognitive impairments in preschool children born very preterm Correlation between initial neonatal and early childhood outcomes following preterm birth Bronchopulmonary dysplasia and perinatal characteristics predict 1-year respiratory outcomes in newborns born at extremely low gestational age: a prospective cohort study The International Neonatal Network. 
The CRIB (Clinical Risk Index for Babies) score: a tool for assessing initial neonatal risk and comparing performance of neonatal intensive care units Score for neonatal acute physiology: a physiologic severity index for neonatal intensive care Neonatal therapeutic intervention scoring system: a therapy-based severity-of-illness index Prediction of death for extremely premature infants in a population-based cohort Parental perspectives regarding outcomes of very preterm infants: toward a balanced approach Risk of developmental delay increases exponentially as gestational age of preterm infants decreases: a cohort study at age 4 years Preterm birth-associated neurodevelopmental impairment estimates at regional and global levels for 2010 Late respiratory outcomes after preterm birth Respiratory health in pre-school and school age children following extremely preterm birth Preterm delivery and asthma: a systematic review and metaanalysis Preterm birth, infant weight gain, and childhood asthma risk: a meta-analysis of 147,000 European children Preterm birth: risk factor for early-onset chronic diseases Preterm heart in adult life: cardiovascular magnetic resonance reveals distinct differences in left ventricular mass, geometry, and function Right ventricular systolic dysfunction in young adults born preterm Elevated blood pressure in preterm-born offspring associates with a distinct antiangiogenic state and microvascular abnormalities in adult life Preterm birth and the metabolic syndrome in adult life: a systematic review and meta-analysis Prevalence of diabetes and obesity in association with prematurity and growth restriction Prematurity: an overview and public health implications Measurement of quality of life of survivors of neonatal intensive care: critique and implications Quality of life assessment in preterm children: physicians' knowledge, attitude, belief, practice -a KABP study Health-related quality of life and emotional and behavioral difficulties after 
extreme preterm birth: developmental trajectories Prognostic factors for poor cognitive development in children born very preterm or with very low birth weight: a systematic review Prognostic factors for cerebral palsy and motor impairment in children born very preterm or very low birthweight: a systematic review Evidence for catchup in cognition and receptive vocabulary among adolescents born very preterm The economic burden of prematurity in Canada Changing definitions of long-term followup: should "long term" be even longer? Functional outcomes of very premature infants into adulthood Social competence of preschool children born very preterm Prediction of cognitive abilities at the age of 5 years using developmental follow-up assessments at the age of 2 and 3 years in very preterm children Predicting the outcomes of preterm neonates beyond the neonatal intensive Perinatal risk factors of adverse outcome in very preterm children: a role of initial treatment of respiratory insufficiency? 
The relationship between behavior ratings and concurrent and subsequent mental and motor performance in toddlers born at extremely low birth weight Prognostic factors for behavioral problems and psychiatric disorders in children born very preterm or very low birth weight: a systematic review Neurodevelopmental outcomes of extremely low birth weight infants <32 weeks' gestation between Neighborhood influences on the academic achievement of extremely low birth weight children Mental health outcomes in US children and adolescents born prematurely or with low birthweight Measurement of socioeconomic status in health disparities research Family income trajectory during childhood is associated with adiposity in adolescence: a latent class growth analysis Family income trajectory during childhood is associated with adolescent cigarette smoking and alcohol use Machine learning in medicine: a primer for physicians Predicting discharge dates from the NICU using progress note data A Life Course Approach to Chronic Diseases Epidemiology 2nd edn. A Life Course Approach to Adult health Scientists rise up against statistical significance The ASA's Statement on p-values: context, process, and purpose Time for clinicians to embrace their inner Bayesian? 
Reanalysis of results of a clinical trial of extracorporeal membrane oxygenation Semi-competing risks data analysis: accounting for death as a competing risk when the outcome of interest is nonterminal Beyond composite endpoints analysis: semicompeting risks as an underutilized framework for cancer research Use and misuse of the receiver operating characteristic curve in risk prediction Assessing the performance of prediction models: a framework for traditional and novel measures Novel metrics for evaluating improvement in discrimination: net reclassification and integrated discrimination improvement for normal variables and nested models Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors Limitations of the odds ratio in gauging the performance of a diagnostic, prognostic, or screening marker Just health: on the conditions for acceptable and unacceptable priority settings with respect to patients' socioeconomic status AUC: 0.703 Sensitivity: 27.6% Specificity: 87.3% Competing interests: The authors declare no competing interests.Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.