key: cord-0810197-ht3fty2z
authors: Wanjiang, Feng; Xiaobo, Zhang; Xin, Wu; Ye, Meng; Lihua, Huang; Jianlong, Wang
title: Application of POSSUM and P-POSSUM scores in the risk assessment of elderly hip fracture surgery: systematic review and meta-analysis
date: 2022-05-07
journal: J Orthop Surg Res
DOI: 10.1186/s13018-022-03134-0
sha: ae657cedd3255377695a301162fb794d321f11ee
doc_id: 810197
cord_uid: ht3fty2z

BACKGROUND: Since Mohamed et al. analyzed 2326 orthopedic cases in 2002 and believed that the POSSUM formula can be directly used to predict postoperative morbidity and mortality in orthopedic patients, applications of POSSUM and P-POSSUM scores in the hip fracture surgery have been mostly reported in the field of orthopedics, but there are still some inconsistencies in the related reports. METHODS: The electronic library was searched for all literature that met the purpose from its inception to 2021. Relative risk (RR) was selected to evaluate whether the model could be used to assess the risk of surgery in patients with elderly hip fractures. Finally, sensitivity analyses and subgroup analyses were performed. RESULTS: Thirteen studies were finally included, including 9 retrospective and 4 prospective studies.The morbidity analysis includes 11 studies, and the result was RR = 1.07 (95% CI 0.93–1.24), The mortality analysis includes 11 studies on POSSUM and 5 studies on P-POSSUM. The results of mortality by POSSUM and by P-POSSUM were RR = 1.93 (95% CI 1.21–3.08) and RR = 1.15 (95% CI 0.89–1.50), respectively. POSSUM had more accuracy to predict mortality for sample < 200 subgroup(RR = 2.45; 95% CI 0.71–8.42) than sample > 200 subgroup(RR = 1.59; 95% CI 1.06–2.40), and in the subgroup of hip fractures that did not distinguish between specific fracture types(RR = 1.69, 95% CI 0.87–3.32) than intertrochanteric neck fracture subgroup(RR = 5.04, 95% CI 1.07–23.75) and femoral femoral fracture subgroup(RR = 1.43,95% CI 1.10–1.84). CONCLUSION: POSSUM can be used to predict morbidity in elderly hip fractures. The P-POSSUM was more accurate in predicting mortality in elderly hip fracture patients compared to the POSSUM, whose predictive value for mortality was influenced by the sample size and type of fracture studied. In addition, we believe that appropriate improvements to the POSSUM system are needed to address the characteristics of orthopedic surgery.

Hip fractures mainly refer to femoral neck fractures and intertrochanteric fractures. Gullberg et al. [1] predict that the global number of hip fractures will reach 2.6 million and 4.5 million by 2025 and 2050. With the aging of the global population, the incidence of elderly hip fractures in developed countries can reach 350/100,000 [2] , becoming 17:255 one of the most common diseases in joint orthopedics. Generally, the main treatment for hip fractures is surgery, however elderly patients with hip fractures often have many basic diseases when they are admitted to the hospital. George et al. [3] found that when elderly patients with hip fractures had a variety of other diseases, the risk of postoperative morbidity and mortality increased. Their data showed that 20% of the patients had postoperative complications, and the 30-day death rate of such patients was 9%. In addition, previous studies have also shown that the ageing of patients, cognitive dysfunction and other body aging processes will also increase the risk of postoperative complications or death in patients with hip fractures [4] [5] [6] . In actual clinical work, active preoperative preparation will significantly reduce the adverse events of patients during the perioperative period. Surgical risk prediction models are a very valuable tool for surgeons because these tools allow surgeons to assess and prevent patients' perioperative events in advance, so as to make the best decisions to optimize resources and improve the quality of care for patients [7, 8] .

Many prediction models have emerged in recent years, but their specific application value is almost limited. The Physiological and Operative Severity Score for enUmeration of mortality and morbidity (POSSUM) was first proposed by Copeland et al. [9] in 1991 to evaluate the prognosis of patients. Subsequently, Whiteley et al. [10] found that the POSSUM scoring system would overestimate the postoperative mortality rate, and after simplifying the exponential analysis technology in the POSSUM scoring system, a P-POSSUM scoring formula that was more suitable for surgical patients was obtained. Currently, the APACHEI, APACHE II, POSSUM, ASA and NHFS scoring systems are generally used internationally to assess the perioperative risk of surgical patients, and the research by de Cássia Braga Ribeiro et al. [11] believes that the POSSUM scoring system has the most application value in the risk assessment of the perioperative period. Some scholars believe that the Surgical Risk Scale (SRS) is more accurate than the POSSUM scoring system for surgical risk assessment, has the advantages of simple procedures, and can obtain the predictive indicators before surgery [12] , but in disciplines of general surgery, vascular surgery and esophageal surgery, the POSSUM scoring system is still widely used and recognized by surgeons [13] [14] [15] [16] [17] .

Since Mohamed et al. [18] analyzed 2326 orthopedic cases in 2002 and believed that the POSSUM formula can be directly used to predict postoperative complications and death risks in orthopedic patients, more and more reports have appeared on the application of POSSUM and P-POSSUM scores in hip fracture surgery. Yet, there were inconsistencies in the related reports, some reports believed that the POSSUM scoring system could not accurately predict postoperative morbidity and mortality, but other reports supported this model [19] [20] [21] [22] . Therefore, the purpose of this study is to conduct a meta-analysis on the application value of POSSUM and P-POSSUM in the risk assessment of hip fracture surgery, in order to guide the orthopedic surgeon in the evaluation of postoperative risk events and the choice of surgical benefits.

The search-style electronic libraries, including Pubmed, Embase, the Cochrane Library, CNKI, Wanfang Data, VIP Chinese Journals, and China Biomedical Literature Service System were used for document retrieval. From the establishment of the databases to 2021, a total of 289 documents were retrieved. All documents had abstracts or full texts, and there are no language restrictions. We OR (P-POSSUM[Title/ Abstract]))]. After eliminating the duplicate documents, all the review documents retrieved by this retrieval method and the references of the original research were comprehensively reviewed to determine whether there were additional documents. EndNote X9 software was used to manage the documents.

Inclusion criteria: original retrospective or prospective cohort study; the literature using POSSUM or P-POS-SUM for research; patients with bone fracture; surgical treatment. Exclusion criteria: meeting or review; graduate thesis; non-hip fractures (including femoral neck fractures and intertrochanteric fractures); data was incomplete (the predicted value and/or observed value cannot be obtained); Study population age < 60 years; There is no defined follow-up period (period of hospitalisation or number of days of follow-up).

Because there was no significant difference in composition between the orthopedic POSSUM system [9] in 1991, no strict distinction was made in literature inclusion.

The titles and abstracts were used to screen the literature for inclusion criteria. After the data extraction, the NOS scale was used to evaluate the quality of the literature. A summary of the literature content is shown in Table 1 , which is recorded in sequence with the following items: author name; publication year; country of author; study type; fracture type; total sample size; predictive outcome indicators (POSSUM: morbidity and mortality),P-POSSUM: mortality); actual outcome indicators (morbidity and mortality); O/E value (observed value/predicted value); NOS score result.

We performed meta-analysis using the latest version of RevMan 5.4 software recommended by the Cochrane Library. We used the relative risk (RR) to assess the predictive accuracy of the POSSUM scoring system because the included studies were cohort studies, the data type was a dichotomous variable and RR was the most useful indicator of the strength of the event association. 95% confidence intervals (CI) for the RR were used to indicate accuracy, and when the horizontal line of the 95% CI intersected the null vertical line or the 95% CI contained 1, we considered the POSSUM to accurately predict the postoperative risk of patients. When the 95% CI did not intersect the null vertical line or when the 95% CI did not contain 1. We considered the POSSUM to be too high (both upper and lower 95% CI were greater than 1) or too low (both upper and lower 95% CI were less than 1) to predict the postoperative risk. When the heterogeneity I 2 > 50% in the forest plot, we considered significant heterogeneity and chose the random-effects model; otherwise, we considered little heterogeneity and chose the fixed-effects model. Sensitivity analysis was performed for each of the included studies when I 2 > 50%, and if one study was excluded, I 2 < 50% was considered as the main source of heterogeneity; if sensitivity analysis did not reveal the main source of heterogeneity, subgroup analysis was performed according to hip fracture type (femoral neck fracture, intertrochanteric fracture or hip fracture) and sample size (less than 200 or greater than or equal to 200) to explore sources of heterogeneity.

Total of 289 studies was obtained through a literature search, and the full text of 51 studies was obtained. Among them, 38 studies met the exclusion criteria: 9 studies could not determine whether it was a hip fracture; 7 studies were in non-elderly populations; 9 studies did not specify the duration of follow-up; 13 studies did not have complete data. In the end, 13 studies were used for meta-analysis [20] [21] [22] [23] [24] [25] [26] [27] [28] [29] [30] [31] [32] , including 9 retrospective cohort studies [20, 22-26, 28, 29, 31] and 4 prospective cohort studies [21, 27, 30, 32] . The ratings of quality appraisal of these included papers are all ≥ 6 stars. The specific inclusion and exclusion process is shown in Fig. 1 .

A total of 13 studies were included, including 9 retrospective cohort studies and 4 prospective cohort studies. Among them, 11 studies each reported POSSUM's predictive value for the morbidity and mortality, and 

Sensitivity analysis was performed with RevMan5.4 software, we found that the study of Blay-Domínguez et al. [20] is a significant effect on the overall heterogeneity in POSSUM to morbidity. The prediction by POSSUM of mortality of patients with hip fractures was analyzed in subgroups according to the fracture type (femoral neck fractures, intertrochanteric fractures, and hip fractures) and sample size (sample > 200 cases, sample < 200 cases). As shown in Fig. 3A , after excluding studies that led to major sources of heterogeneity, RR = 1.01 (95% CI 0.94-1.08, I 2 = 18%) for POSSUM to morbidity. In Fig. 3B 

According to research reports that can be retrieved so far, this study is the first comprehensive analysis of the application of POSSUM and P-POSSUM scoring systems to elderly hip fractures. In terms of the results of this study, the POSSUM scoring system can accurately predicted the postoperative morbidity (RR = 1.07, 95% CI 0.93-1.24), and compared with POSSUM over-predicted the mortality (RR = 1.93, 95% CI 1.21-3.08), P-POSSUM can accurately predict the postoperative mortality of elderly patients with hip fractures (RR = 1.15, 95% CI 0.89-1.50).

In a meta-analysis report on the application value of POSSUM scoring system in hepatobiliary and pancreatic surgery by Chen et al. [33] , they believe that because the early POSSUM scoring model is not suitable for current surgical operations, it led to POSSUM's over-prediction of postoperative morbidity and mortality. Similarly, we also believe that the same situation may exist here. The POSSUM scoring system proposed by Copeland et al. in 1991 was originally used for general surgery patients. It was used in orthopedics after Mohamed et al. [18] modified its surgical scoring table according to the characteristics of orthopedic surgery in 2002. In the past 18 years, surgical methods and instruments have been greatly improved, and the nursing teams in related departments have become more professional. Previous improvements according to the characteristics of orthopedic surgery are no longer applicable to the current postoperative risk assessment. And, with the development of surgical technology, the impact of surgical intervention on the prognosis of patients is getting smaller and smaller. Combining the characteristics of high incidence of hip fracture in the elderly patients and having many underlying diseases in this group of people, it is not enough to modify the surgical severity score scale alone, and the physiological score scale should also be specifically modified. Orthopedic patients, especially those with hip fractures, have a tendency to stay in bed for a long time, which leads to a high risk of venous thrombosis. The physiology score scales, such as coagulation function, D-dimer, and other auxiliary examinations such as neck and lower extremity vascular color Doppler ultrasound to assess the patient's vascular condition, have not been reflected in the score.

Secondly, some studies reported that the difference in the prediction by POSSUM on the morbidity and mortality may be caused by the difference in the medical technology level, population characteristics, and the composition ratio of patient operation types in the nonoriginating country of the system from the originating country of the system [12] . In the process of exploring the source of heterogeneity of combined results in this study, we found that the type of hip fracture, the choice of surgical methods, and the distribution of patient age may all have an impact on the prediction accuracy of the POSSUM scoring system. Because the treatment methods of different fracture types are often different, are abnormal compared to normal people because the relevant test results are "normal" for themselves, which leads to the inability of the POSSUM score to accurately predict the postoperative complications rate and mortality of this group of people. That is, POSSUM gives an excessively high score for the "normal" physiological examination results that can maintain the physiological function of this group of people. Therefore, this group population may not be suitable for POSSUM scoring system, a two-component scoring model of preoperative physiology score and surgical severity score. We believe that the above problems may be solved by risk grouping based on the size of the POSUUM score value. From previous reports, it can be seen that the error of POSSUM's over-predicting postoperative death events mainly comes from the low-risk group. For example, in the reports of Whiteley et al. [10] and Prythech et al. [34] , The predicted value of death event in the low-risk group was 6 times and 7 times the actual observation value, and POSSUM performed well in the high-risk group. Therefore, we believe that it is necessary to improve the predictive ability of the POSSUM scoring system in the complication rate and mortality of patients with hip fractures, and it is very necessary to predict the risk of patients when using the scoring system. It is worth raising that, according to previous reports in the literature, the overall prediction of the POSSUM scoring system for postoperative patients was consistent with actual observations, but its prediction for younger patients and patients undergoing elective surgery was found to be higher than the actual outcome in the subgroup [35] , so this seems to suggest that the elderly population and critically ill patients requiring emergency surgery may be the priority groups. Evidence for this view is also provided by a recently retrospective cohort study, which showed that the P-POSSUM score showed good predictive power for postoperative mortality in COVID-19 positive patients undergoing emergency general surgery [36] . The scoring system is not applicable to children because the physiological scoring indicators in the scoring system are those of adults. In addition, the definition of postoperative complications is not completely clear, especially for those with preoperative underlying disease, so a clear definition of postoperative complications is necessary. Finally, according to the regulations of the POSSUM scoring system when it was established, the physiological indicators of the system should preferably be data within 1 day before surgery, and POSSUM is best for predicting complications and mortality within 30 days after surgery. And regarding the relevant data in the surgical scoring, such as blood loss and operative time, they need to be recorded accurately by the assessor.

One of the shortcomings of this study is that although our study was analysed in subgroups, we did not find a major source of heterogeneity in the POSSUMpredicted postoperative mortality pairs. We suspect that this may be due to differences in the cause of the fracture, the physical impact of the fracture itself, the treatment, the distribution of the population, and the level of medical care, but we were unable to draw firm conclusions. In addition, this study also failed to collect relevant unpublished data.

In all, the current data analysis shows that the POS-SUM scoring system can predict the postoperative morbidity of elderly hip fracture patients. POSSUM's prediction of mortality is affected by the type of fracture and size of sample. Compared with POSSUM, P-POSSUM can accurately predict the postoperative mortality of patients with hip fractures. The application of the POSSUM scoring system in orthopedics needs to be further improved according to the characteristics of orthopedic patients and surgery, as well as needing more multi-center and large-sample prospective studies so that it can be more applicable to orthopedics.

World-wide projections for hip fracture

Predictors of noninstitutionalized survival 1 year after hip fracture: a prospective observational study to develop the Marburg Rehabilitation Tool for Hip fractures (MaRTHi)

The effect of pre-operative medical co-morbidities on rehabilitation outcomes following surgery for hip fracture management in geriatric populations

Gender differences in patients with hip fracture: a greater risk of morbidity and mortality in men

Predictors of functional recovery after hip fracture in the elderly

Effect of comorbidities and postoperative complications on mortality after hip fracture in elderly people: prospective observational cohort study

An overview of risk stratification in the elderly orthopaedic population

Hip fracture mortality: predictive models

POSSUM: a scoring system for surgical audit

An evaluation of the POSSUM surgical scoring system

POSSUM, and ASA scores and the risk of perioperative complications in patients with oral or oropharyngeal cancer

Comparison of the risk prediction systems POSSUM and P-POSSUM with the Surgical Risk Scale: a prospective cohort study of 721 patients

Physiological POSSUM as an indicator for long-term survival in vascular surgery

Portsmouth physiological and operative severity score for the enumeration of mortality and morbidity scoring system in general surgical practice and identifying risk factors for poor outcome

Development of a dedicated risk-adjustment scoring system for colorectal surgery (colorectal POSSUM)

Risk-adjusted prediction of operative mortality in oesophagogastric surgery with O-POSSUM

Estimation of mortality and morbidity risk in vascular surgery using POSSUM and the Portsmouth predictor equation

An assessment of the POSSUM system in orthopaedic surgery

Audit of morbidity and mortality following neck of femur fracture using the POSSUM scoring system

O-POSSUM score predicts morbidity and mortality in patients undergoing hip fracture surgery

POSSUM scoring for patients with fractured neck of femur

Evaluation of POSSUM scoring system in the treatment of osteoporotic fracture of the hip in elder patients

Perioperative safety and efficacy of hip arthroplasty for femoral neck fracture in patients aged 80 and over

Prediction of postoperative complications in elderly patients with intertrochanteric fractures

The value of modified POSSUM scoring system in predicting mortality and morbidity for the intertrochanteric fracture in elder patient

Value of modified POSSUM and P-POSSUM scoring system in predicting mortality and morbidity of aged patients underwent hip fracture operation

Neck of femur fractures in the over 90s: a select group of patients who require prompt surgical intervention for optimal results

Three preoperative scoring systems to predict 30-day mortality in elderly patients undergoing hipfracture surgery

A retrospective analysis of the predictive effects of POSSUM and P-POSSUM on postoperative complication and mortality rates in hip fracture patients

Accuracy of the physiological and operative severity score for the enUmeration of mortality and morbidity score and the Nottingham risk score in hip fracture patients in Sweden -a prospective observational study. Acta anaesthesiol Scand

Risk prediction of POSSUM and P-POSSUM scoring system for hip replacement in elderly intertrochanteric fractures

Morbidity and mortality following surgery for hip fractures in elderly patients

POSSUM and P-POSSUM as predictors of postoperative morbidity and mortality in patients undergoing hepato-biliary-pancreatic surgery: a meta-analysis

POSSUM and Portsmouth POSSUM for predicting mortality. Physiological and operative severity score for the enUmeration of mortality and morbidity

Introduction of the POSSUM scoring system and suggestions for promoting its application in orthopedics. Chinese Bone Tumor Bone Dis

P-POSSUM as mortality predictor in COVID-19-infected patients submitted to emergency digestive surgery. A retrospective cohort study

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.