key: cord-0032068-arrb7iwv authors: Chawla, Riddhi; Balaji, S.; Alabdali, Raed N.; Naguib, Ibrahim A.; Hamed, Nadir O.; Zahran, Heba Y. title: Predicting the Kidney Graft Survival Using Optimized African Buffalo-Based Artificial Neural Network date: 2022-05-14 journal: J Healthc Eng DOI: 10.1155/2022/6503714 sha: b06653ebfc32df25c6859f90fe69e69392249d98 doc_id: 32068 cord_uid: arrb7iwv A variety of recipient and donor characteristics influence long- and short-term kidney graft survival. It is critical to predict the effectiveness of kidney transplantation in order to optimise organ allocation. This would allow patients to choose the best available kidney donor and the optimal immunosuppressive medication. Several studies have attempted to identify factors that predispose to graft rejection, but the results have been contradictory. The goal of this paper is therefore to use the African buffalo-based artificial neural network (AB-ANN) approach to uncover predictive risk variables related to kidney graft survival. Information gain and African buffalo optimization are combined into a novel hybrid feature selection technique that selects the most important features to improve prediction accuracy. The feature analysis revealed that clinical features have varied effects on transplant survival. The collected data are divided into training and testing sets. The prediction model's performance, in terms of accuracy, precision, recall, and F-measure, was examined, and the results were compared with those of other existing systems, including naive Bayesian, random forest, and J48 classifiers. The results suggest that the proposed approach can forecast graft survival at kidney recipients' next visits with higher accuracy than the other classifiers. The proposed method is more efficient for predicting kidney graft survival.
Incorporating those clinical tools into the everyday workflows of outpatient clinics could help physicians make better and more personalised decisions.

The importance of predicting the outcome of kidney transplantation cannot be overstated [1]. Researchers and decision makers are increasingly being urged to promote patient-centred care that respects the preferences, requirements, and values of patients. Patients with end-stage organ dysfunction require an organ transplant, which improves their quality of life [2]. The capacity to forecast the survival rate after transplantation is vital and plays a key role in understanding the donor-recipient matching procedure. This matching is essential for the success of renal replacement because it allows patients to choose the best available kidney donor and the best immunosuppressive medication. Prognosis of the outcome of organ transplantation is a clinically important and difficult subject. Predicting survival before treatment simplifies the patient's decisions and improves survival by influencing clinical practice decisions [3]. Many variables that influence the prediction problem have been extensively investigated, but the complicated relationships among these variables make prediction a difficult task. Kidney transplantation is regarded as the preferred alternative treatment for individuals with end-stage renal disease since it has several benefits over dialysis, including a higher quality of life and longer survival [4]. Graft function and survival have improved significantly over the last two decades, yet many transplanted kidneys are still lost to chronic allograft nephropathy and acute rejection [5]. Compared with individuals with functioning grafts, this results in a threefold increased risk of death. In terms of outcomes, it has long been suspected that, in kidney transplantation, patients prioritise graft survival over the risk of illness or malignancy.
Prediction of individual graft survival [6] could thus be an initial step in improving the information patients receive about their health status and in promoting patient-centred care. Because of the scarcity of organs, long waiting lists, the higher cost of retransplantation, and the risk of graft failure, kidney graft performance must be closely monitored. A variety of recipient- and donor-related parameters that affect graft survival also affect the allocation of kidney transplants. As the demand for kidney transplantation grows around the world, it is essential to recognize the possible causes of graft failure in order to improve patients' survival and quality of life [7]. Investigating, identifying, and adjusting for risk variables is critical because transplantation failure is associated with negative outcomes for patients. Nevertheless, because of the wide range of risk variables for graft failure, this evidence is much harder to quantify at the individual level [8]. Various prognostic and predictive factors affecting the effectiveness of renal grafts have been explored in different studies, including the age of donor and recipient, sex, type of donor (living or deceased), body mass index, anaemia, and type of immunosuppressive regimen. However, the outcomes were contradictory. Several clinical investigations of the impact of these parameters on graft survival have been undertaken [9], but given the complicated interplay among those factors, there is still more to explore in this domain. Judged by their receiver operating characteristic scores, current risk prediction models predict the outcomes of kidney transplant recipients only to a limited extent. Numerous classification techniques are employed to predict a categorical response variable on the basis of covariates and predictors. Although neural networks may predict the overall clinical result, they cannot discern the particular risk variables for a specific clinical event [10].
The presence of irrelevant factors may increase the difficulty of the approach, making it hard to build a predictive model from clinical data. Machine learning approaches presented in this field have shown reliable and robust performance in classifying binary responses. To develop nonlinear models, the artificial neural network strategy is introduced; it is capable of automatically detecting complicated nonlinear relationships between dependent and independent variables, as well as all possible interactions among predictor variables [11]. Kidney transplantation is the most effective therapy for end-stage kidney disease. It improves patient survival and provides a better quality of life than haemodialysis. Moreover, it significantly decreases the long-term healthcare costs for such individuals. Extended immunosuppression, on the other hand, is connected to a number of adverse effects that could change both patient and graft survival. Graft survival is the period of time for which a kidney transplant (graft) works well enough for the patient not to require dialysis or another transplant.

The goal of this research is to establish a novel forecasting approach that combines feature engineering with deep learning techniques via an optimization mechanism in order to increase prediction performance. To accomplish this goal, a prediction approach based on kidney graft survival data has been developed for forecasting graft survival after transplantation. It could be used in real time and is suitable for forecasting kidney transplantation outcomes via data analysis. Any transplantation dataset can be used with the proposed prediction algorithm.

The remainder of the article is laid out as follows. Existing research on the prediction of kidney transplant graft survival is examined in Section 2. The proposed AB-ANN prediction approach is presented in detail in Section 3.
Section 4 describes the dataset and the test results, Section 5 presents the discussion, and Section 6 concludes this study.

Technique. Data-driven strategies were used in a number of studies to predict graft survival following transplantation. The authors in [12] investigated the factors affecting graft survival before and after kidney transplantation by employing Kaplan-Meier methods. To improve organ retrieval and allocation, a multivariate analysis [13] was utilized for predicting the outcomes of kidney transplantation from a deceased donor. However, these studies were limited by relying solely on statistical methodologies. Better methodologies are therefore needed to uncover potentially hidden information among the various characteristics that could influence the prediction of graft survival status after a kidney transplant. To determine the survival of grafts from deceased donors, a tree regression model [14] has been introduced. A neural network strategy was developed for estimating delayed graft function after kidney transplantation [3]. However, those studies were limited to deceased donors. Other researchers sought to develop transplantation outcome prediction models. To identify essential variables and subsequently design a Bayesian belief network, researchers combined statistical mechanisms such as elastic nets with machine learning techniques such as ANN, bootstrap, random forest, and support vector machines. This model examined the hidden dependencies among the variables and had a precision of 68.4%. Cox-based models [15] have been used extensively in the survival assessment of complex organ transplants, but such techniques lose prediction accuracy as the feature space grows larger. A feature selection scheme based on a hybrid genetic algorithm was used to determine the key traits for lung transplantation.
They employed three different classification prediction processes for forecasting the outcome of lung transplantation and the quality of life of patients [16]. The findings surpassed those of previous research. Some other studies deployed artificial neural networks and a statistically determined nomogram for forecasting five-year graft survival after kidney transplantation, using clinical and demographic data [17]. Using an external validation dataset, they found that the artificial neural networks outperformed the nomograms. The authors in [18] created a Bayesian belief network for predicting graft survival. The model can determine graft failure with good accuracy. A Bayesian belief network architecture was employed in some other studies for forecasting the outcome of heart transplantation. Compared with other approaches in the literature, the results showed similar predictive effectiveness. In kidney transplantation, machine learning-based predictive algorithms identify the main correlations among recipient and donor characteristics in order to predict transplant outcomes from recipient-donor data. ML approaches have been used in a number of studies for predicting the outcome of kidney grafts [19], but in almost all of the examined studies, the conventional approach has been to choose one or more arbitrary time periods starting from the transplant date and to use classification techniques for prediction. In terms of prediction modelling and feature engineering, there is a definite need for more research into data stratification methodologies and other machine learning methods [20]. Alternative models for predicting kidney graft and recipient survival include artificial neural networks as well as linear regression mechanisms [21]. In addition to the approaches that use static covariates, other approaches such as landmark modelling and joint modelling use time-dependent factors to increase predictive performance.
Feature selection is a key issue in a variety of fields, including document classification, prediction, object identification, and bioinformatics, as well as the representation of complicated production technologies. In such applications, datasets with hundreds of features are frequent. In some situations, all the features could be significant, but for certain target concepts, just a small subset of features is essential. Some classification techniques learn to focus on the most critical features while ignoring the less important ones. Decision trees are one such method; multilayer perceptron neural networks with strong regularization of the input layer can also automatically eliminate unnecessary features [22]. In one study, kidney graft survival was derived by Bayesian belief network modelling; in that research, 5155 patients were randomly selected from the database of the United States Renal Data System.

The key contributions of this research are as follows:
(i) Introducing African buffalo optimization for feature selection, which can effectively choose the most relevant feature set for prediction
(ii) Designing a new combined predictive model, which can correctly assess the status of a kidney graft after transplantation and address the limitations of prior studies
(iii) Combining the information gain function and the ABO mechanism with the ANN model to attain good predictive ability

This research extends the prediction of graft survival by proposing a new three-phase approach: (i) a data processing phase, (ii) a feature selection phase, and (iii) a prediction phase [23]. Prior to data processing, donor and recipient characteristics such as age, gender, blood type, and health are analysed. The cross-match test is used to find out how the donor's blood reacts with the recipient's blood, and the HLA test analyses the immune system to determine the outcome of the operation.
The input data is first gathered and preprocessed so that it can be used for training and testing. The data processing phase consists of two steps: data cleaning and data censoring. Feature selection then identifies the most essential features to be used in the prediction phase, reducing both the complexity of the technique and the dimensionality of the feature space. Information gain together with the ABO mechanism is used to choose the most relevant features. These two feature selection approaches combine to provide a novel hybrid feature selection technique that selects the most important features to improve prediction accuracy. Finally, in the prediction phase, the status of the graft is forecast as "survive" or "not survive". The workflow of the proposed AB-ANN model is shown in Figure 1.

The kidney transplantation dataset provided by Mansoura University's Urology and Nephrology Center [24] was used to validate the proposed approach for predicting graft survival. This database includes medical history, demographic data, some preoperative considerations for both recipients and donors, physical condition during and after transplantation, and additional features such as the transplant date and dialysis information of kidney transplant patients. The data is divided into two subsets: training (70%) and testing (30%). First, the training set is used to train the proposed predictive model. The trained model is then used to forecast the survival class on the test set, which it has not seen before.

In the data processing step, the input data is preprocessed so that it can be used during training and testing. This step consists of two phases: data cleaning and data censoring.
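The 70%/30% division described above can be illustrated with a minimal sketch. It assumes a simple random shuffle with a fixed seed; the source does not say whether the split was random, stratified, or chronological, and the function name `split_train_test` and the toy `patients` records are hypothetical.

```python
import random


def split_train_test(records, train_fraction=0.7, seed=0):
    """Shuffle the records and split them into a training and a test subset.

    Assumption: a plain random split; the paper does not specify the scheme.
    """
    if not 0.0 < train_fraction < 1.0:
        raise ValueError("train_fraction must be strictly between 0 and 1")
    shuffled = list(records)
    # A local Random instance keeps the split reproducible without
    # touching the global random state.
    random.Random(seed).shuffle(shuffled)
    cut = int(round(train_fraction * len(shuffled)))
    return shuffled[:cut], shuffled[cut:]


# Hypothetical toy records standing in for patient rows.
patients = [{"id": i, "graft_survived": i % 3 != 0} for i in range(100)]
train_set, test_set = split_train_test(patients)
```

Fixing the seed makes the experiment repeatable; a stratified split that preserves the survive/not-survive ratio in both subsets would be a reasonable alternative when the classes are imbalanced.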
Because the prediction approach is meant to forecast the outcome of kidney transplantation before the transplant takes place, all operative and postoperative features are deleted at the first level of the data cleaning process. Attributes that have no predictive value (e.g., the patient's name, the hospital ID, and the date of examination) are removed at the second level. At the last level, instances with missing data are deleted. Missing value imputation can be done in a variety of ways. For the 1 percent of values that were missing, a custom mean imputation approach was used, in which each covariate's missing value was replaced with the mean of its preceding and following values in temporal order. In the data censoring process, the graft survival status was censored when the graft time was less than the number of days in the five-year period and the graft was still functioning, or when the study end date was reached. If the patient is on dialysis or died with a failed graft, the graft time is calculated by subtracting the transplantation date from the dialysis initiation date. If the patient is alive with a functioning graft or died with a functioning graft, the graft time is calculated by subtracting the transplantation date from the last follow-up date.

3.3.1. Feature Selection Phase. Feature selection is a significant part of any data mining procedure. Choosing the most important features increases the prediction accuracy of the model while reducing computation time and processing costs. In this study, an optimised feature selection approach that combines the African buffalo optimization mechanism with the information gain function is used to determine the most significant features for the prediction process. The combined feature selection mechanism brings together the advantages of both methods, resulting in significantly improved system performance. First, IGBFS is used to identify the key attributes.
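The graft-time rule and the neighbour-mean imputation from the data processing step above can be sketched as follows. This is a sketch under stated assumptions, not the authors' implementation: the status labels, the function names, the 1825-day reading of "five years", and the fallback to the nearest observed value when only one neighbour exists are all hypothetical choices made for illustration.

```python
from datetime import date

# Assumption: the five-year window is counted as 5 * 365 days.
FIVE_YEARS_DAYS = 5 * 365

# Hypothetical status labels for the two "failed graft" cases in the text.
FAILED_STATES = {"on_dialysis", "died_with_failed_graft"}


def graft_time_days(status, transplant_date, dialysis_start=None, last_follow_up=None):
    """Graft time in days, following the two cases described in the text."""
    # Failed graft: count up to dialysis initiation.
    # Functioning graft (alive or deceased): count up to the last follow-up.
    end = dialysis_start if status in FAILED_STATES else last_follow_up
    if end is None:
        raise ValueError("missing end date for status %r" % (status,))
    return (end - transplant_date).days


def is_censored(status, graft_days):
    """Censored when the graft still functions and was observed for under five years."""
    return status not in FAILED_STATES and graft_days < FIVE_YEARS_DAYS


def impute_neighbour_mean(values):
    """Replace each None with the mean of the nearest observed values
    before and after it in temporal order (one neighbour is used alone
    if the other does not exist; that fallback is an assumption)."""
    filled = list(values)
    for i, value in enumerate(values):
        if value is not None:
            continue
        before = next((values[j] for j in range(i - 1, -1, -1) if values[j] is not None), None)
        after = next((values[j] for j in range(i + 1, len(values)) if values[j] is not None), None)
        neighbours = [x for x in (before, after) if x is not None]
        if neighbours:
            filled[i] = sum(neighbours) / len(neighbours)
    return filled


# Hypothetical usage on made-up values.
days = graft_time_days("on_dialysis", date(2010, 1, 1), dialysis_start=date(2010, 1, 31))
creatinine = impute_neighbour_mean([1.0, None, 3.0])
```

Neighbours are always read from the original sequence, so an imputed value is never reused to fill another gap.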
Applied to the UNC database, IGBFS identifies 55 of the 67 attributes as important; these are the attributes whose IG is different from zero. Second, NBBFS is used to select the most important features from the base set produced by IGBFS. The goal of employing information gain (IG) is to identify the features that provide the most information about the classes [25]. Such features are highly discriminative and tend to occur within a single class. IG is a feature ranking method that uses entropy: it measures how much the entropy of the class variable is reduced when the value of a particular feature is observed. The information gain value therefore indicates how much information the feature contributes to the dataset, and each feature receives an information gain score that indicates whether it is needed. A feature with IG = 0 is rejected. The higher the IG, the greater the chance of obtaining pure classes in the target class. After the information gain values have been calculated for all features, each value is compared with a threshold: if a feature's information gain exceeds the threshold, the feature is selected; otherwise, it is not. In this study, a threshold of zero is used, so features with an information gain greater than zero are regarded as the most essential features for prediction. The African buffalo optimization (ABO) mechanism [26] selects the essential features for prediction using its fitness function. The ABO mechanism is also employed as an optimizer in the last layer of the AB-ANN model to enhance prediction accuracy. Furthermore, the learning factors help to keep track of the essential feature points.
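The information gain ranking with a zero threshold, as described above, can be illustrated with a small sketch. It assumes categorical (already discretised) features and uses the standard definition IG(feature) = H(class) - H(class | feature); the function names and the toy table are hypothetical and are not taken from the UNC dataset.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(feature_values, labels):
    """Reduction in class entropy obtained by observing the feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    # Weighted entropy of the class within each feature value.
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def select_by_information_gain(table, labels, threshold=0.0):
    """Keep the features whose information gain exceeds the threshold."""
    gains = {name: information_gain(column, labels) for name, column in table.items()}
    selected = [name for name, gain in gains.items() if gain > threshold]
    return selected, gains


# Hypothetical toy data: one informative and one uninformative feature.
labels = ["survived", "survived", "failed", "failed"]
table = {
    "donor_type": ["living", "living", "deceased", "deceased"],
    "ward": ["a", "b", "a", "b"],
}
selected, gains = select_by_information_gain(table, labels)
```

With floating-point arithmetic, an uninformative feature can yield a tiny non-zero gain on real data, so a small positive threshold may be safer in practice than an exact zero. Continuous covariates would need to be binned before this calculation applies.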
The cooperative behaviour of the buffalo is represented by $le_1(\beta_{targ\,p} - w_f)$, and the intelligence of the buffalo is denoted by $le_2(b_{pmax.f} - w_f)$. The fitness value is computed by

$$m_{f+1} = m_f + le_1(\beta_{targ\,p} - w_f) + le_2(b_{pmax.f} - w_f). \quad (1)$$

Here, $m_{f+1}$ denotes the next feature value, and $m_f$ represents the current feature value. In addition, the new feature update is computed using (2), where $w_f$ and $m_f$ indicate the exploration and exploitation fitness of $f$, respectively.

A multilayer feed-forward perceptron was employed as the neural network [3]. Equations (3)-(5) give the mathematical description of the neural network. Let

$$t_n = f'(m_n + \mu_{n1} l_1 + \mu_{n2} l_2 + \cdots + \mu_{np} l_p), \quad (4)$$

where $t_n$ is the output of the $n$th hidden node, $m$ is the total number of nodes in the hidden layer, $p$ is the number of covariates, $m_n$ is the intercept parameter of the $n$th hidden node, $l_p$ is the $p$th covariate, $\mu_{np}$ is the parameter linking the $p$th covariate to the $n$th hidden node, and $f'(\cdot)$ is the activation function. Now,

$$s = g'(a + b_1 t_1 + b_2 t_2 + \cdots + b_m t_m), \quad (5)$$

where $s$ is the neural network output, $a$ is the bias parameter, $b_n$ is the output parameter of the $n$th hidden node, $t_n$ is the output of the $n$th hidden node, and $g'(\cdot)$ is the output function. The functions $f'(\cdot)$ and $g'(\cdot)$ can be arbitrary; however, the hyperbolic tangent function $(e^{l} - e^{-l})/(e^{l} + e^{-l})$, the logistic function $e^{l}/(1 + e^{l})$, and the linear function are the most common choices.

The list of critical features is trained and tested using the AB-ANN algorithm, as shown in Algorithm 1. Assume that the input dataset comprises n features (f1, f2, ..., fn). The information gain is calculated for each feature, indicating how much information that feature carries. The features with an information gain higher than 0 are added to the list of important features. The accuracy of the classifier is then calculated. Next, each feature is removed from the list of important features, one at a time. Then, the remaining features are used to train and test the ANN classifier model.
If removing a feature reduces classifier accuracy, the feature is important and is added to the list of the most important features. If removing it improves classifier accuracy, the feature is not necessary and is dropped. This procedure continues until all key features have been tested and the list of the most important features is complete. Whether or not the graft will survive is then predicted from the most essential features in this list. This method could help to save the lives of patients who have undergone transplant surgery.

This section evaluates the performance of the proposed AB-ANN method. Two aspects are considered in the evaluation: the number of selected features and predictive performance. The performance metrics are classification accuracy, precision, recall, F-measure, root mean square error, and mean absolute error. The root mean square error and mean absolute error of the proposed and existing methods are compared in Table 1 and illustrated in Figure 2. The figure shows that the proposed method has the lowest error rate among the compared mechanisms, with a root mean square error of 15.4% and a mean absolute error of 9.3%.

The most intuitive performance metric is accuracy, which is defined as the ratio of correctly predicted observations to all observations, that is, the proportion of correctly classified patterns among all classified patterns. It is calculated using (6) as follows:

$$\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}, \quad (6)$$

where $TP$, $TN$, $FP$, and $FN$ denote the numbers of true positives, true negatives, false positives, and false negatives, respectively. Figure 3 compares the accuracy of the proposed method with that of the most recent techniques. Table 2 shows that the proposed AB-ANN predictive approach for renal transplantation can improve the classification accuracy while reducing the difficulty of feature selection.
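The accuracy and error measures used in this evaluation can be sketched as follows. The source does not state how the root mean square error and mean absolute error were obtained for a classifier; this sketch assumes they are computed between the 0/1 class label and the predicted probability of the positive class, which is one common convention. The function names and the toy values are hypothetical.

```python
import math


def accuracy(y_true, y_pred):
    """Fraction of observations whose predicted class equals the true class."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def mean_absolute_error(y_true, y_prob):
    """Mean absolute difference between 0/1 labels and predicted probabilities.

    Assumption: errors are measured on probabilities, not on hard labels.
    """
    return sum(abs(t - p) for t, p in zip(y_true, y_prob)) / len(y_true)


def root_mean_square_error(y_true, y_prob):
    """Square root of the mean squared difference between labels and probabilities."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_prob)) / len(y_true))


# Hypothetical toy values: 1 = graft survives, 0 = graft does not survive.
y_true = [1, 0, 1, 1]
y_prob = [0.9, 0.2, 0.6, 0.4]
y_pred = [1 if p >= 0.5 else 0 for p in y_prob]

acc = accuracy(y_true, y_pred)
mae = mean_absolute_error(y_true, y_prob)
rmse = root_mean_square_error(y_true, y_prob)
```

Because squaring weights large deviations more heavily, the root mean square error is never smaller than the mean absolute error on the same predictions.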
Precision is the proportion of positive class predictions that actually belong to the positive class [28][29][30], that is, the proportion of correctly classified events among all detected events. It is computed as follows:

$$\text{Precision} = \frac{TP}{TP + FP},$$

where $TP$ and $FP$ are the numbers of true positives and false positives. The precision of the proposed and existing methods is compared in Table 3 and illustrated in Figure 4. The figure shows that the proposed AB-ANN method has a higher precision value (97.6%) than the existing mechanisms, which indicates that it outperforms them.

Recall is the proportion of all positive examples in the dataset that are predicted as positive [31][32][33], that is, the fraction of correctly detected events among all actual events. It is calculated as follows:

$$\text{Recall} = \frac{TP}{TP + FN},$$

where $FN$ is the number of false negatives. The recall of the proposed and existing methods is compared in Table 4 and illustrated in Figure 5. The figure shows that the proposed AB-ANN method has a higher recall value (98.2%) than the other existing mechanisms.

The F-measure is the harmonic mean of precision and recall. It is a statistical measure used to rate performance. The F1-score is formulated as follows:

$$F_1 = \frac{2 \times \text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}}.$$

The F-measure of the proposed and existing methods is compared in Table 5 and illustrated in Figure 6. The figure shows that the proposed AB-ANN method has a higher F-measure value (99.2%) than the other existing mechanisms.

The wide availability of alternative treatments has increased the life span of patients with end-stage renal disease. In this research, the performance of the AB-ANN technique in predicting graft survival in kidney transplant recipients was compared with that of other methodologies. The proposed prediction methodology is tested using the UNC dataset, and the results are compared with those of other recent methods.
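The precision, recall, and F-measure definitions given in this section can be computed from the confusion counts as in the sketch below. It uses the standard definitions; the function names, the convention of returning 0.0 when a denominator is zero, and the toy labels are assumptions made for illustration.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn) for a binary problem."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1
        elif pred == positive:
            fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def precision_recall_f1(y_true, y_pred, positive=1):
    """Precision, recall, and F1; each is 0.0 when its denominator is zero (assumed convention)."""
    tp, fp, fn, _ = confusion_counts(y_true, y_pred, positive)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0.0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical toy labels: 1 = graft survives, 0 = graft does not survive.
y_true = [1, 1, 1, 0, 0]
y_pred = [1, 1, 0, 1, 0]
precision, recall, f1 = precision_recall_f1(y_true, y_pred)
```

Reporting precision and recall alongside accuracy matters when one outcome is much rarer than the other, since accuracy alone can look high for a model that rarely predicts the rare class.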
The predictions generated by the AB-ANN were more accurate than those of previous techniques according to the evaluation metrics of accuracy, precision, recall, F-measure, and error rate. Experiments demonstrated that the proposed kidney transplantation survival estimation technique surpassed all of the compared strategies, with prediction accuracy and F-measure scores of 99.89 percent and 99.2 percent, respectively. The proposed prediction technique achieved the best accuracy, higher speed, and a higher F-measure. Furthermore, the novel feature selection strategy succeeded in speeding up classification by reducing the number of features to a minimum. These results indicate that the proposed procedure is reliable and produces excellent outcomes. The nature of this model allows it to be used for both short- and long-term forecasting. Such predictive techniques could aid in the implementation of personalised treatment in kidney transplantation. The proposed prediction technique can increase classification accuracy while reducing the complexity of feature selection, which shows the efficacy of the proposed strategy. The proposed prediction model could also be applied to a variety of transplant datasets.

The importance of predicting the outcome of kidney transplantation cannot be overstated. It allows patients to choose the best available kidney donor and the best immunosuppressive medication. The ability to predict graft survival following transplantation is essential, and it is a particularly challenging problem because it requires an understanding of the donor-recipient matching method. As finding donors is challenging, this matching is highly important. Prediction of graft survival in kidney transplantation is a serious and therapeutically significant issue. An optimised deep learning framework for risk prediction of graft failure was built in this study, and it displayed a higher level of prediction performance.
These algorithms outperformed the existing risk prediction tools reported in the literature. Future research will focus on how best to integrate such models into healthcare workflows to improve the long-term health of kidney recipients.

The data used to support the findings of this study are included within the article.

The authors declare that they have no conflicts of interest to report regarding the present study.

[1] Factors influencing long-term outcome after kidney transplantation
[2] Dynamic predictions of long-term kidney graft failure: an information tool promoting patient-centred care
[3] Personalized prediction of delayed graft function for recipients of deceased donor kidney transplants with machine learning
[4] Immunosuppressive drugs in kidney transplantation
[5] Donor genomics influence graft events: the effect of donor polymorphisms on acute rejection and chronic allograft nephropathy
[6] Analyses of the short- and long-term graft survival after kidney transplantation in Europe between
[7] Effect of delayed graft function on short- and long-term kidney graft survival
[8] A dynamic model for predicting graft function in kidney recipients' upcoming follow up visits: a clinical application of artificial neural network
[9] Renal function as a predictor of long-term graft survival in renal transplant patients
[10] Neural networks for predicting graft survival
[11] Prediction of kidney graft rejection using artificial neural network
[12] Factors affecting graft survival among patients receiving kidneys from live donors: a single-center experience
[13] An algorithm for cadaver kidney allocation based on a multivariate analysis of factors impacting on cadaver kidney graft survival and function
[14] Predicting kidney transplant survival using tree-based modeling
[15] Predictive modeling for organ transplantation outcomes
[16] Predicting the graft survival for heart-lung transplantation patients: an integrated data mining methodology
[17] Prediction of delayed renal allograft function using an artificial neural network
[18] Bayesian modeling of pretransplant variables accurately predicts kidney graft survival
[19] Predicting graft survival among kidney transplant recipients: a Bayesian decision support model
[20] A machine learning approach using survival statistics to predict graft survival in kidney transplant recipients: a multicenter cohort study
[21] Single and multiple time-point prediction models in kidney transplant outcomes
[22] A new methodology of extraction, optimization and application of crisp and fuzzy logical rules
[23] Predicting kidney graft survival using machine learning methods: prediction model development and feature significance analysis study
[24] Laparoscopic nephrectomy: Mansoura experience with 106 cases
[25] Feature selection based on information gain
[26] African buffalo optimization: a swarm-intelligence technique
[27] Nomogram that predicts graft survival probability following living-donor kidney transplant
[28] An intelligent hybrid technique of decision tree and genetic algorithm for E-mail spam detection
[29] Developing an efficient spectral clustering algorithm on large scale graphs in spark
[30] Explore the E-learning management system lower usage during COVID-19 pandemic
[31] Integration of computer vision and natural language processing in multimedia robotics application
[32] Accuracy enhancement scaling factor of Viola-Jones using genetic algorithms
[33] Arabic aspect based sentiment analysis using bidirectional GRU based models