key: cord-0562763-1lhuu841
authors: Taghiyeh, Sajjad; Lengacher, David C; Handfield, Robert B
title: Loss Rate Forecasting Framework Based on Macroeconomic Changes: Application to US Credit Card Industry
date: 2020-06-14
journal: nan
DOI: nan
sha: cab36c64c75e37016edcfe0ad6720ef921cf887b
doc_id: 562763
cord_uid: 1lhuu841

A major part of the balance sheets of the largest US banks consists of credit card portfolios. Hence, managing the charge-off rates is a vital task for the profitability of the credit card industry. Different macroeconomic conditions affect individuals' behavior in paying down their debts. In this paper, we propose an expert system for loss forecasting in the credit card industry using macroeconomic indicators. We select the indicators based on a thorough review of the literature and experts' opinions covering all aspects of the economy, consumer, business, and government sectors. The state of the art machine learning models are used to develop the proposed expert system framework. We develop two versions of the forecasting expert system, which utilize different approaches to select between the lags added to each indicator. Among 19 macroeconomic indicators that were used as the input, six were used in the model with optimal lags, and seven indicators were selected by the model using all lags. The features that were selected by each of these models covered all three sectors of the economy. Using the charge-off data for the top 100 US banks ranked by assets from the first quarter of 1985 to the second quarter of 2019, we achieve mean squared error values of 1.15E-03 and 1.04E-03 using the model with optimal lags and the model with all lags, respectively. The proposed expert system gives a holistic view of the economy to the practitioners in the credit card industry and helps them to see the impact of different macroeconomic conditions on their future loss.

Similar to any industry, the goal in the consumer credit industry is to maximize profits by measuring and controlling risk and avoiding exposure to default (also known as charge-off), as much as possible. The term charge-off means an outstanding credit card debt, which is written off as bad debt. Consumers must issue payments by the due date, and failure to do so will result in putting the consumer's account into delinquency or default. Typically, a bad credit card debt will be marked as charged-off after six months of non-payment, and it is withdrawn as an asset from the lender's accounts. This is usually a final action since it is an indication to lenders that the consumer will never pay off their account. Thus the account is written-off as bad debt. The charge-off rate for a given bank or issuer is calculated by dividing the dollar amount of charge-offs by average outstanding balances on credit cards issued by the firm. A higher charge-off rate exhibits a higher risk to a company. Usually, strategic business analysis is incorporated by credit card issuers to develop credit policy and guidelines with legal and regulatory constraints. Credit policy helps an institution develop strategies within the planned asset quality range that are consistent with the institution's profitability goals. Accurate prediction of charge-off rates has been one of the major challenging tasks in the credit card industry. The charge-off rate has shown a strong tie to economic conditions, and it has hit its highest level during the financial crisis, which was 10.79% according to U.S.

Federal Reserve data. Increasing the charge-off rates during the 2008 financial crisis led to the question of how we can predict the charge-off rate based on macroeconomic indicators under different economic conditions.

There has been extensive research on the relationship between charge-off risk and general economic climate, resulting in a general belief that macroeconomic factors directly affect bad debts and charge-offs. Historical data obtained from credit bureaus along with consumer performance data are analyzed by lenders to predict the future behavior of consumers and their risk of going delinquent or charging off. These predictive models classify consumers into different segments and align the bank's strategies towards these segments accordingly. The problem is that many businesses rely only on these models to make decisions, and fail to include certain economic factors into their risk models. Sometimes, to include economic conditions, these predictive models are adjusted by several percentage points in the charge-off rate using a fraction of macroeconomic indicators. However, most of the time, only a fraction of economic aspects are reviewed for these adjustments, as they are deemed to be the most influential.

Consumers' charge-off behavior can be heavily affected as the economy goes through good times (expansion phase) and bad times (the contraction phase), and they are not explicitly modeled in prediction models developed by credit risk management, which raises the question of how charge-off rate will change in different economic conditions. During economic expansion, consumers and businesses have enough income to pay their debts by their respective due dates, and thus this phase is associated with a small number of delinquencies and chargeoffs. On the other hand, in the contraction phase, the number of bad debts will increase, which eventually will lead to a significant jump in the charge-off rate. Credit card companies can be affected by economic factors, and including economic factors in the decision-making process may significantly impact their ability to make effective charge-off decisions proactively. Failing to incorporate economic factors may lead to consequences that may take years for the company to recover. Since many other factors such as government regulations are already reducing the profits of credit card business, there is a need for a new approach that incorporates the relationship between economic factors and charge-off.

Early credit card portfolio literature could not find conclusive evidence on the effects of macroeconomic factors on charge-offs over the business cycle. For example, personal bankruptcy and credit card delinquencies in the 1990s were investigated in Gross and Souleles (2002) , and authors concluded that the relationship between charge-offs and macroeconomic factors had changed substantially over the investigation period and there was not conclusive evidence to prove a relationship between charge-off rate and macroeconomic factors. They also concluded that the unemployment rate has no significant impact on the charge-off rate. They used panel data on credit card accounts for their analysis.

However, later in Agarwal and Liu (2003) , the authors stated that the unemployment rate has significant predictive power for the charge-off rate. They noted that the reason behind the fact that previous empirical studies could not find a consistent relationship between economic factors and bankruptcy is that those studies were either suffering from inadequate data or the variation in the unemployment rate was not sufficient during their analysis period.

Following the Great Recession in 2008, credit card companies focused heavily on controlling credit losses. Their emphasis is mostly on the unemployment rate, as it has a strong correlation with the charge-off rate. However, in the past few years, the unemployment rate was going down while the charge-off rate was increasing, and a model using unemployment rate as its only input may not be able to capture the uptrend in the charge-off rate. Hence, credit card companies need to focus on other economic factors that can affect charge-offs, and most importantly, they need to look at the economy as a whole. Analyzing the impact of variables from all segments of the economy will provide lenders with a holistic insight and will help them to make more effective decisions to reduce future losses.

There are limited cases in the body of literature that focus on charge-off prediction models incorporating macroeconomic variables in the United States.

The slope of U.S. Treasury bond yields over time was mentioned by Estrella and Hardouvelis (1991) and Estrella and Mishkin (1998) to have a strong relationship with output growth and recessions in the United States up to eight quarters in the future. Stock prices (Estrella and Mishkin, 1998) , credit market activity (Levanon et al., 2011) , index of leading economic indicator (Berge and Jordà, 2011; Stock and Watson, 2002) and several interest rates, housing indices and unemployment rate measures (Ng, 2014) as leading indicators for future economic conditions. Moreover, there are different views regarding the significance of specific economic factors. For instance, industrial production was found to be a significant predictor of corporate charge-offs by Figlewski et al. (2012) . However, research was done by Giesecke et al. (2011) has shown that it may not be an important factor in forecasting the charge-off rate. Stochastic optimization algorithms can also be used in financial industry to improve the efficiency of the algorithms (Taghiyeh and Xu, 2016) .

The author was motivated to perform this study when he started working as analytics intern at one of the leading credit card issuer companies in the United States. The models in production were using only unemployment rate as their input to forecast future values of the loss rate, which had an R-squared value of about 63%. Aside from the relatively low R-squared value, the charge-off rate was going up in the past couple of years, but unemployment rate was going down. Therefore, their model was unable to predict uptrend in charge-off rate and it was crucial to develop a new prediction model for the charge-off rate by incorporating macroeconomic factors from all aspects of the economy.

In this study, we aim to identify and analyze economic indicators that have a significant relationship with the charge-off rate in the credit card industry.

Next, we will use machine learning techniques, namely, linear regression with Lasso, linear regression with Ridge, random forest, and gradient boosting machine to develop a loss forecasting framework using selected macroeconomic indicators. Finally, using the model selection approach introduced in Taghiyeh et al. (2020) (MSIC algorithm), we will forecast each of the selected indicators to predict year over year changes. Nineteen macroeconomic indicators from three major economic categories will be used for this analysis. These economic categories include consumer, business, and government segments. The use of indicators from all segments gives a comprehensive view of the economic impact 5 on charge-offs. Credit card companies have recently identified the unemployment rate and housing indices as charge-off accelerators. These two metrics will be included in our analysis to confirm or deny their assumptions. The consumer confidence index is another factor that can be seen to have an impact on charge-off rates, as consumer behavior may change payment behavior when they are optimistic or pessimistic towards the future. However, this index is very volatile and may fluctuate each month as the report comes out (Censky, 2010) . Other macroeconomic indicators used in this research are new from a charge-off analysis standpoint. Charge-off data from the top 100 banks in the United States from 1985 to 2019 will be used in this study to confirm if the selected macroeconomic indicators have significant predictive power for the duration of the analysis. The design of a prediction model covering all aspects of the economy will add a significant value to a company. Executives and managers can incorporate this information into their decision process to anticipate any future credit losses and fluctuations.

The remainder of this paper is organized as follows. Section 2 reviews the literature on loss forecasting. Section 3 presents the details of our proposed loss forecasting framework based on macroeconomic indicators. Section 4 presents an empirical evaluation of the approach using loss data from the top 100 banks in the United States. We summarize our conclusions and discuss the practical implications of our work in section 5.

Credit card companies are in the business of lending money to consumers, but it is a very risky task as they are not certain if consumers will pay back their debt or make payments by due dates (Guseva and Rona-Tas, 2001) . To assess the probability of charge-off, credit card issuers usually use scoring models, which are mostly based on historical consumer performance gathered from any of the credit bureaus, such as Equifax, Experian, and Transunion. This information is used to develop account level models to evaluate the risk of a particular consumer and divide them into low-risk and high-risk segments. However, the account level model may put consumers in the low-risk segment, but eventually, they do not pay their debt and charge-off. What would have been the reason behind the fact that the low-risk consumers ended up being charge-off or delinquent? What are the non-credit factors that affected the payment ability of the consumers that were not taken into account in account level risk models? The answer to these questions may be uncertainty derived from underlying economic conditions, which need to be taken into account for by credit card companies when offering credit to consumers.

The main issue for credit card companies is when to take action and to what extent they need to tighten their credit offerings. If the right time is chosen to act, it may lead to a stable or even increasing revenue, and lenders will avoid unnecessary charge-offs and loss. Nevertheless, an ill-timed action will bring the company an increased loss and a steep shrinkage in revenue. Hence, credit card companies face credit loss challenges brought upon by strong and weak economic conditions. During the 1990s, people kept spending using credits and generated high balances on their credit cards, which lead to a significant loss due to nonpayment on debts when the recession hit in 2001 (Evans and Schmalensee, 2005) . It was the same situation for many credit companies during the 2008 great recession. One of the significant issues with recessions is an increase in the unemployment rate. This will greatly affect the ability of consumers to pay their unpaid debt and credit card charge-offs. If the charge-off trend is identified early in the phase, it will give credit card companies enough time to make the right decision and act promptly and avoid unnecessary losses or drop in revenue.

Evaluating the impact of the economy on the charge-off rate would help lenders to predict the trend and make effective decisions.

In the last recession, several banks started to tighten their credit offering criteria in the last quarter of 2007, but an aggressive action was not taken until the second half of 2008, in which the unemployment rate was already risen by 30%. Credit card issuers tightened the credit offerings by closing the accounts and reducing credit lines. However, the charge-off rate has hit its highest at 10% in the last quarter of 2009, and the credit card company's revenue had a steep shrinkage. Also, Many good customers were affected by this sudden reaction. At that time, it became clear that the unemployment rate and charge-off rate have a strong correlation, and many banks today continue to use the unemployment rate as one of the decision factors in their strategies. However, there are many other economic indicators that may help lenders to understand the future of the economy better and predict the charge-off trend.

To the best of our knowledge, there is only one research that studies the relationship between economic factors and the charge-off rate in the US economy (Liu and Xu, 2003) . In the empirical study by Liu and Xu (2003) , authors use step-wise regression and vector autoregression to identify economic factors which have predictive power regarding credit card charge-offs. The goal of their research was to develop a predictive model based on these variables. Authors concluded that the unemployment rate, consumer confidence index, household debt service burden, inflation rate, personal bankruptcy filings, and stock market returns are the variables that have a strong predictive power for the chargeoff rate. However, there are a few issues with their work that justifies a more recent and thorough work toward identifying economic variables to develop a predictive model for charge-off. The first issue is that their analysis is focused on the period of 1986-1998, and there were quite a few changes in both the credit card industry and economic conditions. Second, in Liu and Xu (2003) , authors only include seven economic variables in their analysis, and they are not covering all the aspects of the economy, namely government, business, and households, entirely.

There exist several studies on the relationship between charge-off and economic conditions. It was stated by Ausubel (1997) that in a generally healthy economy, in which unemployment is relatively low and gross domestic product is reasonably growing, both bankruptcy and charge-off rate increased. This statement was against the general belief that the charge-off rate will increase during bad economic times and decrease in good economic times, and has shown that other economic factors may contribute to charge-offs. Debt-to-disposable 8 income ratio was found by Stavins et al. (2000) to have a strong correlation with credit card charge-off and bankruptcy. The relationship between consumer debt burden and economic indicators was studied by Schmitt (2000) , and it was concluded that consumers' debt burden increases in the expansion phase of the economy. The author found personal consumption expenditure, durable goods, and retail sales to have the most predictive power toward consumer debt and the installment loan delinquency rate.

Since credit cards provide a more flexible way comparing to installment loans, the variables mentioned above may have a different impact on credit card charge-offs. The unemployment rate, consumer price index and the number of bankruptcy filings were deemed to be highly correlated with the charge-off rate in the case of Hong Kong (Fung and Wong, 2002) . The authors used the vector regression model as the basis for their analysis. Macroeconomic indicators were analyzed by Agarwal and Liu (2003) to investigate credit card delinquency. The authors conclude that macroeconomic fluctuations correlate with bankruptcy and delinquency rates. They also found that the unemployment rate has a strong effect on the rate of delinquency. In the analysis done by Musto and Souleles (2006) , the covariance of individual charge-off risk with aggregate charge-off rate was calculated, and it was found that a significant heterogeneity in the covariance of risk exists among consumers. They also stated that the credit line decreases when the covariance of risk increases. By applying portfolio theory to consumer lending, Desai et al. (2014) extend the work of Musto and Souleles (2006) . Authors use credit score along with charge-off and bankruptcy rates to evaluate the charge-off. Mian and Sufi (2011) investigate the relationship between household borrowing and house prices by analyzing account level datasets. The authors conclude that there is a significant relationship between these two variables, and housing prices and household debts can explain fluctuations in the economy. These authors also state that the effect of fluctuations in housing prices is not homogeneous across the population, and people with low credit scores, which highly leverage their credit, are more sensitive to these changes in housing prices.

A regime-switching model is used by Giesecke et al. (2011) to evaluate the predictive power of macroeconomic variables for charge-off rates. Changes in the gross domestic product, stock returns, and their volatility were identified as significant variables. Reduced-form Cox intensity models were fit by Figlewski et al. (2012) to analyze the relationship between a range of macroeconomic and firm-specific factors and charge-off and significant credit ratings. They found that both factor categories were significant, but macroeconomic variables were highly dependent on the inclusion of other factors. Using account-level data, Bellotti and Crook (2012) compare the performance of several loss forecasting models, including a decision tree and fractional logit transformation. The authors conclude that using macroeconomic variables in ordinary least square models will result in the best forecasting model. In an extension to their work, in Bellotti and Crook (2013) , a discrete time survival model was proposed to predict the probability of charge-off. They claim that using macroeconomic variables along with behavioral factors, the best fit will be obtained. Borrowers' characteristic was also found in Leow and Crook (2014) to impact charge-off and recovery behavior significantly. In the study of Rubaszek and Serwa (2014) , interest rate spread and income uncertainty were found to impact the amount of household credit using both theoretical and empirical models.

To evaluate the effect of FICO score, debt-to-income ratio, credit grade, and credit utilization on charge-off, logistic regression, and Cox proportional hazard models were used by Emekter et al. (2015) . The authors concluded that the probability of charge-off increases as debt age increases. A classification model based on the random forest for good and bad loans was proposed by Malekipirbazari and Aksakalli (2015) . The results were compared to the ones obtained from logistic regression, support vector machine, and K-nearest neighbor. However, none of the last two studies we mentioned used out of sample performance to evaluate their classification models. In a study performed by Guo et al. (2016) , cross-validation was used to evaluate out of sample performance for the credit assessment model of P2P loans. The relationship between the age of the borrower and the probability of charge-off in the US was inves-tigated by Debbaut et al. (2016) . The authors conclude that the probability of charge-off is lower in younger borrowers. Using macroeconomic indicators in Turkey, Mazibaş and Tuna (2017) analyze the reason behind recent fluctuations in household debt. In a study performed in Korea, Kim et al. (2017) use account-level credit data to show a positive relationship between the probability of delinquency and the amount of debt. A model with bankruptcy, delinquency, and renegotiation was proposed by Kovrijnykh and Livshits (2017) . Authors conclude that instead of taking charge-off as a binary event, one needs to look at it as a multiple-stage process.

As the literature review performed in this section suggests, the basis of this research is supported by scholars in the field. As we can see, most of the researches believe that macroeconomic factors affect lenders and financial institutions, and by studying the effects of macroeconomic indicators, we can have a better perception of future lending risks. It is essential for credit card companies to incorporate macroeconomic indicators in their risk models to predict future risks and operate effectively in both the expansion and contraction phases of the economy. This way, they can avoid any unnecessary loss in their portfolio due to a lack of perspective toward economic conditions. Several economic factors were studied in previous researches regarding the charge-off rate. However, in this study, we will cover more economic indicators that encompass all segments of the economy, namely households, government, and business segments.

Credit card issuers suffer from unexpected charge-offs due to lack of insight from economic conditions, and a charge-off prediction model which is based on macro-economy data will help managers to make effective business and strategic decisions. This research aims to fill this gap and find the economic indicators with the most significant power to predict future charge-off rates and will use these indicators to build a loss forecasting model for predicting charge-off rate using machine learning models.

In this section, we will use machine learning tools to develop the loss forecasting framework. First, we will explain the important trade-off between interpretabiliy and accuracy that is a hot topic when it comes to using machine learning models, and we will discuss the reason behind the selection of machine learning models in our proposed loss forecasting framework.

Based on the literature review, several macroeconomic indicators that were likely to have correlations with the charge-off rate were selected. Among the selected macroeconomic indicators, 19 indicators were selected by the experts in the credit card industry to form the basis of this research. The goal is to use these indicators as independent variables in a machine learning based model to predict the charge-off rate, which is our dependent variable. In the first step, we apply different transformations (e.g., square root, exponential, ...) to normalize the selected indicators and find the transformation with the highest correlation to dependent variables. We also add lags from 1 to 4 quarters to each indicator and find the correlation of each of the lagged indicators with charge-off rate.

This way, we incorporate the lagged effects of each macroeconomic indicator. The next step in data preparation is to convert all indicators and charge-off rate to year over year changes. To do so, for each indicator, we record the percentage of change comparing to the corresponding period in the last year. This way, instead of using the actual values for macroeconomic indicators to predict the charge-off rate, we build a model that uses the changes in each indicator to forecast the change in charge-off rate.

After we have generated our input data, we will use two versions of Lasso regression (Lasso with optimal lags and Lasso with all lags) to select the features with the most significant correlation to our output data. The difference between these two feature selection methods lies in the approach we use to generate their input. In the first feature selection model (Lasso with optimal lags), for each indicator, we select the lag, which has the highest correlation with the chargeoff rate. Therefore, the model has 19 independent variables corresponding to optimal lags for each of the selected macroeconomic indicators. In the second approach, which is Lasso with all lags, we include all the lags in the input data and let the model select between lags. Note that, in the second feature selection method, we let the model choose more than one lag from each indicator. In doing so, the model can capture the trends for the year-over-year changes of each macroeconomic indicator.

We use the indicators selected by each of the feature selection methods as the input to train machine learning models and capture the relationship between the selected macroeconomic indicators and the charge-off rate. As mentioned earlier, the benchmark machine learning models in this study are Lasso regression, Ridge regression, gradient boosting machine (GBM), and random forest (RF). As it is common in the machine learning field, we split the data into training and test sets to train and evaluate the performance of each machine learning model. Two sets of machine learning models need to be developed since we have two versions of input data resulted from different feature selection approaches.

The last piece of building the loss forecasting framework is to predict future values for each of the selected macroeconomic indicators and use the trained machine learning model to predict future charge-off levels. To predict each macroeconomic indicator, seven well-known forecasting models have been used, namely, naive forecasting, moving average, simple exponential smoothing, Holt, Holt-Winters, ARIMA, and Theta. These models are selected among the models considered in the forecasting competitions, such as M3-Competition. Three variants of the MSIC algorithm proposed in Taghiyeh et al. (2020) are used to select the best performing forecasting model for each macroeconomic indicator.

Using the results from the forecasting model selected by the MSIC algorithm, the trained machine learning models are then used to predict the future values of the charge-off rate. Figure 1 shows the steps of the loss forecasting framework proposed in this study. The details of our proposed Loss rate forecasting framework is outlined in the following subsection. 

Let n be the number of macroeconomic indicators that are selected to build the loss forecasting framework and let m be the number of machine learning models used to predict the loss rate. Feature selection method can be set to either "Optimal lags" or "All lags". -Step 1-2: Add lags from 1 to 4 quarters to indicator i and record the lagged indicator.

-Step 1-3: Try different transformations (e.g., square root, exponential, square, log, etc.) for each lagged indicator and select the best one based on a goodness of fit statistic.

-Step 1-4: For each lagged indicator, add the selected transformation in step 1-3 to I 1 .

-Step 1-5: Convert all the lagged indicators to year over year values by dividing them by the corresponding values from last year.

-Step 1-6: If i = n, go to step 2. Else, set i = i + 1 and go to step 1-1.

• Step 2 (feature selection): If "Optimal lags" is selected for feature selection, go to step 2-1. Else, if "All lags" is selected go to step 2-2.

-Step 2-1 (feature selection with optimal lags): Initialize the input data for feature selection as I 2 = {} and list of final selected features as F . Use loss rate as dependent variable. * Step 2-1-1: For each macroeconomic indicator i (i = 1, ..., n), select the lag with the highest correlation with loss rate from I 1 and append it to I 2 . * Step 2-1-2: Apply Lasso regression using the input data I 2 and loss rate. Use hyperparameter optimization to achieve the best fit in terms of R 2 . * Step 2-1-3: Record the feature importance for each input feature from the Lasso regression model. * Step 2-1-4: Add features with feature importance greater than 0.2 to the list of selected features (F ). -Step 3-1: Use input values selected from the feature selection step (F ) as independent variables and loss rate as the dependent variable.

-Step 3-2: Split the data into training and test set.

-Step 3-3: Train model j on the training set and test it on the test set. Record R 2 for the training set and MSE for both training and test sets. Use hyperparameter optimization to achieve the best fit. In the next section, we will apply the proposed loss forecasting model on the loss rate data from the top 100 banks in the US from 1985 to 2019.

In this section, we will test the proposed loss forecasting framework on the Charge-off rate data from the first quarter of 1985 to the second quarter of 2019.

This data is retrieved from the "Board of Governors of the Federal Reserve System (US)" (Board of Governors of the Federal Reserve System (U.S.), 2020) database and is an aggregated charge-off report for the top 100 US banks ranked by assets. As we mentioned in the introduction section, this study was originally motivated while the author was working for one the leading credit card issuers in the U.S, and a part of this work was originally developed in that company.

However, due to the confidentiality issues, all the data related to the company is omitted in this research and the equivalent publicly accessed datasets were being used as the basis for the numerical experiments.

To select the macroeconomic indicators for this study, initially the "Principles for navigating the big debt crises" by Ray Dalio (Dalio, 2018) was reviewed and the macroeconomic indicators which were mentioned in the book that had a significant correlation with debt, charge-off rate and economic cycles were selected. Several additional macroeconomic indicators were also added to the list using the research articles reviewed in the literature review section. This list was provided to the experts in the leading credit card company, including a senior manager and a director from credit risk assessment department. These experts provided their feedback on these indicators and selected 19 indicators that they believed are the ones having the most significant relationship with the charge-off rate and cover all aspects of the economy, while having the smallest overlap to reduce the risk of overfitting. The list of selected macroeconomic indicators is shown in table 1. Please refer to Table A .1 in the appendix for the list of references corresponding to each macroeconomic indicator.

against charge-off rate. Since each indicator has a different unit, we used the vertical axis to show the charge-off rate, and other indicators were scaled to makes us able to compare their trend against the charge-off rate. The shadowed regions show the US recession periods from 1985 to 2019. As figure 2 shows, "unemployment rate" and "initial unemployment insurance claims" have very similar trends to loss rate. That may be the reason that these two indicators are mostly used in the credit card industry to predict the charge-off rate. However, if we look at the values of loss rate, "unemployment rate", and "initial unemployment insurance claims" from the second quarter of 2018 to second quarter of 2019, we see that "unemployment rate" and "initial unemployment insurance claims" are decreasing, but the loss rate has an increasing trend. Hence, there is no way to predict the loss rate in this period by solely using the "un- University of Michigan Sentiment Index employment rate" and "initial unemployment insurance claims" as independent variables. Using macroeconomic indicators from all segments of the economy is one of the advantages of our proposed loss forecasting model, which will make it able to capture the uptrend even when the "unemployment rate" and "initial unemployment insurance claims" are decreasing.

Another thing interesting fact is that "M1" and "M2" in table 5 have significantly different trends before and after the great recession in 2008. As can be seen, the government started printing money in the great recession to add stimulation to the economy and overcome the recession. However, if we look at the trends, they printed money with a significantly higher rate after the great recession, which may be a negative factor for the economy and could play an important role in our loss forecasting framework when we train the model.

Moreover, "building permits" and "housing starts" have a very similar trend in figure 2, and to avoid overfitting, only one of them needs to be selected for building a prediction model. The same is true for "CCI" and "UM consumer sentiment index" in figure 2, "Dow Jones industrial average" and "S&P 500 index" in figure 3, and "yield (10 year minus 3 month" and "yield (10 year minus federal fund rate" inf figure 5. These collinearities will be handled by the feature selection step of our proposed loss forecasting model. We will show the step by step implementation of our proposed loss forecasting framework in the following subsections.

All the macroeconomic indicators are converted to quarterly values, and the lagged values are recorded (1 to 4 quarters). Hence, for each macroeconomic indicator, we have five columns of input data, and in total, we have 95 input columns for 19 macroeconomic indicators in this study. "bestNormalize" package in R (Peterson, 2017) is used for the normalization of each lagged input.

The function "bestNormalize" in the aforementioned package performs several normalization transformations, including the Box-Cox transformation, the Yeo-Johnson transformation, the square-root transformation, log transformation, 

Using the input data obtained from the data preparation step, we start performing the two versions of our feature selection procedures, "feature selection with optimal lags" and "feature selection with all lags".

To use the feature selection with optimal lags, we first need to find the optimal lag from the input data generated in step 1. We calculated the correlations of lagged values for each macroeconomic indicator and selected the lag with the highest correlation for each one. The results are shown in table 2. As we can see, "initial unemployment insurance claims" and "unemployment rate" have the highest correlations with the loss rate, which is in line with what we have already seen in figure 2. Now we perform Lasso regression on these optimal lags to remove collinearity between variables and select the most significant features among the indicators list in table 2. We used the feature importance list from the results of Lasso regression and selected the indicators with the importance values greater than 0.2. The results for this feature selection procedure is shown in table 3.

As we can see in table 3, only six macroeconomic indicators among the initial 19 indicators are selected using feature selection with optimal lags. These macroeconomic indicators are "buliding permits", "initial unemployment insurance claims", "M1", "PMI", "Weekly hours worked by manufacturing workers", and "unemployment rate". If we look at table 1, we can see the interesting result that these indicators cover all the segments mentioned in the table. "Building permits", "Initial unemployment insurance claims" and "unemployment rate" are from the consumer segment. "PMI" and "weekly hours worked by manufacturing workers" are from the business segment, and "M1" covers the government segment of the economy. The fact that our feature selection procedure selected indicators from all segments of the economy suggests that a holistic view of the economy is a requirement to build an effective loss forecasting framework. Additionally, we can see that "M1" is selected as a significant factor, and is in line with what we already suspected as the trend of "M1" is changed significantly after the great recession.

As opposed to the feature selection with optimal lags, in this version of feature selection, we do not select the optimal lags manually. We feed all the 

The features selected by each of our feature selection procedures will be used as input to our machine learning models. The benchmark machine learning models that we use in this study are Lasso regression, Ridge regression, gradient boosting machine, and random forest. We use the data from the first quarter of 2011 to the second quarter of 2019 as the test set and develop two sets of results corresponding to each of our feature selection procedures. We report R 2 for the training set and Mean Squared Error (MSE) for both training and test sets. The results using the output of "feature selection with optimal lags" are reported in table 5. The corresponding plots for the fit of each machine learning model are shown in figure 6 . Comparing the values of R 2 in table 5, we see that the gradient boosting machine shows a better performance in terms of R 2 , which means that 77% of variations in the loss rate can be explained by the gradient boosting method using optimal lags. The values of MSE in training and test sets are also in line with our conclusion, and the gradient boosting machine shows the best performance in terms of MSE on both training and 26 test sets. Hence, the gradient boosting machine is selected for making the final prediction in the next step when we use Lasso with optimal lags as our model selection procedure. Table 7 shows the statistics corresponding to the result of each machine learning method when using feature selection with all lags. The final fit for each method is depicted in figure 7 . The R 2 results in table 7 suggest that both Lasso and Ridge regression have similar performance. However, looking at the values of MSE in the training and test sets, we see that Ridge regression has a better performance on both train and validation sets. Hence, we select Ridge regression for generating the final forecasts when we use the output of feature selection with all lags as the input of the machine learning model.

As figures 6 and 7 show, the uptrend of the loss rate in the last four quarters can be captured by all the benchmark models using selected features, which is not possible when the unemployment rate is the only decision variable. Addi- Figure 6 : Final fits for machine learning models using optimal lags as input variables tionally, we can see that all the models are able to capture the trends of loss rate with an acceptable accuracy, which shows that our loss forecasting method can provide acceptable results using any of the benchmark machine learning models. We use the selected machine learning model in this step to generate final forecasts in the next step of our loss forecasting algorithm, which is explained in the next subsection. 

The last step to build the loss forecasting framework is to predict each macroeconomic indicator and use the trained model in step 3 to predict the The results suggest the same trend as numerical results in Taghiyeh et al. (2020) , as the MSIC algorithm shows a constant improvement in the optimality gap in all instances. Additionally, there is not a single winner among classifiers for the MSIC algorithm, and it is case dependent. As the overall performance in figure 15 shows, we can get an overall minimum of 60% improvement in optimality gap improvement using the MSIC algorithm over the traditional train/validation model selection procedure. Now that the forecasting models are selected for each macroeconomic indicator, and the predictions are made, we use the forecast values as input to the trained models in step 3. Gradient boosting machine was selected as the best performing machine learning model using feature selection with optimal lags, and ridge regression was the winner when using feature selection with all lags. proposed loss forecasting framework that shows the efficiency of the algorithm.

Moreover, looking at figure 16, we see that both variants of our loss forecasting model can closely predict the loss rate values, and it is able to capture the uptrend of the loss rate, which is not possible when using only unemployment rate as the decision variable. Overall, we see that ridge regression with all lags can obtain better results than gradient boosting with optimal lags. The reason is that in the feature selection with all lags, the lags are selected automatically by the model, and we allow the model to use more than one lag from each indicator. This way, more data is available to make predictions. Hence, the Ridge regression with all lags can perform better than gradient boosting with optimal lags. Moreover, Ridge regression is from the family of monotonic and linear machine learning models, which makes it highly interpretable and the assigned coefficients are available for each input variable. not. While correlation and causation may exist at the same time, but existence of a correlation does not necessarily implies causation. Causation applies in the situations that an action explicitly triggers another action, but correlation simply implies a relationship. When a correlation exists between two actions, it means that they are related to each other, but it does not necessarily mean that any of them cause the other one. We use the example from the book "Introduction to statistical learning" by James et al. (2013) to explain this issue.

Suppose that we are evaluating the correlation between the sales of an ice cream vendor in a beach with number of shark attacks. Interestingly, they have a high correlation, but it does not mean that selling ice cream on the beach causes more shark attacks or vice versa. However, when the weather is hot, people are more attracted to the beaches and consequently, the number of ice cream sales increases. When there are more people on the beach, there is a higher chance of a shark attack and the higher temperature is actually the cause of attracting more people to the beach which results in more shark attacks. This example illustrates the difference between causation and correlation. Our main focus in this research was on correlation rather than causality. The question regarding whether the final significant indicators that were selected to build the model are actually causing the chain of events that leads to the changes in the charge-off rate is left to the experts in the credit card industry and economists.

In this paper, we have proposed a machine learning based loss forecasting framework for the credit card industry using macroeconomic indicators. Our goal was to cover macroeconomic indicators from all segments of the economy to make predictions based on a holistic view of the economic conditions. Using the review of the literature and experts' opinion, we selected 19 macroeconomic indicators, which cover consumer, business, and government sections of the economy as input to the proposed loss forecasting framework. The proposed procedure consists of four steps, data preparation, feature selection, model training, and forecasting. We used four machine learning models, namely Lasso regression, Ridge regression, gradient boosting machine, and random forest to develop two versions of the loss forecasting framework. The difference between these two versions is in the utilization of lags from input data. We also applied the proposed model selection procedure in Taghiyeh banks in the US ranked by assets from 1985 to 2019, and the data corresponding to selected macroeconomic indicators. We applied the proposed loss forecasting framework on the data, and the final results were very promising. We could achieve the test MSE of 1.15E-03 and 1.04E-03 corresponding to feature selection with optimal lags and feature selection with all lags, respectively, which shows the effectiveness of the proposed algorithm in forecasting the loss rate.

The final fit for the prediction period shows that we could closely predict the actual values of the loss rate and the uptrend of the loss rate could be captured by our proposed model, which was not possible in the conventional version of the credit card loss forecasting frameworks that only use the unemployment rate as the decision variable.

In the future, we aim to further improve the proposed loss forecasting model in this paper by adding more machine learning models, such as deep neural networks, long-short term memory (LSTM) model, and extreme gradient boosting to the benchmark models and see if we can make more accurate forecasts. Additionally, more feature selection procedures can be explored to improve the feature selection step of the loss forecasting framework. The other future line of research would be to perform a more exhaustive number of transformation for the macroeconomic indicators to see if a better data transformation can be found to improve the efficiency of the algorithm further, as the final results are sensitive to these transformations. Another interesting future research path is to analyze the credit card charge-off rates due to the rapid changes in the economy caused by the Coronavirus pandemic and adjust the model accordingly.

Determinants of credit card delinquency and bankruptcy: Macroeconomic factors

Credit card defaults, credit card profits, and bankruptcy

Loss given default models incorporating macroeconomic variables for credit cards

Forecasting and stress testing credit card default using dynamic models

Evaluating the classification of economic activity into recessions and expansions

Charge-off rate on credit card loans

Consumer confidence slumps in september

Principles for navigating big debt crises

The card act and young borrowers: The effects and the affected

On the county-level credit outcome beta

Evaluating credit risk and loan performance in online peer-to-peer (p2p) lending

The term structure as a predictor of real economic activity

Predicting us recessions: Financial variables as leading indicators

Paying with plastic: the digital revolution in buying and borrowing

Modeling the effect of macroeconomic factors on corporate default and credit rating transitions

Modeling credit card charge-off ratios: The case of hong kong

Corporate bond default risk: A 150-year perspective

An empirical analysis of personal bankruptcy and delinquency

Instance-based credit risk assessment for investment decisions in p2p lending

Uncertainty, risk, and trust: Russian and american credit card markets compared

An introduction to statistical learning

Additional credit for liquidity-constrained individuals: High-interest consumer credit in korea. Emerging Markets Finance and Trade

Screening as a unified theory of delinquency, renegotiation, and bankruptcy

Intensity models and transition probabilities for credit card loan delinquencies

Using a leading credit index to predict turning points in the us business cycle

The predictive power of economic indicators in consumer credit risk management

Risk assessment in social lending via random forests

Understanding the recent growth in consumer loans and credit cards in emerging markets: Evidence from turkey. Emerging Markets Finance and Trade

House prices, home equity-based borrowing, and the us household leverage crisis

A portfolio view of consumer credit

Boosting recessions

bestnormalize: A suite of normalizing transformations

Determinants of credit to households: An approach using the life-cycle model

Does rising consumer debt signal future recessions?: Testing the causal relationship between consumer debt and the economy

Credit card borrowing, delinquency, and personal bankruptcy

Forecasting using principal components from a large number of predictors

Forecasting model selection using intermediate classification: Application to monarchfx corporation

A new particle swarm optimization algorithm for noisy optimization problems