key: cord-0689227-hv1um2ln
authors: Zhu, Shushang; Zhu, Wei; Pei, Xi; Cui, Xueting
title: Hedging Crash Risk in Optimal Portfolio Selection
date: 2020-07-28
journal: J Bank Financ
DOI: 10.1016/j.jbankfin.2020.105905
sha: b30735dc72f94126c0bd5b9739e24bea4d17d6a3
doc_id: 689227
cord_uid: hv1um2ln

When almost all underlying assets suddenly lose a certain part of their nominal value in a market crash, the diversification effect of portfolios in a normal market condition no longer works. We integrate the crash risk into portfolio management and investigate performance measures, hedging and optimization of portfolio selection involving derivatives. A suitable convex conic programming framework based on parametric approximation method is proposed to make the problem a tractable one. Simulation analysis and empirical study are performed to test the proposed approach.

In this paper, we propose an approach for optimal portfolio choice, taking into account a crash, that combines the ideas of hedge and measure of crash risk. Our strategy and research roadmap are illustrated by Figure 1 . We propose a general nonlinear portfolio optimization model involving both, normal risk and crash risk, which can deal with risk in a more flexible way. The crash risk is hedged by derivatives and the model is formulated as a tractable one via parametric approximation. We then demonstrate that even if the return of derivative is usually asymmetric, the proposed model is reasonable when the portfolio is relatively diversified. We further derive an efficient convex programming approach to solve this general nonlinear portfolio problem.

A common belief in financial literature is that asset returns are distributed with heavy tails. Consequently, several tail risk measures have been proposed. VaR (see Jorion (2007) ), Conditional VaR (CVaR) (see Rockafellar and Uryasev (2000) ), Lower Partial Moments (LPM) (see Fishburn (1977) ) are among the most popular ones. Time-varying tail risks defined as the average logarithmic shortage with respect to a prescribed threshold are discussed by Kelly and Jiang (2014) and Faias and A. Zambrano (2017) . However, with the exception of the extreme returns associated with market crashes or booms, we find that the returns closely follow normal distributions. Therefore, estimating risk using traditional risk measures while falling to distinguish return data sets between normal situations and crash situations, we will overestimate risk in a normal market and underestimate it in a crash situation. Thus, measuring risk under normal versus extreme conditions separately is critical in portfolio decision-making. Wilmott (2007) introduced a risk measuring system called CrashMetrics to deal with risk problems during a crash. It parallels RiskMetrics (J. P. Morgan (1996) ) for market risk and CreditMetrics (J. P. Morgan (1997) ) for credit risk in normal market conditions. Whereas these latter systems based on VaR work well in normal market conditions, CrashMetrics addresses risk management in extreme market conditions.

In CrashMetrics (Wilmott (2007) ), the portfolio risk under crash conditions, i.e., crash risk, is defined as the worst-case realized return of the portfolio where uncertain asset returns are modeled as a suitable set containing all possible returns. This measure of crash risk makes sense since it is approximately the loss of the portfolio in the situation where all the assets fall more or less simultaneously.

Managing crash risk is an issue even more important than how to measure it. Since diversification does not work in a crash, hedging using options and/or other derivatives is a natural choice (see, e.g., Wilmott (2007) ; Hull (2009) ). Although much literature has investigated the hedging of portfolio risk using derivatives, there is relatively little research on portfolio optimization involving derivatives. Isakov and Morard (2001) and Liang et al. (2008) discuss the mean-variance portfolio selection using options. investigate the Conditional Value-at-Risk (CVaR) model based portfolio optimization problem considering only derivatives. Also based on CVaR risk measure, a stochastic portfolio model incorporating options is studied by Topaloglou et al. (2011) , and the model is solved by its corresponding deterministic linear programming form with scenario generation techniques. Cui et al. (2013) propose a general hedged portfolio optimization approach based on risk measure calculated by the approximate parametric VaR. Zymler et al. (2013) use a robust optimization approach to investigate the VaR based derivative portfolio optimization problem when the required complete distribution information is unavailable. Faias and Santa-Clara (2017) note that the option returns are asymmetrically distributed and propose a utility maximization framework for the portfolio selection involving European options held to maturity. They conclude that the effective performance of the approach is mostly obtained by exploiting mispricing between options.

Research on portfolio optimization tends to focus on normal risk even for those on the derivative portfolio optimization. There seems to be no research directly addressing the issue of portfolio optimization using derivatives to hedge the disaster caused by a market crash.

The rest of the paper is organized as follows. In Section 2, we propose a tractable formulation of the hedged portfolio with crash risk control. In Section 3, we conduct a simulation analysis and an empirical study to test the proposed approach. Concluding remarks are given in Section 4.

In this section, we discuss parametric approximation of the value change of a hedged portfolio and investigate the problem of measuring risk of a hedged portfolio in normal market conditions and in a crash, respectively. We then propose a tractable convex conic programming approach to solve the hedged portfolio optimization problem with crash risk control.

We suppose there are m risky underlying assets and based on this there are n derivative assets.

Denote d t = (d t 1 , d t 2 , · · · , d t n ) and u t = (u t 1 , u t 2 , · · · , u t m ) as the value vectors of the derivative assets and their underlying assets at time t, respectively. Let x = (x 1 , x 2 , · · · , x n ) ∈ R n and y = (y 1 , y 2 , · · · , y m ) ∈ R m denote the vector of holding amount of derivative assets and their underlying assets, respectively. Then the value change of portfolio (x, y) over time period [0, ∆t] , the investment period we consider, can be expressed as

where ∆d i and ∆u i denote the value change of derivative i and its underlying asset over time period [0, ∆t], respectively.

Generally, the price of a derivative is a function of factors, such as the prices and volatilities of the underlying assets, the risk-free interest rate and the time interval. We reasonably assume that derivative price is a sufficiently smooth function of some corresponding factors. Recall some definitions of "Greeks" (see, e.g., Glasserman (2004) ; Hull (2009)):

where we omit the notation of time t for simplicity when it is unnecessary. Then we can reasonably approximate the value change of derivative i over time period [0, ∆t] by Taylor expansion using Greeks as follows (see, e.g., Glasserman (2004) ; Hull (2009)):

where the Greeks are all calculated at time 0. Of course, we can use additional terms of Taylor expansion with respect to volatility of underlying asset and/or risk-free interest rate to produce a more accurate approximation, but the most influential factor is the price of the underlying asset. Approximation (2) could be accurate enough (see Cui et al. (2013) and the reference therein).

Combining (1) and (2), we can approximate the value change of portfolio (x, y) as

x i θ i . In the sequel, the value change of portfolio (x, y) is defined by (3).

Remark 1 Since the "more accurate approximation" of v(x, y) with additional terms associated with volatility of underlying asset or/and risk-free interest rate remains a quadratic form in u and a linear form in x and y, just as same as approximation (3), the problem reformulation done in the following according to approximation (3) can be similarly applied to "more accurate approximation". That is to say, the following derivation does not lose its generality in deriving a "more accurate approach".

Because of the unusual nature of a crash, traditional risk measures are not suitable for portfolio management. In this subsection, we discuss the measure and calculation of hedged portfolio risk in a normal market and in a crash, respectively. Figure 2 , an example of a jointed distribution of value changes of two assets, illustrates the intuitive idea of distinguishing these two types of measures of risk.

One thing deserving special mention is that a market boom can also cause large losses in a portfolio with a large short position. Actually, a portfolio involving derivatives, such as the butterfly strategy, may encounter large loss even when there is no crash or boom. Thus the concept of "crash risk" in this paper is best interpreted in the broader sense according to

CrashMetrics (Wilmott (2007) ): the worst-case loss, not only the loss occurred in a real market crash.

In a normal market, there are no extreme asset price movements and therefore the value changes of underlying assets can be effectively modeled with a normal distribution. In such a market, we can assume that the value change vector ∆u follows a multi-normal distribution, i.e.,

where µ and Σ denote the mean vector and the covariance matrix, respectively. This assumption will be verified by empirical tests in Section 3. The following useful results can be obtained on calculation of mean and variance of the value change of hedged portfolio ∆v(x, y).

Proposition 1 Suppose ∆u ∼ N (µ, Σ) and Σ is nonsingular. Then the mean and variance of ∆v(x, y) are given as

Here, tr(·) denotes the trace of a matrix and I denotes the m-dimensional identity matrix.

With the results of Proposition 1, we can analyze portfolios under the mean-variance framework. Although variance might be the most popular risk measure adopted in portfolio management, it is only regarded as suitable for symmetrical return distributions. Usually return of a derivative such as an option is asymmetrical. Whether variance is a suitable risk measure for hedged portfolio with derivatives will be addressed in the following.

Denote ξ = Σ −1/2 ∆u, where Σ −1/2 is the inverse of Σ 1/2 satisfying Σ = Σ 1/2 Σ 1/2 . Recall that ∆u ∼ N (µ, Σ), which implies ξ ∼ N Σ −1/2 µ, I , then we can reformulate (3) as:

∆v(x, y) = δ Σ 1/2 ξ + 1 2 ξ Σ 1/2 ΓΣ 1/2 ξ + θ∆t.

Defining A = Σ 1 2 ΓΣ 1 2 , we can decompose the symmetrical matrix A as A = C ΛC, where Λ is the diagonal matrix constructed by the eigenvalues λ 1 , · · · , λ m of A, and C is an orthogonal matrix consisting of the corresponding eigenvectors.

Denoting η = Cξ, we have η ∼ N CΣ −1/2 µ, I . We can then express the change of portfolio value as ∆v(x, y) = c η + 1 2 η Λη + θ∆t where c = (c 1 , . . . , c m ) = CΣ 1/2 δ. And by assuming λ 1 , . . . , λ h = 0 and λ h+1 , . . . , λ m = 0, h ≤ m without loss of generality, we can have

Then the first part of the right side of above equation can be written as

From the above discussion, it can be seen that ∆v(x, y) is actually the sum of a linear combination of some independent χ 2 1 random variables, an independent normal random variable and a constant. Now to clarify the normality of ∆v(x, y), we only need to verify the normality of a linear combination of independent χ 2 1 random variables. We can prove the following result:

Proposition 2 Suppose z 1 , . . . , z n are independent normal random variables with means ζ i , i = 1, . . . n and unit variances, i.e.,

where E(Z n ) and V (Z n ) denote the mean and variance of Z n , and the symbol " " means convergence in distribution.

Proof. See Appendix B.

For a clear understanding of the condition of Proposition 2, let's consider the special case with λ i = λ and ζ i = 0 for i = 1, · · · , n. The condition holds for this special case since

Essentially, except for the assumption of normal distribution on the returns of underlying assets, the condition of Proposition 2 indicates that there are two key points for Z n = n i=1 λ i z 2 i to be close to a normal distribution: Sufficiently large n and no dominant λ i . Intuitively, these two points can be satisfied for a sufficiently diversified portfolio. In plain language, Proposition 2 says that the return of a sufficiently diversified hedged portfolio is close to a normal distribution if the returns of underlying assets are normally distributed.

Now assume that ∆v(x, y) is normally distributed. Then several other well-known risk measures, such as MAD (Mean-absolute Deviation, see Konno and Yamazaki (1991) ), VaR (see Jorion (2007) ), CVaR (see Rockafellar and Uryasev (2000) ), can be uniformly defined as follows:

ρ risk (x, y) τ 1 µ(x, y) + τ 2 σ 2 (x, y)

where τ 1 and τ 2 are two constant parameters independent of decision variables x and y. Thus, portfolio optimization using these risk measures is equivalent to that using variance as a risk measure (Some mild condition is needed for VaR and CVaR model, see Rockafellar and Uryasev (2000) ). Artzner et al. (1999) proposed four axioms to qualify risk measures and called any risk measure satisfying these four axioms a coherent risk measure. Of the four popular risk measures mentioned above, only CVaR qualifies as a coherent risk measure. Fortunately, in the situation of a normal distribution, portfolio optimization under a return-risk framework with risk measures other than variance is almost equivalent to that under the mean-variance framework.

We can conclude from the above discussion that, under a normal market condition, variance is a suitable risk measure for diversified hedged portfolios. But downside risk measures, such as VaR and CVaR, are better choices for non-diversified hedged portfolios even in a normal market. While could include downside risk measures in the model, we will focus on the mean-variance formulation as it is sufficient for diversified hedged portfolios.

In this paper, a crash is identified as the situation where almost all asset prices fall suddenly.

In such a situation, almost all asset returns are corrected perfectly and the change in portfolio value can not be well modeled as a random variable. Since a risk measure is usually defined as a moment (e.g., variance and MAD), a quantile (e.g., VaR) or a quantile-based moment (e.g., CVaR) of the random return/loss, it is not suitable for measuring crash risk.

When a crash happens, loss of a portfolio is almost an inevitable and can be interpreted, to a large extent, as the realization of portfolio return in the worst-case market condition. Noting its speciality, we following Wilmott (2007) and define the crash risk measure as

where U is the set that contains all possible realizations of ∆u in an extreme market condition.

Intuitively, crash risk is measured by the worst-case realization of value change of a portfolio.

Actually, the above idea used to measure risk is not new. For example, it was employed for portfolio management by Young (1998) , where the worst-case historical return is adopted as risk measure. Although, worst-case return is too conservative to quantify risk under a normal market condition, it is obviously a proper risk measure for counting loss in a crash.

The crucial issue left for computing crash risk is the determination of set U. As suggested by Wilmott (2007) , a suitable choice of U is an ellipsoid defined as

where · denotes the Euclidean norm, Λ 1 is an invertible matrix that scales the ellipsoid with respect to ∆u and Λ −1 1 ∆u 0 is the ellipsoid center.

Since factor model is widely used in practical applications, computing crash risk according to a factor model will facilitate a realworld application. Generally, a factor model is given as:

where f j 's are factors driving the changes of asset values, β i0 is a constant, β ij 's are coefficients representing the sensitivities to factors, and ε i is the random residual error with zero mean.

We assume that residual errors are independent with each others and also independent of all the factors, which implies Cov(ε i , ε j ) = 0 for i = j and Cov(ε i , f j ) = 0 for all j = 1, · · · , l.

Since ε i 's are the independent random residual errors with zero means, they shall have a small impact on the value of sufficient diversified portfolios. We can reasonably measure the crash risk according to factors f = (f 1 , · · · , f l ) as

where ∆ṽ(x, y) is calculated according to (3) and (11) by omitting ε i 's andŨ is defined as

Here, Λ 2 is an invertible matrix that scales the ellipsoid with respect to f and Λ −1 2 f 0 is the ellipsoid center.

Another advantage of using (12) as a measure of crash risk is that it can reduce the dimensionality of the problem. Since the dimension of ∆u is usually high for a sufficient diversified portfolio, it is time-consuming to solve the corresponding optimization problem using crash risk defined by (9). The dimension of f is usually much lower than that of ∆u, hence formulating the problem with crash risk defined by (12) can significantly improve the computational efficiency.

Generally, how to choose Λ 1 /Λ 2 and ∆u 0 /f 0 should be problem-oriented. In this paper, we suggest an optimization approach to determine these parameters. More specifically, we can determine them by solving an ellipsoid covering problem, which can be equivalently formulated as a semidefinite program. The details will be discussed in empirical test in Section 3.

As we have noted above, crash risk is completely different from the normal risk. We propose a portfolio analysis framework that takes crash risk into account as this can help investor avoid large losses. Such an analysis can be used to deal with portfolio risk more flexibly and effectively.

LetX denote the set of all admissible portfolios:

where p x and p y are the prices of the derivative assets and their underlying assets at time 0, l x , l y and u x , u y are the lower and upper bounds of the investment amount of the derivative assets and the underlying assets, respectively. The price of an option is different when it is bought and sold. The difference between the two prices is exactly the ask-to-bid spread which is a nonnegligible transaction cost in derivative market. Incorporating the ask-to-bid spread, the portfolio setX has the following form:

where p ask x and p bid x are vectors of the ask and bid prices, respectively, and x + and x − are the vectors with max{x i , 0} and min{x i , 0} being the ith element, respectively. For the ask-to-bid spreads in option strategies, please refer to Santa-Clara and Saretto (2009), Eraker (2013) and

Faias and Santa-Clara (2017).

Following the framework of mean-variance analysis (Markowitz (1952) ), we propose a meanvariance model with crash risk control as follows:

whereσ andρ are pre-determined risk tolerance parameters with respect to the risk under normal condition and the crash risk under extreme condition.

Remark 2 In accordance with Wilmott (2007) , the crash risk is defined as the "worst-case return". Thus crash risk constraint is defined as ρ crash (x, y) ≥ρ in (P ) to control the extreme loss under a crash.

As compared to the bicriteria mean-variance model, problem (P ) is a tricriteria decision model integrating crash risk. The model (P ) we propose is different from the existing tri-criteria portfolio selection models, such as Mean-Variance-Skewness model (Briec et al. (2007) ) and

Mean-Variance-CVaR model (Gao et al. (2016) ). In our model, the two risk criteria are defined according to portfolio value distributions under normal v.s. crash market situations whereas in other models risks have been defined in terms of a common portfolio value distribution.

Optimizing a utility function, for example, constant relative risk aversion (CRRA) function, is also a method widely used for portfolio construction, especially for option strategies (Bliss and Panigirtzoglou (2004) ; Faias and Santa-Clara (2017)). The expected utility maximization framework integrates all distribution information in a clear way. However, it is inconvenient for investors to specify their utility functions. Even if the utility function is determined, the limitation may remain. As for the CRRA utility, the existence of expectation of CRRA utility is extremely fragile with respect to distribution assumption (Geweke (2001) ). In contrast, the return-risk framework does not take into account all the distribution information. But it captures the key point of return-risk tradeoff and facilitates the practical application.

In problem (P ), we can use (9) or (12) to define crash risk ρ crash (x, y), and regardless of which definition of crash risk is adopted, it can be translated into a semidefinite program.

In the following, to avoid unnecessary repetition, we only show the details of reformulation of problem (P ) with crash risk defined by (9) and the details of using (12) is proposed in Appendix B.

First, we show that the first constraint on normal risk in (P ) can be formulated as a secondorder cone constraint. Following Cui et al. (2013) , we give the derivation below. Attention needs to be paid to the fact that we can rewrite the second term in the right side of equation (5) as

Since Σ is a covariance matrix, it is positive semidefinite and can be decomposed as Σ = Σ where Σ 1 2 is symmetrical. Notice that tr(ABC) = tr(BCA) = tr(CAB). Then we further have

where the last inequality is from the fact that tr(A 2 ) ≥ 0 holds for any real symmetrical matrix A. Thus, Φ in (14) is a positive semidefinite matrix.

Since both Φ and Σ are positive semidefinite, we can decompose them as Φ = M M and Σ = L L.

Denote

By (5), we can rewrite σ(x, y) ≤σ as the following second-order cone constraint

which means that (x , y ) H ≤σ.

Second, we show that the constraint on crash risk defined by (9) can be reformulated as a semidefinite constraint. Notice the constraint is defined as

Now we invoke the following well-known S-Lemma (see, e.g., Boyd et al. (1994) ) to demonstrate that constraint (16) can be equivalently formulated as a semidefinite constraint.

Then, according to the notations of Lemma 1, δ ∆u + 1 2 ∆u Γ∆u + θ∆t ≥ρ and ∆u ∈ U are F 0 (∆u) ≤ 0 and F 1 (∆u) ≤ 0, respectively. Noting that F 1 (Λ −1 1 ∆u 0 ) < 0, by Lemma 1, the crash risk constraint is satisfied if and only if there exists a real number λ ≥ 0 such that

Note that A 0 (x), b 0 (x, y) and c 0 (x) are linear in x and/or y.

Turning to the portfolio set X , we notice that the term (p bid x ) x − in the budget constraint in X is not convex with x. We rewrite the budget constraint:

Due to the fact that the ask price is not lower than the bid price for an option, the constraint above can be transformed in the form:

which is equivalent to a group of linear constraints

Thus, portfolio set X is a polyhedral set formed by linear constraints.

Using (4), (15), (17) and (18), we get the following proposition which means that the portfolio optimization problem with crash risk control can be solved by a tractable convex programming approach.

Proposition 3 Problem (P) with crash risk defined by (9) can be transformed to the fol-lowing equivalent semidefinite program (SDP):

If we omit the second constraint in (P 1 ), then it is referred as a second-order cone program (SOCP). SOCP and SDP are both instances of linear conic program, as they are a linear optimization problem under constraints represented by second-order cones or constraints represented by cones of positive semidefinite matrices. Both of them can be regarded as an extension of linear program (LP). The interior point methods applied in LP can be easily extended to SOCP and SDP, which has been extensively investigated over the last two decades. LP is in fact a special case of SOCP, and SOCP is a special case of SDP, i.e., LP ⊂ SOCP ⊂ SDP.

The reader is referred to Lobo et al. (1998) and Alizadeh and Goldfarb (2003) for details of SOCP problems, and Vandenberghe and Boyd (1996) for details of SDP problems. The wide applications of conic programming in finance can be found in Cornuejols and Tütüncü (2006) .

The portfolio optimization problem (P ) can also be defined as one of minimizing normal risk (under constraints placed on expected returns) as well as crash risk, and in this way can also be reformulated as an SDP. Reformulation (P 2 ) of (P ) in terms of SDP with crash risk defined by (12) using a factor model is given in Appendix C.

In this section, we mainly conduct the simulation/empirical test to compare the performance of portfolio strategies generated by different models. The following four portfolio strategies are considered:

1) Crash strategy without factor generated by model (P 1 ) without using factor model;

2) Crash strategy generated by model (P 2 ) using factor model;

3) No crash strategy generated by model (P 2 ) without crash risk constraint; 4) No option strategy generated by model (P 2 ) without options.

Simulation/empirical test is based on historical data related to the index of Dow Jones

Industrial Average (DJS), the constituent stocks of DJS and the options written on these stocks and index. In the following, we first investigate the characteristics of value/price change of DJS constituent stocks, which is used to verify the motivation of this paper. We then conduct simulation and empirical analysis to compare in-sample efficient frontiers and outof-sample performance of different strategies by using data sets of the DJS constituents and options written on them. Finally, we conduct empirical test on the performance of different strategies based on data sets of DJS index and the associate index options in Subsection 3.3.

All the calculations are conducted on a personal computer using Matlab R2015b and Stata 11.0, and the SDP problem is solved via CVX, which is a Matlab-based modeling package for convex optimization problems.

In this subsection, we investigate the characteristics of the value/price change of DJS constituent stocks. We perform a test for one dimensional normality of stock returns using samples from January 2000 to December 2018. After deleting 2 constituents with missing data, 28

constituents are retained in the empirical study. Notice that ∆u i = u ∆t i − u 0 i = r i u 0 i where r i denotes the return over time period [0, ∆t] . Thus it can be inferred that the properties of ∆u i are characterized equivalently by r i since u 0 i is a pre-given parameter.

We perform statistical test on the normality of stock returns by using different data samples associated with different time frequencies . Since the test of joint normality in a highdimensional case is a hard thing, we just perform the test for one-dimensional case in this paper.

We apply both Kolmogorov-Smirnov (K-S) test and Jarque-Bera (J-B) test of the normality of price change for each constituent of DJS index. The data samples are divided into three categories: (i) full samples, (ii) full samples after removing those three times standard deviation above or below the mean and (iii) full samples after removing those two times standard deviation above or below the mean. Confidence levels for statistical tests are set as α = 5% and 1%.

The results of the test are summarized in Table 1 . From the table, we can see that for daily data with relatively high frequency, price changes of a large part of stocks are non-normal whether or not the extreme data has been removed. But for weekly data and monthly data, after removing extreme cases with two standard deviations above or below the mean, the normality assumption on the price changes cannot be rejected for more than 95% of the stocks. Based on the results of normality test, we cluster the historical returns of the corresponding index as follows: returns with two standard deviations above the mean, returns with two standard deviations below the mean, and the rest. Accordingly, we classify the individual stock data samples as boom samples, crash samples and normal samples, respectively.

Factor model is usually used to describe the common features of a group of stocks. By statistical analysis, two-factor models are enough to model the returns of the constituents of DJS index. Figure 3 illustrates the distribution of two standardized factors for the historical daily returns of DJS constituents. Typically, the most typical characteristic is that the factor values during both a boom and a crash scatter near the two poles of the ellipsoid which covers all the realizations of factor values. This finding confirms that it is reasonable to use the worstcase realization of the portfolio returns within an ellipsoid covering all the possible returns to measure the crash risk. 

The details of data sets used in the analysis and test in this subsection are as follows: All the data for stocks and options are from Bloomberg Database. We divide the time period of each data set into in-sample period and out-of-sample period. For each data set, the out-of-sample period is the same as the time period of option data, while the in-sample period is the time period of the stock data after removing the out-of-sample period.

For problem formulation related to Data sets 1 and 2, we use LSM (Least-Square Monte Carlo Simulation) method given by Longstaff and Schwartz (2001) to calculate or estimate the prices and "Greeks" of American options. Specifically, we simulate 20000 different 5-period paths of the underlying stock price under the risk-neutral measure. By the end of last period, the option will be exercised if it is in the money. The option will be exercised prior to the last period if the value of immediate exercise is more than the expected value of continuation. LSM uses least-square method to calculate the expected value of continuation which can sharply decrease the scale of simulation. When the exercise time has been determined on each path, we can calculate the price of the American option using the mean of the discounted exercising value on these paths. The "Greeks" of American option can be calculated using finite difference method. For example, to calculate δ, we change the price of the stock for one unit and calculate the difference of the option price as an approximation of δ. γ can be similarly calculated by using second order difference.

Normal risk in the model is estimated only by normal samples. As to the ellipsoid involved in (10) or (13), we adopt the Löwner-John ellipsoid, i.e., the minimum volume ellipsoid containing all the samples, as illustrated in Figure 2 . Calibrating this ellipsoid is a convex optimization problem (see, e.g., Boyd et al. (1994) ), which can be solved by CVX when the number of data points is relatively small. We first calculate the center (mean) of the samples, then pick out the top ten percent data farthest from the center and reconstruct the minimum volume ellipsoid containing these data. Finally, we add the data out of the ellipsoid and reconstruct the minimum volume ellipsoid containing all the samples. All the portfolio optimization models are constructed with the weekly data facilitating description and comparison.

In this subsection, we compare the efficient frontiers of crash strategy without factor, crash strategy, no crash strategy and no option strategy. Investment period is set as one week, the risk free interest rate r is set to be 5% per year, the total amount of derivatives is restricted within 30%, the upper and lower bounds for holding any assets are 10% and −10%, respectively.

To calculate the efficient frontier of crash strategy without factor by model (P 1 ), we first determine the lower and upper bounds of parameterσ by minimizing and maximizing σ(x, y)

under the constraint (x, y) ∈ X , respectively. Then we choose 10 values ofσ uniformly in this interval. For each choice ofσ, we set the lower and upper bounds of parameterρ by minimizing and maximizing ρ crash (x, y) with constraints σ(x, y) ≤σ and (x, y) ∈ X . Again we choose 10 values ofρ uniformly in this interval. Finally, for each pair of (σ,ρ), we solve the optimization problem (P 1 ) to get the maximum µ, denoted byμ, and consequently we get the efficient frontier of crash strategy without factor sketched by the triples of (μ,σ,ρ). The calculations of efficient frontiers of crash strategy and no option strategy according to model (P 2 ) are basically the same as that according to model (P 1 ). It should be mentioned that the efficient frontier of no crash strategy is with some speciality since it does not consider the parameterρ. However, it can be regarded as the one generated by the model with crash risk control where the parameterρ is set sufficiently small. As no crash strategy does not take into account the constraint on crash risk, the corresponding efficient frontier is a two-dimensional curve lying exactly on the boundary (associated with highest crash risk) of the three-dimensional efficient frontier corresponding to crash strategy.

Thus, when an extreme event happens, the possible loss of a strategy without considering the crash risk could be very large. Another interesting finding from these figures is that crash risk is positively correlated with normal risk. Hence, the traditional minimum variance strategy is usually accompanied by a relatively small crash risk, which might be the reason why the minimum variance strategy performs well in practice. Figure 4 show the efficient frontiers corresponding to no option strategy. We find that the efficient frontier of no option strategy is much narrower than that of crash strategy using options. Furthermore, the former is totally dominated by the latter. This means that using option can not only bring a more flexible control on crash risk, while also enhancing the portfolio performance. Using the leverage of option can nonetheless increase risk.

In this subsection, we test the performance of different strategies in different market scenarios ranging from a crash to a normal and a boom market simulated using data set 1 and 2. We first generate the optimal crash strategy and no crash strategy with different parameters.

Then we simulate 100 different market scenarios, which gradually change from crash scenarios to normal and boom scenarios. Specifically, we calculate the ellipsoid associated with factors defined by (13) The subfigures show that when an extreme crash happens, no crash strategy suffers a large loss, while high risk strategy bears a relatively small loss and low risk strategy basically maintains its initial value. In a normal market, the performance of crash strategy and no crash strategy are similar. In a boom market, no crash strategy performs better than crash strategy.

These results suggest that crash strategy helps to avoid losses in a market crash but only at the cost of losing opportunities under a boom market.

In this subsection, we compare the out-of-sample performance of crash strategies with no crash strategy in a rebalancing manner with real data.

The data sets described in Subsection 3.1 are used to construct portfolio strategies. Note that the out-of-sample period is the same as the time period of option data. For two data sets, we setρ = −0.05, −0.15 andσ = 0.15. The portfolio strategies are denoted as low risk and high risk according to different choices of crash risk parameterρ, respectively. The other parameters required by model (P 2 ) are set the same as in Subsection 3.2.2.

We start with an initial wealth P 0 = 1 and update the portfolio strategies every week. We estimate the required parameters at the beginning of each week using the samples taken prior to the portfolio rebalance, and reconstruct the portfolio strategies with the updated parameters.

We calculate the portfolio value at the beginning of every week during the out-of-sample period using the real data. As trading costs have a significant influence on the performance of option portfolios during the rebalancing strategy we take them into account in the form of ask-to-bid spreads in the rebalance process.

The detailed process is described as follows. At the initial time, we solve the problem (P 2 ) and generate an initial portfolio (x 0 , y 0 ). Denote the cash K 0 = 1 − p ask

Assume that (x t , y t ) and K t are the portfolio and cash amount generated at the beginning of week t. The total value of the strategy at week t + 1 is calculated by

where p ask x and p bid x are the ask and bid prices of options, p y is the constituent price at week t + 1, and r is the riskfree rate. At the beginning of week t + 1, we rebalance the portfolio strategy based on the existing portfolio (x t , y t ) and the cash K t . Here, the admissible portfolio

where p ask x , p bid x are the ask and bid prices of options at week t + 1, p y is the stock price at

week t + 1, and a + , a − are the vectors with max{a i , 0} and min{a i , 0} being the ith elements, respectively. Solving problem (P 2 ) with setX generates portfolio x t+1 , y t+1 , and the cash We also calculate the weekly returns of these strategies during out-of-sample period. Tables 2 summarizes the results: mean return (MeanRet), standard deviation (Std), minimum return (MinRet), average drawdown (ADD), upside potential ratio (UP ratio) and downside Sharpe ratio (DS ratio). Here, we just recall the details of the average drawdown and upside potential ratio, because they are not as common as others in the literature. The average drawdown is defined as (Alexander and Baptista (2006) ; Chekhlov et al. (2005) ):

where W t is the portfolio value at time t during the out-of-sample period. The upside potential ratio (see Sortino and Van Der Meer (1991) ) measures the ratio of higher partial moment of order 1 with the lower partial moment of order 2 under a target return level τ :

where r 1 , · · · , r N are weekly returns during the out-of-sample period. Here, we set τ the mean return of DJS index. The downside Sharpe ratio (see Ziemba (2005) ) is defined as the ratio of the mean return to the square root of lower semivariance. According to the listed 6 performance measures, among the three type of strategies, the low risk strategies have slight outperformance when compared with high risk strategies and no crash strategies.

To detect the performance of the strategies during the 2007 -2008 crisis, we also conduct an analysis over a time period from January 2007 through December 2018. Unfortunately, we cannot obtain real option data during the crisis. Thus, we simulate a series of European options based on the constituents of DJS index. In the test , the set of candidate assets contains the constituents of the DJS index and simulated European options based on these constituents.

We use the historical prices of the constituents of the DJS index from January 2000 through we construct 4 one-month-to-maturity options for each constituent including four options: an ATM call, an ATM put, a 5% OTM call and a 5% OTM put. All the prices and "Greeks" of these options are calculated under the Black-Scholes pricing formula. We can see that, during 2007 to 2009, the strategy with low crash risk could resist a crash and gain the best portfolio value. On the other hand, during the boom period, no crash strategy has an evident advantage of portfolio value when compared with crash strategies at both required risk levels as well as DJS index. Table 3 summarizes the monthly returns of these strategies from 2007 to 2018. The strategy with low crash risk has the minimum standard deviation, the highest minimum return, the best downside Sharpe ratio and the minimum average drawdown among these three strategies. 

In this subsection, we analyze the performance of different portfolio strategies constructed using index and index options, including no crash strategy and two crash strategies with two different required crash risk levels. Compared with the data set related to constituents of DJS index , we can collect more real data of the DJS index related options, so we can conduct relatively long-term test. Here, the upper and lower bounds for option are 0.5 and -0.5, respectively. We start with an initial wealth of 1 in May 2019, and rebalance the strategies every week.

The portfolio value are evaluated the same way as before, and the only difference is that we evaluate the portfolio value by delivering the options in the third Friday of every month. In the model, we setσ = 0.02 andρ = −0.05, −0.15. Other parameters are set in a same way. Figure   9 displays the portfolio value of these strategies from May 2019 to May 2020. It is obvious that the low risk strategy performs better than other strategies, especially during the volatile months. Table 5 list the different performance measures of these strategies. The strategy with low crash risk performs best in average return, upside potential ratio, downside Sharpe ratio and average drawdown among the six performance measures. From these results, it seems that the model with crash risk control can generate a stable strategy by reducing the opportunities of a boom market. This is not, however, the case in practice. In the test, call and put options are both used in portfolio construction, and crash and boom data samples are both covered by the ellipsoid adopted to measure crash risk. In this setting both crash and boom could cause heavy losses. Therefore only stable strategies can satisfy the crash risk constraint. The model allows for flexibility to suit different purposes. If one wants to hedge the risk resulting from a crash but also wants to avoid losing the opportunities of a rising market, then it is possible, for example, to hold a put option while shrinking the ellipsoid for measuring crash risk covering only the crash down data samples.

In this paper, we have investigated performance measures, hedging and optimization of portfolio selection involving crash risk. The basic idea for dealing with crash risk has been to distinguish the modeling of return in normal market conditions from that in the extreme situation of a market crash. Following Wilmott (2007) , we define the crash risk as the maximum potential loss in an extreme situation and show that it is reasonable to use an worst-case realization of the portfolio return within an ellipsoid to measure it. We have also clarified that the return (value change) of a sufficiently diversified hedged portfolio in normal market conditions approximated with "Greeks" is close to a normal distribution, thus its risk can be well captured by variance.

We have further revealed that the optimal portfolio selection problem with crash control can be translated into an efficiently solvable semidefinite program.

In addition, we have compared the performance of a crash strategy with that of a no crash strategy via both a simulation analysis and an empirical test. The results have shown that the crash strategy can resist crash, while the no crash strategy cannot. Nonetheless the crash strategy may lose its opportunity in certain situations, for example, a stable strategy with crash control in any market conditions will bear heavy losses in a boom market. This is to be expected as the crash strategy cannot completely dominate the no crash strategy. In fact, the two-dimensional efficient frontier of no crash strategy lies exactly on the boundary of the three-dimensional efficient frontier of the crash strategy. That's to say the no crash strategy can be regarded as a crash strategy with a sufficiently loose parameter for crash risk control.

Finally, it should be mentioned that the simulation analysis and empirical test done in Section 3 exhibit only the outcomes for the case that the ellipsoid used to measure crash risk is selected as the one covering all the historical returns, which is clearly the most conservative but not the necessary choice. Investment is not only a "science", but also an "art". Although our focus has been on the science aspect, the art aspect (e.g., predicting the future and determining the location and size of the ellipsoid used to capture the crash risk) is critical to successful practice and should be problem-oriented, and to this end we suggest that the model with crash risk control offers greater flexibility.

Appendix A: Proof of Proposition 1

Recall that ∆u ∼ N (µ, Σ) and

∆v(x, y) δ ∆u + 1 2 ∆u Γ∆u + θ∆t

Since Σ is a covariance matrix, thus it is positive semidefinite and can be decomposed as Σ = Σ Then the change of portfolio value can be further reformulated as

where ξ ∼ N Σ − 1 2 µ, I .

We can decompose the real symmetrical matrix Ω as Ω = CΛC , where Λ is the diagonal matrix constructed by the eigenvalues λ 1 , · · · , λ m of Ω, and C is an orthogonal matrix consisting of the corresponding eigenvectors. Let z = C ξ, we can express 1 2 ξ Ωξ as a linear combination of independent χ 2 1 random variables:

Recall the fact that if ζ is a normally distributed random variable with mean µ and variance σ 2 . Then we have that the random variable ( ζ σ ) 2 follows noncentral χ 2 -distribution with mean 1 + ( µ σ ) 2 and variance 2 + 4( µ σ ) 2 . By (20) and (21), we have that µ(x, y) = E (∆v(x, y)) By (20), the variance of ∆v(x, y) is given as σ 2 (x, y) = var (∆v(x, y)) = var δ Σ 1 2 ξ + var 1 2 ξ Ωξ + 2cov δ Σ 1 2 ξ, 1 2 ξ Ωξ .

It is easy to see that the first term of (22) equals δ Σδ. Let's consider the second and the third terms in the sequel. where the third equation is based on the fact that cov δ Σ 1 2ξ , 1 2ξ Ωξ = 0 due to the the expectation ofξ equals zero (see Lemma 2 of Stein (1981) and page 175 of Britten-Jones and Schaefer (1999)).

By (22), we have that σ 2 (x, y) = δ Σδ + 1 2 tr (ΓΣ) 2 + µ ΓΣΓµ + 2δ ΣΓµ

The proof is completed.

Remark 4 Britten-Jones and Schaefer (1999) derive the similar results for the special case of ∆u ∼ N (0, Σ). Cui et al. (2013) derive the the same results under the additional condition that Γ is nonsingular.

We first introduce the following Lévy's continuity lemma (Van der Vaart (1998) . Since the characteristic function of standard normal distribution is e − 1 2 t 2 , according to Lemma 2, to prove Y n N (0, 1), we only need to prove lim n→+∞ E e itYn = e − 1 2 t 2 or equivalently lim n→+∞ ln E e itYn = − 1 2 t 2 . Recall that the mean and variance of z 2 i are 1+ζ 2 i and 2+4ζ 2 i . Let M = n j=1 λ 2 j 1 + 2ζ 2 j .

Notice the fact that the characteristic function of z 2 j is (1 − 2it) − 1 2 e it 1−2it ζ 2 j , then we have

.

Denoting a j = 

By using (4), (15) and (25), we get the following equivalent SDP reformulation of

Portfolio selection with a drawdown constraint

Minimizing CVaR and VaR for a portfolio of derivatives

Second-order cone programming

Coherent measures of risk

Option-implied risk aversion estimates

Linear Matrix Inequalities in System and Control Theory

Mean-variance-skewness portfolio performance gauging: A general shortage function and dual approach

Non-linear value-at-risk

Drawdown measure in portfolio optimization

Optimization Methods in Finance

Nonlinear portfolio selection using approximate parametric value-at-risk

An overview of value at risk

The performance of model based option trading strategies

Equity premium predictability from cross-sectoral downturns. Available at SSRN 2617242

Optimal option portfolio strategies: Deepening the puzzle of index option mispricing

Mean-risk analysis with risk associated with below-target returns

Dynamic mean-risk portfolio selection with multiple risk measures in continuous-time

A note on some limitations of CRRA utility

Monte Carlo Methods in Financial Engineering

Options, Futures and Other Derivatives

Improving portfolio performance with option strategies: Evidence from Switzerland

Value at risk: The New Benchmark for Managing Financial Risk

Tail risk and asset prices

Mean-absolute deviation portfolio optimization model and its applications to Tokyo stock market

Optioned portfolio selection: Models and analysis

Applications of second-order cone programming

Valuing american options by simulation: a simple least-squares approach

Portfolio selection

Optimization of conditional value-at-risk

Option strategies: Good deals and margin calls

Downside risk

Estimation of the mean of a multivariate normal distribution. The Annals of Statistics

Optimizing international portfolios with options and forwards

Asymptotic statistics

Semidefinite programming

Paul Wilmott Introduces Quantitative Finance

A minimax portfolio selection rule with linear programming solution

The symmetric downside-risk sharpe ratio

Worst-case value-at-risk of non-linear portfolios

This research is supported by National Natural Science Foundation of China under grants 71471180, and 71721001. We authors would like to thank the reviewers of the paper for their insightful comments.

Appendix C: Semidefinite Programming Reformulation of (P ) with Crash Risk Defined by (12) We just need to show that the constraint on crash risk defined by (12) can be reformulated as a convex semidefinite constraint similar to that defined by (9). According to (12), we havewhere β 0 = (β 10 , · · · , β m0 ) and B = (β ij ) m×l and f = (f 1 , . . . , f l ) . Then the constraint on crash risk is equivalent toNow we use Lemma 1 to demonstrate that (24) can be reformulated as a semidefinite constraint.According to the notations of Lemma 1, δ (β 0 + Bf ) + 1 2 (β 0 + Bf ) Γ(β 0 + Bf ) + θ∆t ≥ρ is actually F 0 (f ) ≤ 0, and f ∈Ũ is actually F 1 (f ) ≤ 0. Noting that F 1 Λ −1 2 f 0 < 0, by Lemma 1, the crash risk constraint is satisfied if and only if there exists a real number λ ≥ 0 such that

The values of European call and put options in terms of Black-Scholes formula are given as:S − the underlying asset price at current time t;K − the exercise price;T − the expiry time;σ − the volatility of the underlying asset return;r − the riskless return;N (x) − cumulative probability distribution function of standard normal distribution.The "Greeks" under Black-Scholes pricing formula are summarized in table 6.