SEMI-PARAMETRIC APPROACH TO LARGE-SCALE PORTFOLIO OPTIMIZATION WITH FACTOR MODELS OF ASSET RETURNS

BACKGROUND OF THE INVENTION
Field of the Invention

The present invention relates generally to a system and method for management of a portfolio of financial assets and, more particularly, to large-scale portfolio optimization. Specifically, various embodiments in accordance with the present invention provide a system and method for modeling and solving large-scale portfolio optimization problems, including mean-variance optimization, expected utility maximization, and general mean-risk optimization problems.

Description of the Prior Art

Since H. Markowitz, Portfolio selection, Journal of Finance, 7(1): 77-91, 1952, portfolio management problems are routinely formulated and solved as mean-variance portfolio optimization problems, where the expected return of a portfolio is traded off with its risk and where risk is represented as portfolio variance. Let {tilde over (R)} be the random n-vector of asset returns. The mean-variance portfolio optimization problem may be stated as

${\max (E \tilde{R})}^{T} x - \frac{γ}{2} x^{T} M x$

$Ax = b, l \leq x \leq h$

where E{tilde over (R)} is the n-vector of expected asset returns; M=[M_ij] is the n×n covariance matrix of asset returns (M_ij=cov({tilde over (R)}_i,{tilde over (R)}_j)), γ is the risk aversion parameter, Ax=b are linear constraints, and l and h are lower and upper bounds on asset holdings. The formulation Ax=b may include a portfolio constraint e^Tx=1, for modeling both long-only and long-short portfolios and/or more elaborate constraints for controlling the leverage of long-short portfolios. It may also include sector exposure constraints, industry exposure constraints, transaction cost modeling, turnover constraints, and any constraints related to a piecewise-linear market impact model representation.

Using mean-variance optimization is particularly appropriate when asset returns are approximately multi-variate normally distributed, i.e., {tilde over (R)}≈N(E{tilde over (R)}, M), since in this case the distribution is fully determined by E{tilde over (R)} and M only and all higher moments are either zero (odd) or monotonically determined by M (even). But it is also appropriate to use when asset returns are not approximately multi-variate normally distributed but an investor only cares about portfolio variance (or tracking error) as a measure of risk. By varying the risk aversion parameter γ from zero to a very large number, one may determine the optimal efficient frontier from the point of considering expected returns only (γ=0) to the point of the minimum variance portfolio (γ=∝).

A related, more general, concept is expected utility maximization. Let u(W) be a concave utility function of wealth (W). The expected utility maximization problem may be stated as

max E u(1+{tilde over (R)}^Tx)

Ax=b,l≤x≤h

where, given an initial wealth normalized to 1, the end-of-period wealth is given by the random variable W=1+{tilde over (R)}^Tx and evaluated by the utility function u(W). Typically, the utility function is assumed to be monotonically increasing and concave. The functional form of the utility function u(W), in particular, the ArrowPratt risk aversion; see K. J. Arrow, Aspects of the theory of risk bearing, Essays on the Theory of Risk Bearing, Markham, Chicago, pages 90-109, 1965 and J. W. Pratt, Risk aversion in the small and in the large, Econometrica, 32, pages 122-136, 1964, a measure of the second derivative of the utility function, represents the investor preference. Utility functions frequently used are from the HARA (hyperbolic absolute risk aversion) class of utility functions. Often in finance the power function is used, i.e.,

$u (W) = \frac{W^{1 - γ} - 1}{1 - γ},$

where γ≥0 and γ≠1. Here γ

represents the constant (with respect to wealth) relative risk aversion parameter describing the investor preference towards risk. Therefore, the power function is typically referred to as CRRA (constant relative risk aversion). Another utility function in the HARA class is the exponential utility function u(W)=−e^−λW, where λ represents the constant (with respect to wealth) absolute risk aversion parameter. The exponential utility function is also referred to as CARA (constant absolute risk aversion). The logarithmic utility function u(W)=log(W) is a special case of the power function for the limit of the risk, aversion γ=1. The latter has the property of maximizing growth in a multi-period setting. A generalization is the generalized log utility function u(W)=log(a+W), where, by proper choice of a, increasing and decreasing relative risk aversion may be modeled; see M. Rubin-stein, Risk aversion in the small and in the large, Econometrics, pages 32:122-136, 1965. The above-mentioned functions exhaust the class of HARA utility functions. An one-switch class of utility functions is given by D. Bell, One switch utility functions and a measure of risk. Management Science, 24(12):1416-1424, 1965. Other utility functions that are not necessarily monotonically increasing and concave have been devised to represent specific investor risk preferences; see D. Kahnemann and A. Tversky, Prospect theory: An analysis of decisions under risk, Econometrics, 47(2), pages 263-291, 1979. These are important for determining individualized portfolios.

Expected utility maximization facilitates the appropriate representation of all higher moments (skewness, kurtosis, etc.) of the asset return distribution in the portfolio optimization framework. It can be shown that if the utility function is quadratic (using only the increasing part of the quadratic function), the expected utility maximization problem results in a mean-variance optimization problem and therefore gives identical results. Any other utility function will yield different results, when asset returns are not multi-variate normally distributed. If asset returns are multi-variate normally distributed, any monotonically increasing and concave utility function will yield a mean-variance efficient portfolio. One expects different results for expected utility maximization than mean-variance analysis when asset returns are not multi-variate normally distributed and the utility function is not quadratic. But the differences in the portfolios may be small, as Y. Kroll, H. Levy, and H. Markowitz, Mean-variance versus direct utility maximization, Journal of Finance, 39(1), pages 47-61, 1984, argued. Since one may locally approximate any utility function by a quadratic approximation, the mean-variance model will in most cases give reasonable results even for asset returns not following a multi-variate normal distribution. This explains the great success of mean-variance portfolio optimization. However, tail behavior might be very different.

A generalization of the mean-variance framework is mean-risk portfolio optimization. The mean-risk portfolio optimization problem may be stated as

${\max (E \tilde{R})}^{T} x - \frac{γ}{2} Risk ({\tilde{R}}^{T} x)$

$Ax = b, l \leq x \leq h$

where Risk({tilde over (R)}^Tx) is a risk measure on the distribution of portfolio returns {tilde over (R)}^Tx. In mean-variance portfolio optimization the risk measure used is portfolio variance, i.e., Risk({tilde over (R)}^Tx)=var({tilde over (R)}^Tx)=E({tilde over (R)}^Tx−E({tilde over (R)}^Tx))²=x^TMx. Mean-variance optimization is therefore part of the broader class of mean-risk portfolio optimization, where various risk measures are considered.

Risk measures are typically either dispersion measures or downside risk measures. Variance is a dispersion measure. Another dispersion measure is the mean-absolute-deviation measure (MAD), introduced by H. Konno and H. Yamazaki, Mean-absolute deviation portfolio optimization model and its application to Tokyo stock market, Management Science, 37(5), pages 519-531, 1991. Typical downside risk measures are the semi-variance and variants of lower partial moments. But most important in finance are the tail measures Value-at-Risk (VaR) and Conditional-Value-at-Risk (CVaR).

In addition to considering a risk measure in the objective function, risk, especially downside risk, may also be controlled as a constraint. For example,

Risk({tilde over (R)}^Tx)≤ρ

may be a constraint added to the mean-variance portfolio optimization problem or to the expected utility portfolio optimization problem, where ρ is the maximum level of acceptable risk as defined by the risk measure Risk. Special risk constraints may involve higher moments of the returns distribution, in particular, one may constrain the skewness and/or the kurtosis of the returns distribution.

SUMMARY OF THE INVENTION

Various embodiments in accordance with the present invention provide a system and method for efficiently solving general classes of large-scale financial asset portfolio optimization problems. Preferred embodiments of the system and method in accordance with the present invention solve the general portfolio optimization problem, employing a factor representation of asset returns. Various embodiments in accordance with the present invention calibrate the optimization model to a benchmark to obtain unconditional mean returns and enable active management based on conditional expected return predictions. Various additional embodiments of the system and method in accordance with the present invention consider derivatives as part of the portfolio.

BRIEF DESCRIPTION OF THE DRAWING

The various embodiments of the present invention will be described in conjunction with the accompanying figures of the drawing to facilitate an understanding of the present invention. In the figures, like reference numerals refer to like elements. In the drawing:

FIG. 1 is a block diagram of an example of a system in accordance with a preferred embodiment of the present invention implemented on a personal computer.

FIG. 2 is a block diagram of an example of a system in accordance with an alternative embodiment of the present invention implemented on a personal computer coupled to a web or Internet server.

FIG. 3 is a flowchart illustrating a method in accordance with a preferred example of the present invention for providing large-scale portfolio optimization with factor models of asset returns.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

The present invention is particularly applicable to a computer implemented software based financial asset portfolio management system for providing large-scale portfolio optimization, and it is in this context that the various embodiments of the present invention will be described. It will be appreciated, however, that the system and method for providing general portfolio optimization, including mean-variance optimization, expected utility maximization, and mean-risk optimization in large-scale portfolio management in accordance with the present invention have greater utility, since they may be implemented in hardware or may incorporate other modules or functionality not described herein.

FIG. 1 is a block diagram illustrating an example of a general portfolio, management system 10 for large-scale portfolio optimization in accordance with one embodiment of the present invention implemented on a personal computer 12. In particular, the personal computer 12 may include a display unit 14, which may be a cathode ray tube (CRT), a liquid crystal display, or the like; a processing unit 16; and one or more input/output devices 18 that permit a user to interact with the software application being executed by the personal computer. In the illustrated example, the input/output devices 18 may include a keyboard 20 and a mouse 22, but may also include other peripheral devices, such as printers, scanners, and the like. The processing unit 16 may further include a central processing unit (CPU) 24, a persistent storage device 26, such as a hard disk, a tape drive, an optical disk system, a removable disk system, or the like, and a memory 28. The CPU 24 may control the persistent storage device 26 and memory M. Typically, a software application may be permanently stored in the persistent storage device 26 and then may be loaded into the memory 28 when the software application is to be executed by the CPU 24. In the example shown, the memory 28 may contain a large-scale portfolio optimization tool 30 for portfolio management. The portfolio optimization tool 30 may be implemented as one or more software modules that are executed by the CPU 24. In accordance with various contemplated embodiments of the present invention, the general portfolio management system 10 may also be implemented using hardware and may be implemented on different types of computer systems, such as client/server systems, Web servers, mainframe computers, workstations, and the like.

Thus, in accordance with another embodiment of the present invention, the general portfolio optimization system 10 is implemented via a hosted Web server. A system using a hosted Web server, generally indicated by the numeral 1801, is shown in FIG. 2. The system 1801 preferably comprises a Web-based application accessed by a personal computer 1802, as shown in FIG. 2. For example, the personal computer 1802 may be any personal computer having at least two gigabytes of random access memory (RAM), using a Web browser, preferably MICROSOFT Internet Explorer 6.0 browser or greater. In this example, the system 1801 is a 128-bit SSL encrypted secure application running on a MICROSOFT Windows Server 2003 operating system or Windows Server 2000 operating system or later operating system available from Microsoft Corporation located in Redmond, Wash. The personal computer 1802 also, comprises a hard disk drive preferably having at least 40 gigabytes of free storage space available. The personal computer 1802 is coupled to a network 1807. For example, the network 1807 may be implemented using an Internet connection. In one implementation of the system 1801, the personal computer 1802 can be ported to the Internet or Web, and hosted by a server 1803. The network 1807 may be implemented using a broadband data connection, such as, for example, a DSL or greater connection, and is preferably a Tl or faster connection. The graphical user interface of the system 1801 is preferably displayed on a monitor 1804 connected to the personal computer 1802. The monitor 1804 comprises a screen 1805 for displaying the graphical user interface provided by the system 1801. The monitor 1804 may be a 15 color monitor and is preferably a 1024×768, 24-bit (16 million colors) VGA monitor or better. The personal computer 1802 further comprises a 256 or more color graphics video card installed in the personal computer. As shown in FIG. 2, a mouse 1806 is provided for mouse-driven navigation between screens or windows comprising the graphical user interface of the system 1801. The personal computer 1802 is also preferably connected to a keyboard 1808. The mouse 1806 and keyboard 1808 enable a user utilizing the system 1801 to perform general portfolio management. Preferably, the user can print the results using a printer 1809. The system 1801 is implemented as a Web-based application, and data may be shared with additional software (e.g., a word processor, spreadsheet, or any other application). Persons skilled in the art will appreciate that the systems and techniques described herein are applicable to a wide array of business and personal applications.

In accordance with a preferred example of the method of the present invention shown in FIG. 3 to manage a portfolio of financial assets to provide large-scale portfolio optimization, including mean-variance optimization, expected utility maximization, and general mean-risk portfolio optimization, the representation of asset returns is presented via a factor model.

Asset Returns

Factor models have been introduced that linearly relate at each period t≥1 the n-vector of asset returns {tilde over (R)}_tto the values (or change in values) of a smaller number k of factors, {tilde over (V)}_t. The factor model representation of the asset returns has many desirable properties, including that it has good explanatory power and the resulting covariance matrix of asset returns is of full rank. It has also theoretical importance, as modern asset pricing theories have factor models as their underpinnings, e.g., the capital asset pricing model (CAPM); see William F. Sharpe, Capital asset prices: A theory of market equilibrium under conditions of risk, Journal of Finance, 19(3): pages 425-442, 1964 and John Liirtner, The valuation of risk assets and the selection of risky investments in stock portfolios and capital budgets, Review of Economics and Statistics, 47(1): pages 13-37, 1965; and the arbitrage pricing theory (APT); see Stephen A. Ross, The arbitrage theory of capital asset pricing; Journal of Economic Theory, 13 (3), pages 341-360, provide equilibrium prices and returns for assets traded in the markets.

Suppose asset returns in each period t≥1 follow a factor model,

{tilde over (R)}
_t
={tilde over (F)}
_t
^T
{tilde over (V)}
_t+{tilde over (ε)}_t

where {tilde over (F)}_tis the k×n random matrix of factor loadings, {tilde over (V)}_tis the random k-vector of the values of the factors (sometimes also called factor returns), and {tilde over (ε)}_tis the random n-vector of idiosyncratic returns. The formulation includes a mean (or intercept) vector, if we define the (random) value of the first factor as having always the value 1, thus, the random returns for each asset i are represented as

{tilde over (R)}
_it
={tilde over (F)}
_1it
+{tilde over (F)}
_2it
{tilde over (V)}
_2t
+ . . . +{tilde over (F)}
_kit
{tilde over (V)}
_kt+{tilde over (ε)}_it.

We assume that the idiosyncratic returns {tilde over (ε)}_tare multi-variate normally distributed, {tilde over (ε)}=N(0, Σ_t), where the covariance Σ_t=diag(σ_it²), and {tilde over (ε)}_tis assumed independently distributed, between its components, respectively, and independently distributed with respect to {tilde over (V)}_t. (A factor model may also be defined for asset risk premia, i.e., excess returns over the risk-free rate, and the risk-free rate may be added to the asset risk premia to obtain asset returns.)

Underlying the factor model of asset returns may be two different statistical models of asset returns:

Statistical model (1): Let {tilde over (F)}_t=F be constant. Let {tilde over (V)}_t, t≥1 and {tilde over (ε)}_t, t≥1 be each independently and identically distributed random variables. Then {tilde over (R)}_t=F^T{tilde over (V)}_t+{tilde over (ε)}_t, t≥1 is an independently and identically distributed random variable. We observe at each period t=1, . . . T an outcome R_t, V_t, and ε_tof {tilde over (R)}_t, {tilde over (V)}_t, and {tilde over (ε)}_t, respectively. At period T+1, the current period at which a portfolio decision is to be made, we write the random vector of asset returns as

{tilde over (R)}
_T+1
|{tilde over (R)}1, . . . ,{tilde over (R)}T=F^T{tilde over (V)}_T+1|{tilde over (V)}1, . . . ,{tilde over (V)}T+{tilde over (ε)}_T+1|{tilde over (ε)}1, . . . ,{tilde over (ε)}T.

But based on independence, {tilde over (R)}_T+1={tilde over (R)}_T+1|{tilde over (R)}₁, . . . , {tilde over (R)}_T, {tilde over (V)}_T+1={tilde over (V)}_T+1|{tilde over (V)}₁, . . . , {tilde over (V)}_Tand {tilde over (ε)}_T+1={tilde over (ε)}_T−1|{tilde over (ε)}₁, . . . , {tilde over (ε)}_T, thus,

{tilde over (R)}
_T+1
=F
^T
{tilde over (V)}
_T+1+{tilde over (ε)}_T+1,

which we may write as

{tilde over (R)}=F
^T
{tilde over (V)}+{tilde over (ε)}

by setting {tilde over (R)}≡{tilde over (R)}_T−1, {tilde over (V)}≡{tilde over (V)}_T+1, and {tilde over (ε)}≡{tilde over (ε)}_T+1, thereby suppressing the time index for period T+1. Accordingly, {tilde over (ε)}=N(0, Σ), where Σ=diag(σ_i²).

Statistical model (1) is applicable to macro-economic factor models. The factors in this framework may be macro-economic variables that influence asset returns, such as (changes in) gross domestic product, oil prices, unemployment rate, interest rates, etc., and the factor loadings of an asset represent its exposure to each of the macro-economic factors.

Extensions of the model include possible time dependency of {tilde over (V)}_tand/or {tilde over (ε)}_t, by defining the conditional distributions {tilde over (R)}_T+1|{tilde over (R)}₁, . . . , {tilde over (R)}_T, {tilde over (V)}_T+1|{tilde over (V)}₁, . . . , {tilde over (V)}_Tand/or {tilde over (ε)}_T+1|{tilde over (ε)}₁, . . . , {tilde over (ε)}_T. For example, considering {tilde over (V)}_T+1|{tilde over (V)}₁, . . . , {tilde over (V)}_Tincludes time-series models of the factors such as vector autoregressive processes and considering {tilde over (ε)}_T+1|{tilde over (ε)}₁, . . . , {tilde over (ε)}_Tincludes models with time-varying idiosyncratic variances, i.e., GARCH processes.

Statistical model (2): Let {tilde over (F)}_t, t≥1 be a sequence of independently and identically distributed random variables. Conditional on {tilde over (F)}_t, let {tilde over (V)}_tand {tilde over (ε)}_t, t≥1 be each independently and identically distributed random variables. Then, {tilde over (R)}_t|{tilde over (F)}_t={tilde over (F)}_t^T{tilde over (V)}_t|{tilde over (F)}_t+{tilde over (ε)}_t|{tilde over (F)}_tis an independently and identically distributed random variable. We observe at each period t=1, . . . T an outcome R_t, V_t, F_t, and ε_tof {tilde over (R)}_t, {tilde over (V)}_t, {tilde over (F)}_t, and {tilde over (ε)}_t, respectively and at the current period T+1. at which a portfolio decision is to be made, an outcome F_tof {tilde over (F)}_t. At the current period T÷1 we write the random vector of asset returns as

{tilde over (R)}
_T+1
|{tilde over (R)}
₁
,{tilde over (F)}
₁
, . . . ,{tilde over (R)}
_T
,{tilde over (F)}
_T
,{tilde over (F)}
_T+1
={tilde over (F)}
_T÷1
{tilde over (V)}
_T+1
|{tilde over (V)}
₁,{tilde over (ε)}₁, . . . ,{tilde over (V)}_T,{tilde over (F)}_T,{tilde over (F)}_T+1+{tilde over (ε)}_T+1|{tilde over (ε)}₁,{tilde over (F)}₁, . . . ,{tilde over (ε)}_T,{tilde over (F)}_T,{tilde over (F)}_T+1.

Based on independence,

{tilde over (R)}
_T÷1
|{tilde over (F)}
_T+1
={tilde over (R)}
_T+1
|{tilde over (R)}
₁
,{tilde over (F)}
₁
, . . . ,{tilde over (R)}
_T
{tilde over (F)}
_T
,{tilde over (F)}
_T+1,

V
_T+1
|{tilde over (F)}
_T+1
={tilde over (V)}
_T+1
|{tilde over (V)}
₁
,{tilde over (F)}
₁
, . . . ,{tilde over (V)}
_T
,{tilde over (F)}
_T
,{tilde over (F)}
_T+1

and

{tilde over (ε)}_T÷1|{tilde over (F)}_T+1={tilde over (ε)}_T+1|{tilde over (ε)}₁,{tilde over (F)}₁, . . . ,{tilde over (ε)}_T,{tilde over (F)}_T,{tilde over (F)}_T+1,

thus,

{tilde over (R)}
_T+1
|{tilde over (F)}
_T+1
={tilde over (F)}
_T+1
^T
{tilde over (V)}
_T+1
|{tilde over (F)}
_T+1+{tilde over (ε)}_T+1|{tilde over (F)}_T+1.

Since at period T+1, an outcome F_T+1of {tilde over (F)}_T+1is observed, we may write

{tilde over (R)}=F
^T
{tilde over (V)}+{tilde over (ε)}

by setting {tilde over (R)}≡{tilde over (R)}_T+1|{tilde over (F)}_T+1=F_T+1, {tilde over (V)}≡{tilde over (V)}_T+1|{tilde over (F)}_T+1=F_T+1, {tilde over (ε)}≡{tilde over (ε)}_T+1|{tilde over (F)}_T+1=F_T+1, and F≡F_T÷1, thereby suppressing the time index T+1 and the dependency on the observed value F_T+1of {tilde over (F)}_T÷1. Accordingly, {tilde over (ε)}=N(0, Σ), where Σ=diag(σ_i²).

Statistical model (2) is applicable to fundamental factor models, where factor loadings may be asset-specific fundamental quantities, say, data derived from accounting statements, such as earnings (over price ratio), dividend yield, past performance, etc., and a factor may be defined as the return of a (long-short) portfolio that has an exposure of one to a specific factor loading and a zero exposure to all other factor loadings considered in the factor model. Extensions of the model include possible time dependency of {tilde over (V)}_T+1and/or {tilde over (ε)}_T÷1, by, defining the conditional distributions {tilde over (R)}≡{tilde over (R)}_T+1|{tilde over (R)}₁, {tilde over (F)}₁, . . . , {tilde over (R)}_T, {tilde over (F)}_T, {tilde over (F)}_T+1, {tilde over (V)}≡{tilde over (V)}_T+1|{tilde over (V)}₁, {tilde over (F)}₁, . . . , {tilde over (V)}_T, {tilde over (F)}_T, {tilde over (F)}_T+1and/or {tilde over (ε)}≡{tilde over (ε)}_T+1|{tilde over (ε)}₁, {tilde over (F)}₁, . . . , {tilde over (ε)}_T, {tilde over (F)}_T, {tilde over (F)}_T+1. For example, defining {tilde over (V)}_T+1|{tilde over (V)}₁, {tilde over (F)}₁, . . . , {tilde over (V)}_T, {tilde over (F)}_T, {tilde over (F)}_T+1would allow for time-series models of the factors and/or defining {tilde over (ε)}_T+1|{tilde over (ε)}₁, {tilde over (F)}₁, . . . , {tilde over (ε)}_T, {tilde over (F)}_T, {tilde over (F)}_T+1would allow for models with time-varying idiosyncratic variances, i.e., GARCH processes.

The factor model representation {tilde over (R)}=F^T{tilde over (V)}+{tilde over (ε)}, based on statistical models (1) and (2), covers a wide range of factor models that have been developed for representing asset returns in practical situations.

In any portfolio optimization problem, one needs to maximize the expected value of a function of the portfolio return, say, max EG({tilde over (R)}^Tx). In mean-variance portfolio optimization this would result in max

$E {\tilde{R}}^{T} x - \frac{γ}{2} var ({\tilde{R}}^{T} x),$

in utility maximization max Eu(1+{tilde over (R)}^Tx), and in mean-risk optimization max

$E {\tilde{R}}^{T} x - \frac{γ}{2} Risk ({\tilde{R}}^{T} x) .$

For asset returns following a factor model,

EG({tilde over (R)}^Tx)=EG((F^T{tilde over (V)}+{tilde over (ε)})^Tx)

and calculating this expectation, without taking advantage of any special structure, involves multiple integration:

$EG ({\tilde{R}}^{T} x) = \int_{- \infty}^{+ \infty} \int_{- \infty}^{+ \infty} \dots \int_{- \infty}^{+ \infty} G ({(F^{T} \tilde{V})}^{T} x + \sum_{i} σ_{i} x_{i} z_{i}) dP (\tilde{V}) p_{1} (z_{1}) \dots p_{n} (z_{n}) {dz}_{1} \dots {dz}_{n},$

where P({tilde over (V)}) is the cumulative distribution function of the factors {tilde over (V)} and where z_i=N(0,1) is an independent unit normal distribution with density function

$p_{i} (z_{i}) = \frac{1}{\sqrt{2 π}} e^{- z_{i}^{2} / 2},$

for each asset i=1, . . . , n. In this formulation, portfolio returns are a linear function of the portfolio weights x.

In order to maximize EG({tilde over (R)}^Tx) one needs function evaluations and gradients as a function of the portfolio weights x or at any given value of x.

For a given portfolio with weights x, the factor model returns of a portfolio,

{tilde over (R)}
_T
x=(F^Tx{tilde over (V)}+{tilde over (ε)})^Tx=(F^T{tilde over (V)})^Tx+{tilde over (ε)}^Tx,

may be expressed as

{tilde over (R)}
^T
x=(F^T{tilde over (V)})^Tx+σ(x)z,

based on the assumption that the idiosyncratic returns are each independently normally distributed, from which

$σ (x) = \sqrt{x^{T} Σ x} = \sqrt{\sum_{i = 1}^{n} σ_{i}^{2} x_{i}^{2}}$

$and$

$z = N (0, 1)$

is a one dimensional unit normal distribution. Thus,

EG({tilde over (R)}^Tx)=∫_−∞^+∞∫_−∞^+∞G((F^T{tilde over (V)})^Tx+σ(x)z)dP({tilde over (V)})p(z)dz,

where

$p (z) = \frac{1}{\sqrt{2 π}} e^{- z^{2} / 2}$

is the density of the unit normal distrubution.

With this reformulation, portfolio returns (F^T{tilde over (V)})^Tx+σ(x)z are a nonlinear function of the portfolio variables x, since they depend on σ(x). The function √{square root over (x^TΣx)} is convex with respect to the portfolio variables x, since compound functions of the type

${(\sum_{i} {g_{i} (x)}^{q})}^{\frac{1}{q}}$

are convex for convex and nonnegative functions g_i(x); see, e.g., S. P. Boyd and L. Vandenberghe, Convex Optimization, Cambridge University Press, Cambridge, UK, 2004. In this context, g_i(x)=√{square root over (σ_i²x_i²)} is convex (linear) in x and nonnegative.

The expectation

EG({tilde over (R)}^Tx)=∫_−∞^+∞∫_−∞^+∞G((F^T{tilde over (V)})^Tx+σ(x)z)dP({tilde over (V)})p(z)dz,

is exactly the same as the linear form in x introduced above:

for any value of x. If two functions with respect to x have the same function value for any value of x, then they are the same function, irrespective of their inner workings. Thus, if one function is concave (convex) in x so is the other. From the latter expression for EG({tilde over (R)}^Tx) it follows that EG({tilde over (R)}^Tx) is concave (convex) if the function GO is a concave (convex) function in its argument. This is the case, because compound functions G(h(x)) are concave (convex) if G(·) is concave (convex) and h(x) is linear in x, and because the expectation of a concave (convex) function in x with respect to a random variable is a concave (convex) function in x; see, e.g., S. P. Boyd and L. Vandenberghe, Convex Optimization, Cambridge University Press, Cambridge, UK, 2004. Thus, we proved that

EG({tilde over (R)}^Tx)=∫_−∞^+∞∫_−∞^+∞G((F^T{tilde over (V)})^Tx+σ(x)z)dP({tilde over (V)})p(z)dz,

is a concave (convex) function with respect to x, if G(·) is a concave (convex) function in its argument.

This is not obvious, since, as stated above, (F^T{tilde over (V)})^Tx+σ(x)z is not concave (convex) with respect to x, because σ(x)z is convex for outcomes z>0 and concave for outcomes z<0. Thus, G((F^T{tilde over (V)})^Tx+σ(x)z) is not necessarily concave (convex) with respect to x, even if G(·) is concave (convex) in its argument; but it may be, depending on how strongly concave (convex) G(·) is. For the convex risk measures discussed herein we have empirically observed that G((F^T{tilde over (V)})^Tx+σ(x)z) (defined as the negative of Risk({tilde over (R)}^Tx)) is typically concave in x, but for expected utility maximization with a very low risk aversion parameter, where G(·) is almost linear in its argument, G((F^T{tilde over (V)})^Tx+σ(x)z) may not be concave in x. This demonstrates that the expectation of a non concave (non convex) function can be a concave (convex) function.

We now approximate the unit normal random variable z by a discrete random variable

ζ=(z_v,p_v)

with realizations z_voccurring with probability p_v, for v=1, . . . , m. That is, the continuous unit normal distribution is represented by a histogram with the properties that its mean is zero, E(ζ)=0, its variance is approximately one, E(ζ²)≈1, and its higher moments match those of the unit normal distribution. For a sufficiently large number of discrete outcomes, the discrete representation closely approximates the unit normal distribution, i.e.,

$\lim_{m \to \infty} \max_{z} \langle (z) - Q (z) \rangle = 0,$

and the cumulative distribution function of the discrete approximation custom-character (z) would match closely the cumulative distribution function Q(z) of the unit normal distribution.

A discrete approximation of the unit normal distribution, obtained using optimization, is given in Table 1. It is preferably based on 51 equally spaced points between −5 and +5 and very closely matches the unit normal distribution. For practical Purposes this seems sufficiently accurate, since the tail area of

$\frac{1}{\sqrt{2 π}} \int_{- \infty}^{- 5} {ze}^{- \frac{z^{2}}{2}} dz = 2.8665 e - 07$

of probability mass is all that is not captured on either side of the unit normal distribution. (Using 6 standard deviations, the one sided error would be 9.8659e-10.) Its first 8 moments are: mean=0.000000, variance=1.000000, skewness=0.000000, kurtosis=3.000000, m₅=0.000000, m₆=15.000000, m₇=0.000000, m₈=105.000000.

Using the discrete approximation ζ of the unit normal random variable, we may express the returns of a portfolio generated by the factor model for any outcome z_vof ζ as

{tilde over (R)}
_v(x)=(F^T{tilde over (V)})^Tx+σ(x)z_v,

and compute expectations as

$EG ({\tilde{R}}_{v} (x)) = \int_{- \infty}^{+ \infty} \sum_{v} G ({(F^{T} \tilde{V})}^{T} x + σ (x) z_{v}) p_{v} dP (\tilde{V}) .$

The multi-variate distribution {tilde over (V)}, may have different forms depending on the factors used. It may exhibit fat tails and may be peaky. The statistics of the population of {tilde over (V)} are typically not known and thus a parametric representation appears difficult. Instead, we proceed non-parametrically. Since {tilde over (V)}_t, t=1, . . . , T+1, are assumed independently and identically distributed, observed outcomes V_tat periods t=1, . . . , T are also observed outcomes of {tilde over (V)}≡{tilde over (V)}_T+1. Let {tilde over (V)} be a random variable defined by the empirically observed outcomes V_twith corresponding, probability

$p_{t} = \frac{1}{T} .$

Using {tilde over (V)}, we obtain an entirely discrete representation of the factor model returns as

R
_tv(x)=(F^TV_t)^Tx+σ(x)z_v

with associated probabilities p_tv=p_tp_v. Defining a new discrete random vector custom-character (x)=(R_tv(x),p_tv), with outcomes R_tv(x) and associated probability p_tv, and using the sample-average approximation, using {tilde over (V)}, we calculate the conditional expectation, given ζ=z_v, as

$EG ({\tilde{R}}_{v} (x)) | ζ = \sum_{t} G ({(F^{T} V_{t})}^{T} x + σ (x) z_{v}) p_{t},$

where

$p_{t} = \frac{1}{T} .$

For sufficiently large number T of observations V_t, EG custom-character (x))|ζ closely approximates EG({tilde over (R)}_v(x))|ζ, as EG((x))|ζ→EG({tilde over (R)}_v(x))|ζ as T→∞.

Now we can calculate the expectation EG( custom-character (x)) entirely as a multiple sum:

$EG (\tilde{R} (x)) = \sum_{t} \sum_{v} G ({(F^{T} V_{t})}^{T} x + σ (x) z_{v}) p_{t} p_{v},$

and EG( custom-character (x)) closely approximates EG({tilde over (R)}(x)), as EG((x))→EG({tilde over (R)}(x)) as T→∞ and as ζ closely approximates z.

We, thus, have, for a general factor model representation of asset returns expressed portfolio returns as a function of x as a random variable with a discrete distribution. We call this approach semi-parametric approximation, since the idiosyncratic component of the asset returns is represented parametrically and the factor explained component is represented non-parametrically. Any expectation of functions of portfolio returns that may occur in a portfolio optimization model can therefore, be computed by multiple sums (over t and v), making portfolio optimization problems very tractable and amenable to solution. We have shown above that EG({tilde over (R)}^Tx) is concave (convex) in x if G(·) is concave (convex) in its argument, therefore portfolio optimization problems based on our reformulation are convex problems that can be solved with (convex) nonlinear programming techniques and any local optimum is also a global optimum.

In practice, the true parameters of the distributions {tilde over (R)}, {tilde over (V)}, and {tilde over (ε)} are unknown and need to be estimated. We obtain estimates of these unknown true quantities by actually estimating a factor model, based on the statistical models introduced above. There are three types of factor models generally in use: the macro-economic (see, e.g., Roll A. Chen, N. F. and S. A. Ross: Economic forces and the stock market, The Journal of Business, 59 (3): 383-404, 1986, the statistical and the fundamental factor model, see, e.g., B. A. Rosenberg: Extra-market components of covariance in security returns, Journal of Financial and Quantitative Analysis, 9 (2): 263-273, 1974. Macro-economic and statistical factor models are estimated based on statistical model (1) and fundamental factor models are estimated based on statistical model (2). See G. Connor: The three types of factor models: A comparison of their explanatory power, Financial Analysts Journal, 51 (3): 42-46, 1995, about the explanatory power of the three types of factor models. There are also hybrid models as a suitable combination of the three factor models. A related approach, includes E. F. Fama and K. R. French: The cross-section of expected stock returns, Journal of Finance, 47 (2): 427-465, 1992, and E. F. Fama and K. R. French: Common risk factors in the returns on stocks and bonds, Journal of Financial Economics, 33: 3-56, 1993, where factors are defined as returns of so called factor-mimicking portfolios. For the estimation of factor models, see also E. F. Fama and J. D. MacBeth: Risk, return, and equilibrium: Empirical tests, The Journal of Political Economy, 81 (3): 607-636, 1973.

In the following the discrete representation of the factor returns is applied to expected utility optimization, mean-variance optimization, and general mean-risk optimization to formulate and study the resulting models. By doing so we develop a semi-parametric approach for modeling and solving general types portfolio optimization problems.

It is useful to partition the factor explained returns F^T{tilde over (V)} into a demeaned part F^T{tilde over (V)}₀and its mean vector μ=F^TE{tilde over (V)}, where {tilde over (V)}=μ+{tilde over (V)}₀. With this partition, the factor-explained returns are {tilde over (R)}_F=μ+F^T{tilde over (V)}₀and the factor model returns may be expressed as {tilde over (R)}=μ+F^T{tilde over (V)}+{tilde over (ε)}. Accordingly, we denote observed outcomes of {tilde over (V)}₀as V_0tand observed outcomes of {tilde over (R)}_Fas {tilde over (R)}_Ft.

Expected Utility Maximization

It is desirable to use the factor model also for expected utility optimization. The corresponding portfolio optimization problem is stated as

max E u(1+(F^T{tilde over (V)}+{tilde over (ε)})^Tx)

Ax=b,l≤x≤h

We note that the objective function includes the expectation over the random vector (F^T{tilde over (V)}) and over the continuous n-dimensional random vector {tilde over (ε)}. This is why this problem is considered to be difficult.

Expected utility maximization problems have been put forward using a sample-average approximation based on historical return observations in R. C. Grinold, Mean-variance and scenario-based approaches to portfolio selection, Journal of Portfolio Management, 25(2), pages 10-22, 1999:

$\max \frac{1}{T} \sum_{t} u (I + R_{t}^{T} x)$

$Ax = b, l \leq x \leq h$

TABLE 1

Discrete representation of the unit normal distribution

z_v
z_v
p_v

−5.0000
5.0000
0.000000536685270

−4.8000
4.8000
0.000000849689232

−4.6000
4.6000
0.000001845241790

−4.4000
4.4000
0.000004525164464

−4.2000
4.2000
0.000010663489269

−4.0000
4.0000
0.000028325893333

−3.8000
3.8000
0.000061546829887

−3.6000
3.6000
0.000124946943306

−3.4000
3.4000
0.000246546756629

−3.2000
3.2000
0.000473614054347

−3.0000
3.0000
0.000880247095147

−2.8000
2.8000
0.001575730929924

−2.6000
2.6000
0.002710675043685

−2.4000
2.4000
0.004477252136737

−2.2000
2.2000
0.007099906873363

−2.0000
2.0000
0.010810776888386

−1.8000
1.8000
0.015808766582307

−1.6000
1.6000
0.022204750725027

−1.4000
1.4000
0.029961198684615

−1.2000
1.2000
0.038839350894021

−1.0000
1.0000
0.048374186883117

−0.8000
0.8000
0.057889903396360

−0.6000
0.6000
0.066565812031542

−0.4000
0.4000
0.073547303686698

−0.2000
0.2000
0.078082629138417

0.0000
0.0000
0.079655674554058

where the return observations R_tare calibrated to reflect forward-looking estimates of mean return and volatility. The sample average model is a good approximation as long as T>>n, since only then is the problem of full rank and statistically viable. For large-scale utility maximization problems, arising in equities, where n>T, the sample average approximation based on historical return observations is not a satisfactory approximation.

A more promising approach may be to use sampling from the factor model representation of asset returns. Also in this case, in order to represent the distribution of asset returns accurately and to obtain a problem of full rank, the sample size needs to be very large, i.e., T>>n. However, this may be computationally prohibitive for a large number of assets.

Approximations to related versions of the expected utility maximization problem based on a factor model of asset returns have been put forward by M. W. Brandt, P. Santa Clara, and R. Valkanov, Parametric portfolio policies: Exploiting characteristics in the cross section of equity returns, Review of Financial Studies, 22(9), pages 3411-3447, 2004, and by S. De Boer, Factor tilting for expected utility maximization, Journal of Asset Management 11, pages 31-42, 2010. M. W. Brandt et al. built an expected utility maximization model with factor exposures as the decision variables, and the portfolio weights were subsequently derived from the estimated factor loadings and the optimal factor exposures. This model assumes constant factor exposures over time. S. De Boer calculates first the expected utility optimal portfolio weights for a given factor exposure parametrically as a function of possible factor exposures, and then solves the expected utility optimization problem in the factor space. Both methods are, approximations and appear unable to handle general constraints.

We proceed differently and, using the semi-parametric approach, formulate and solve the expected utility optimization problem directly:

maxΣ_tΣ_vu(1+R_Ft^Tx+σ(x)z_w)p_tp_v

Ax=b,l≤x≤h,

where σ(x)=√{square root over (x^TΣx)}. This is a discrete formulation with T*m realizations representing accurately the factor model returns, where for each outcome t there are m outcomes representing the unit normal distribution multiplied by the nonlinear term σ(x). Like the original expected utility maximization problem, this reformulated expected utility maximization problem is a convex problem for concave utility functions u(·).

Gradients with respect to the decision variables x are obtained as

$\frac{\partial}{\partial x_{i}} \sum_{t} \sum_{v} u (1 + R_{Ft}^{T} x + σ (x) z_{v}) p_{t} p_{v} = \sum_{t} \sum_{v} u^{'} (1 + R_{Ft}^{T} x + σ (x) z_{v}) (R_{Fi} + \frac{1}{σ (x)} σ_{i}^{2} x_{i} z_{v}) p_{t} p_{v}$

with σ(x)=√{square root over (x^TΣx)}. The expected utility maximization problem is then solved using a gradient-based nonlinear optimization algorithm; see for example, MINOS, B. A. Murtagh and M. A. Saunders, Minos user's guide, Technical Report SOL 83-20, Department of Operations Research, Stanford University, Stanford Calif. 94305, 1983.

Thus, we formulated the expected utility maximization problem as a nonlinear optimization problem with linear constraints. It is a convex problem if u(·) is concave.

Calibrating the Expected Utility Model to a Benchmark

Equilibrium returns can be obtained from the expected utility maximization model.

We write the vector of mean returns, μ=F^TE{tilde over (V)}, predicted by the factor model, as the sum of two components, an unconditional part, equilibrium returns implied by the market, and a conditional (on the factor model) part:

μ=μ_e+μ_e,

where μ_eis the vector of the unconditional equilibrium mean asset returns and μ_cis the vector of the conditional part of mean asset returns, conditioned on the factor model used. For convenience, we define as

{tilde over (η)}_t=F^T{tilde over (V)}_0t,

the demeaned factor-explained return.

Thus, the factor model returns may be written as:

{tilde over (R)}=μ
_c+μ_e+{tilde over (η)}_t+{tilde over (ε)}

and its demeaned part as

{tilde over (R)}
_u=μ_e+{tilde over (η)}_t+{tilde over (ε)}.

The goal in calibration is to determine an unconditional mean vector μ_esuch that the expected utility maximization problem

max E u(1+(μ_e+{tilde over (η)}_t+{tilde over (ε)})^Tx)

e
^T
x=1

for u=u_Bresults in the benchmark portfolio x₃. The benchmark weights x_Bare considered as representative of the weights of the market portfolio. Thus, the equilibrium weights x_Bimply unconditional expected returns μ_e. The optimization problem includes only e^Tx=1 as a constraint, and also no bounds on holdings are needed since constraints and bounds are not relevant for the benchmark portfolio.

The market equilibrium approach has been introduced for a mean-variance based market equilibrium by F. Black and R. Litterman, Global portfolio optimization, Financial Analysts Journal, 48(5), pages 28-43, 1992, and extended for a scenario-based utility maximization equilibrium by R. C. Grinold, Mean-variance and scenario-based approaches to portfolio selection, Journal of Portfolio Management, 25(2), pages 10-22, 1999. We now present an equilibrium model for expected utility optimization when returns follow a factor model.

The Lagrangian function is

L(x,λ)=E u(1+{tilde over (R)}_u^Tx)+λ(1−e^Tx)

and the optimality conditions are

$\frac{\partial L (x, λ)}{\partial x} = {Eu}^{'} (1 + {\tilde{R}}_{u}^{T} x) R_{u} - λ e = 0$

$and$

$\frac{\partial L (x, λ)}{\partial λ} = 1 - e^{T} x = 0,$

where {tilde over (R)}_u=μ_e+{tilde over (η)}_t+{tilde over (ε)}.

For x=x_Band u=u_B(say, for the power utility function, γ=γ_B)

E u′
_B(1+_u^Tx_B){tilde over (R)}_u−λe=0

Multiplying by x_B^Ton the left, one obtains

E u′
_B(1+{tilde over (R)}_u^Tx_B){tilde over (R)}_uB−λ=0,

by setting {tilde over (R)}_uB=x^TD{tilde over (R)}_u, the unconditional return of the benchmark, and noting that x_B^Te=1. Thus,

λ=E u′_B(1+{tilde over (R)}_uB){tilde over (R)}_uB,

Substituting for λ, one obtains for each i=1, . . . , n:

E u′
_B(1+{tilde over (R)}_uB){tilde over (R)}_u,i−E u′_B(1+{tilde over (R)}_uBB){tilde over (R)}_uB=0.

Expanding {tilde over (R)}_u, denoting μ_B=μ_e^Tx_B, {tilde over (η)}_Bt={tilde over (η)}_t^Tx_B, and {tilde over (ε)}_B={tilde over (ε)}^Tx_B, one obtains for i=1 . . . , n:

E u′
_B(1+{tilde over (R)}_uB)μ_ei+E u′_B(1+{tilde over (R)}_uB)({tilde over (η)}_ti+{tilde over (ε)}_i)−E u′_B(1+{tilde over (R)}_uB)μ_B−Eu′_B(1+{tilde over (R)}_uB)({tilde over (η)}_Bt+{tilde over (ε)}_B)=0.

Thus, one obtains for a given value of μ_B:

$μ_{ei} - μ_{B} = - \frac{E u_{B}^{'} (1 + {\tilde{R}}_{uB}) ({\tilde{η}}_{ti} - {\tilde{η}}_{Bt} - {\tilde{ϵ}}_{i} - {\tilde{ϵ}}_{B})}{E u_{B}^{'} (1 + {\tilde{R}}_{uB})}, i = 1, \dots, n .$

Further expanding for {tilde over (R)}_uB=μ_Bt+{tilde over (η)}_Bt+{tilde over (ε)}_B, we obtain as the solution

${\dot{μ}}_{ei} = μ_{B} - \frac{E u_{B}^{'} (1 + μ_{B} + {\tilde{η}}_{Bt} + {\tilde{ϵ}}_{B}) ({\tilde{η}}_{ti} - {\tilde{η}}_{Bt} - {\tilde{ϵ}}_{i} - {\tilde{ϵ}}_{B})}{E u_{B}^{'} (1 + μ_{B} + {\tilde{η}}_{Bt} + {\tilde{ϵ}}_{B})}, i = 1, \dots, n .$

One can easily see that the obtained ratio expression defines the covariance between a random variable ⊖,

$⊖ = \frac{- u_{B}^{'} (1 + μ_{B} + {\tilde{η}}_{Bt} + {\tilde{ϵ}}_{B})}{E u_{B}^{'} (1 + μ_{B} + {\tilde{η}}_{Bt} + {\tilde{ϵ}}_{B})},$

and the difference

{tilde over (R)}
_ui
−{tilde over (R)}
_uB.

Thus, one may write

μ_ei=μ_B+cov(⊖,{tilde over (R)}_i−{tilde over (R)}_B)

and note that the unconditional mean return for each asset equals the benchmark return plus the covariance between the variable ⊖, defined only by quantities of the benchmark, and the difference between the return of each asset i and the benchmark return; see R. C. Grinold, Mean-variance and scenario-based approaches to portfolio selection, Journal of Portfolio Management, 25(2), pages 10-22, 1999. This reflects an equilibrium pricing equation for expected utility optimization, and in particular here for asset returns represented by a factor model.

However, the equilibrium pricing equation did not help in the actual computation. Therefore we proceed to integrate the original equation and note that the random variables {tilde over (ε)}_tand {tilde over (ε)}_Bare correlated (per the definition {tilde over (ε)}_B={tilde over (ε)}^Tx_B) with covariance

cov({tilde over (ε)}_i,{tilde over (ε)}_B)=x_Biσ_i².

Thus, one may write for i=1, . . . , n,

$μ_{ei} = μ_{B} - \frac{E u_{B}^{'} (1 + μ_{B} + {\tilde{η}}_{Bt} + σ_{B} z_{1}) ({\tilde{η}}_{ti} - {\tilde{η}}_{Bt} - σ_{i} z_{2} - σ_{B} z_{1})}{E u_{B}^{'} (1 + μ_{B} + {\tilde{η}}_{Bt} + σ_{B} z)}$

where σ_B=√{square root over (x_B^TΣx_B)} and where z₁and z₂are each N(0,1) with covariance

$\frac{x_{Bi} σ_{i}}{σ_{B}} .$

With

$c_{iB} = corr ({\tilde{ϵ}}_{i}, {\tilde{ϵ}}_{B}) = \frac{ϰ B_{i} σ_{i}}{σ_{B}}$

we may compute the expectation as follows:

$E u_{B}^{'} (1 + μ_{B} + {\tilde{η}}_{Bt} + σ_{B} z_{1}) ({\tilde{η}}_{ti} - {\tilde{η}}_{Bt} + σ_{i} z_{2} - σ_{B} z_{1}) = \sum_{t} p_{t} \int_{- \infty}^{+ \infty} \int_{- \infty}^{+ \infty} u_{B}^{'} (1 + μ_{B} + {\tilde{η}}_{Bt} + σ_{B} z_{1}) ({\tilde{η}}_{ti} - {\tilde{η}}_{Bt} + σ_{i} z_{2} - σ_{B} z_{1}) p_{i} (z_{1}, z_{2}) {dz}_{1} {dz}_{2}$

where p_i(v, w) is the density function of the bivariate unit normal distribution,

$p (z_{1}, z_{2}) = \frac{1}{2 π \sqrt{1 - c_{iB}^{2}}} \exp (- \frac{1}{2 (1 - c_{iB}^{2})} [z_{1}^{2} + z_{2}^{2} - 2 c_{iB} z_{1} z_{2}]), and$

$E u_{B}^{'} (1 + μ_{B} + {\tilde{η}}_{Bt} + σ_{B} z) = \sum_{t} p_{t} \frac{1}{\sqrt{2 π}} \int_{- \infty}^{+ \infty} u_{B}^{'} (1 + μ_{B} + {\tilde{η}}_{Bt} + σ_{B} z) e^{- \frac{z^{2}}{2}} dz$

where we integrate numerically between, say, −5 and 5, which we found sufficiently accurate, using the known trapezoidal method. This had been proposed by G. Infanger, U.S. Pat. No. 8,548,890 B2, issued Oct. 1, 2013.

In the spirit of using a discrete approximation of the bivariate unit normal distribution, one could calculate p(z_1v₁, z_2v₂) corresponding to z_1v₁, z_2v₂for given correlation coefficient cis, using numerical integration. But the probability mass function would have to be computed separately for each asset i due to the dependency on c_iB. From a computational perspective, this would be similar to doing the integration directly as described above.

Therefore, we propose the following way of integrating via discrete approximation. Let L be the Cholesky factorization of the covariance matrix of the bivariate unit normal distribution with correlation c, i.e.,

${LL}^{T} = (\begin{matrix} 1 & c \\ c & 1 \end{matrix}) .$

One obtains, using algebra,

$L = (\begin{matrix} 1 & 0 \\ c & \sqrt{1 - c^{2}} . \end{matrix})$

One may now obtain two dependent unit normal random variables

$(\begin{matrix} 1 & 0 \\ c & \sqrt{1 - c^{2}} . \end{matrix}) (\begin{matrix} z_{1} \\ z_{2} \end{matrix})$

that is, z₁and cz₁+√{square root over (1−c²)}z₂as linear functions of z₁and z₂that are correlated with correlation coefficient c. Thus, one may calculate the expectations as

$E u_{B}^{'} (1 + μ_{B} + {\tilde{η}}_{Bt} + σ_{B} z_{1}) ({\tilde{η}}_{ti} - {\tilde{η}}_{Bt} + σ_{i} z_{2} - σ_{B} z_{1}) = \sum_{t} \sum_{v_{1}} \sum_{v_{2}} u_{B}^{'} (1 + μ_{B} + {\tilde{η}}_{Bt} + σ_{B} z_{v_{1}}) ({\tilde{η}}_{ti} - {\tilde{η}}_{Bt} + σ_{i} (c_{iB} z_{v_{1}} + \sqrt{1 - c_{iB}^{2}} z_{v_{2}}) - σ_{B} z_{v_{1}}) p_{t} p_{v_{1}} p_{v_{2}}$

$and$

$E u_{B}^{'} (1 + μ_{B} + {\tilde{η}}_{Bt} + σ_{B} z_{v}) = \sum_{t} \sum_{v} u_{B}^{'} (1 + μ_{B} + {\tilde{η}}_{Bt} + σ_{B} z_{v}) p_{t} p_{v}$

by using discrete approximations (z_v₁p_v₁), (z_v₂p_v₂), and (z_v,p_v) of the unit normal distributions, z₁, x₂, and z, respectively. The multiple sums may be readily implemented in a modeling language.

For the calibration, one needs to first quantify μ_E. Noting that {tilde over (η)}_Bt=(F^TV_0t)^Tx_Band using the benchmark utility function u_B(for, say, the power utility function, γ=γ_B), one obtains the unconditional means μ_e.

The calibrated expected utility maximization model

Having obtained μ_efrom the above calibration, one may calculate

μ_c=μ−μ_e

and can write the expected utility maximization portfolio optimization model as follows:

$\max Eu (1 + {(\frac{1}{γ_{c}} μ_{c} + μ_{e} + {\tilde{η}}_{t} + \tilde{ϵ})}^{T} x)$

$Ax = b, l \leq x \leq h,$

where γ_cscales the conditional expected returns. We call γ_cthe active risk aversion, or the tilt parameter. The model will for u=u_B(for the power utility function, γ=γ_B) and γ_c→∝ result in the benchmark portfolio if the side constraints are relaxed. For smaller values of γ_cthe model will tilt away from the benchmark portfolio to follow the active predictions μ_c. But also the overall, risk aversion γ, and more generally the utility function, may be chosen differently to obtain a suitable portfolio, different from the benchmark portfolio.

We implement the model as

$\max Σ_{t} Σ_{v} u (1 + {(\frac{1}{γ_{c}} μ_{c} + μ_{e} + F^{T} V_{0 t})}^{T} x + σ (x) z_{v}) p_{t} p_{v}$

$Ax = b, l \leq x \leq h, where σ (x) = \sqrt{x^{T} Σx} .$

The result is a powerful model for active portfolio management, with the potential to effectively control downside risk by using an appropriate choice of utility function. It can be implemented in a modeling language, since the expected value is calculated based on multiple sums of discrete realizations representing with sufficient accuracy the factor model returns. No functions carrying out numerical integration need to be programmed.

Mean-Variance Optimization

In practical implementations of mean-variance portfolio optimization, the mean vector and the covariance matrix M need to be estimated. Historical, observations R_t, t=1, . . . , T, of {tilde over (R)} may be used to estimate the quantities. However, for a large number of assets, i.e., n>T, using sample averages directly to estimate the mean vector and the covariance matrix do not give the desired results, because the sample errors tend to be large and also the resulting covariance matrix is rank deficient (positive semidefinite rather than positive definite). In order to overcome this problem, factor models, as discussed above have been applied to model asset returns.

Using the factor model representation, the covariance matrix can be represented as

M=F
^T
M
_{{tilde over (V)}}
F+Σ,

where M_{{tilde over (V)}}is the k×k covariance matrix of the factors (or factor returns), and Σ=diag(σ_i²) is the diagonal matrix of idiosyncratic variance, where σ_i²is the variance of the i-th independent error term {tilde over (ε)}₂. The matrix M_{{tilde over (V)}}is, typically estimated using historical observations. Presuming all asset returns have positive variance, because Σ is diagonal, the resulting covariance matrix M is of full rank (rank(M)=n). The number of parameters to be estimated (nk for the factor loadings+k(k+1)/2 for the factor covariances+k for the means) is much smaller than without imposing the linear factor model ((n(n+1)/2 for the covariances+n for the means), especially when the number of factors k is kept reasonably small.

The mean-variance portfolio optimization problem based on a factor model representation of asset returns is

$\max {E (F^{T} \tilde{V})}^{T} x - \frac{γ}{2} x^{T} (F^{T} M_{\tilde{V}} F + \sum) x$

$Ax = b, l \leq x \leq h$

An equivalent formulation arises when using the semi-parametric formulation of the factor model in accordance with the present invention directly:

$\max μ^{T} x - \frac{γ}{2} \sum_{t} \sum_{v} {({(F^{T} V_{0 t})}^{T} x + σ (x) u_{v})}^{2} p_{t} p_{v}$

$Ax = b, l \leq x \leq h$

where σ(x)=√{square root over (x^TΣx)}. This is a scenario formulation of the mean-variance problem with Tm scenarios representing the covariance structure. The number of data points m representing N(0,1) determines the accuracy of the formulation. There does not appear to be a particular advantage of the scenario formulation over the typical factor model mean-variance formulation. If the unit normal distribution representation is sufficiently accurate, the scenario formulation will result in the same optimal portfolios as the typical factor model mean-variance formulation.

Calibrating the Mean-Variance Model to a Benchmark

The mean-variance portfolio optimization problem for calibrating unconditional expected returns is

$\max μ_{e}^{T} x - \frac{γ_{B}}{2} x^{T} M x$

$e^{T} x = 1$

where μ_eis the n-vector of unconditional mean returns to be determined, M=F^TM_{{tilde over (V)}}F+Σ is the n×n covariance matrix of asset returns, and γ_Bis the risk aversion parameter of the benchmark.

The Lagrangian function of the mean-variance problem is

$L (x, λ) = μ_{e}^{T} x - \frac{γ_{B}}{2} x^{T} M x + λ (1 - e^{T} x) .$

Setting all derivatives to zero one obtains:

$\frac{\partial L (x, λ)}{\partial x} = μ_{e} - γ_{B} M x - e λ = 0$

$and$

$\frac{\partial L (x, λ)}{\partial λ} = 1 - e^{T} x = 0.$

Multiplying on the left by x^T_B, one obtains

x
_B
^Tμ_e−γ_Bx_B^T(Mx_B)−λ=0

and it follows that

λ=μ_B−γ_Bσ₂^B,

where μ_B=μ_e^Tx_Band σ_B²=x_B^Tx_B. Now we obtain for a given value of γ_B

μ_e=γ_BMx_B+(μ_B−γ_Bσ_B²)e

Note that the second term (μ_B−γ_Bσ_B²)e is a constant and when added to the objective does not affect the solution of the mean-variance portfolio optimization problem. Therefore that term may be dropped. Thus,

μ_e=γ_BMx_B.

We actually calculate the equilibrium returns μ_eby exploiting the factor model form of the covariance matrix as

μ_e=γ_B(F^TM_{{tilde over (V)}}F+Σ)x_B.

Equilibrium returns obtained for the mean-variance model, of course, differ from those obtained earlier for expected utility optimization, but both are labeled as μ_e. It should be clear from the context which is referred to.

The calibrated mean-variance model

Having obtained μ_efrom the calibration, we calculate

μ_c=μ−μ_e

and may now write the mean-variance portfolio optimization model as:

$\max {(\frac{1}{γ_{c}} μ_{c} + μ_{e})}^{T} x - \frac{γ}{2} x^{T} (F^{T} M_{\tilde{V}} F + Σ) x$

$Ax = b, l \leq x \leq h$

or equivalently, using the semi-parametric formulation, as

${\max (\frac{1}{γ_{c}} μ_{c} + μ_{e})}^{T} x - \frac{γ}{2} \sum_{t}^{} \sum_{v}^{} {({(F^{T} V_{0 t})}^{T} x + σ (x) z_{v})}^{2} p_{t} p_{v} Ax = b, l \leq x \leq h$

The model will for γ=γ_Band γ_c→∞ result in the benchmark portfolio if the side constraints are relaxed. For smaller values of γ_cthe model will tilt away from the benchmark portfolio to follow the active predictions μ_c. The overall risk aversion γ may be chosen differently to obtain a suitable portfolio with a different risk than that of the benchmark portfolio.

Mean-Risk Optimization

Mean-risk optimization typically concerns portfolio optimization problems with risk measures other than portfolio variance, while mean-variance optimization is also a subset of mean-risk optimization with variance as the risk measure. Risk measures under consideration are either dispersion measures or downside risk measures. Besides variance (or standard deviation), important dispersion measures are mean absolute deviation and mean absolute moments. Important downside measures are semi-variance and lower partial moments, but most important in finance are the tail measures Value-at-Risk (VaR) and Conditional-Value-at-Risk (CVaR). We will present the formulation of the latter first and then discuss the other risk measures considered in mean-risk optimization.

Value-at-Risk and Conditional Value-at-Risk

Value-at-Risk is defined as

VaR_α({tilde over (R)}^Tx)=min{W: P(−{tilde over (R)}^Tx>W)≤α)}.

It is the smallest number W such that the probability of a loss greater than W is no more than α. The quantity α is called the loss tolerance, whereas the quantity (1−α) is referred to as the confidence, level. VaR_αdefines a quantile, for example the 5% quantile when α=0.05. An equivalent definition is VaR_α({tilde over (R)}^Tx)=−max{W: P({tilde over (R)}^Tx<W)≤α)}, where Value-at-Risk Is defined as the negative of the largest value W such that the probability of a return less than W is no more than α. The negative sign reflects that VaR is defined as a loss and not as a return.

For general distributions, Value-at-Risk is not a coherent measure of risk. It is not convex with respect to the portfolio variables x and is therefore difficult to optimize. However, Conditional-Value-at-Risk, defined as

CVaR_α({tilde over (R)}^Tx)=E{−{tilde over (R)}^Tx|{tilde over (R)}^Tx≤−VaR_α({tilde over (R)}^Tx)},

is convex with respect to the portfolio variables x. It is therefore well suited to mean-risk optimization, when tail risk is to be controlled. CVaR_αis the expected value of losses greater than or equal to the VaR_αquantile.

Other names for Conditional Value-at-Risk are Expected Shortfall and Tail Value of Risk. Since the losses in the tail are at least as much as VaR_α({tilde over (R)}^Tx), and CVaR_α({tilde over (R)}^Tx) is the average of these, it follows that

CVaR_α({tilde over (R)}^Tx)≥VaR_α({tilde over (R)}^Tx).

Thus, CVaR_αalways dominates VaR_α. Therefore one may use CVaR_αin an optimization as a conservative approximation to VaR_α.

Let

Ψ({tilde over (R)}^Tx,W)=∫_{{tilde over (R)}}_T_x≤−Wp({tilde over (R)})d{tilde over (R)}

be the probability of {tilde over (R)}^Tx not exceeding a threshold−W. For a fixed portfolio x it is the cumulative distribution function of a loss associated with portfolio x. It is generally nondecreasing with respect to −W and is continuous with respect to −W as long as there are no jumps. For simplicity, only distributions with conthmous distribution functions are considered. However, while involving substantial technical detail, the analysis extends to non-continuous distribution functions as well.

Using the definition of Ψ, VaR_α({tilde over (R)}^Tx) may be written as

VaR_α({tilde over (R)}^Tx)=min{W: Ψ({tilde over (R)}^Tx,W)≤α)}

and CVaR_α({tilde over (R)}^Tx) may also be written as

${CVaR}_{α} ({\tilde{R}}^{T} x) = \frac{1}{α} \int_{{\tilde{R}}^{T} x \leq - {VaR}_{α} ({\tilde{R}}^{T} x)} - {\tilde{R}}^{T} xp (\tilde{R}) d \tilde{R}$

Based on the above definitions; R. T. Rockafellar and S. Uryasev, Optimization of conditional value-at-risk, Portfolio Safeguard by AOrDa.com, 2(3), pages 21-41, 2000, defined the following function:

$F_{α} ({\tilde{R}}^{T} x, W) = W + \frac{1}{α} E {{(- {\tilde{R}}^{T} x - W)}^{+}},$

where F_a({tilde over (R)}^Tx, W), as a function of W, is convex and continuously differentiable in W for any value of x. Accordingly, CVaR_αand VaR_αfor any value of x may be obtained as:

$C V a R_{α} ({\tilde{R}}^{T} x) = \min_{W} F_{α} ({\tilde{R}}^{T} x, W) and$

${VaR}_{α} ({\tilde{R}}^{T} x) \in \arg \min_{W} F_{α} ({\tilde{R}}^{T} x, W) .$

Furthermore, CVaR_α({tilde over (R)}^Tx) is also convex. That is F_α({tilde over (R)}(x), W) is convex for convex functions {tilde over (R)}(x) and, therefore, also for the linear function {tilde over (R)}^Tx. Thus, CVaR is amenable to optimization:

$\min_{x} C V a R_{α} ({\tilde{R}}^{T} x) = \min_{W, x} F_{α} ({\tilde{R}}^{T} x, W),$

where the function F_α({tilde over (R)}^Tx, W) is jointly minimized over (W, x). Given linear (or convex) portfolio constraints, one may minimize this convex function subject to linear (or convex) constraints. Thus, the obtained CVaR portfolio optimization problem is a convex optimization problem.

In particular, R. T. Rockafellar and S. Uryasev, Optimization of conditional value-at-risk. Portfolio Safeguard by AOrDa.com, 2(3), pages 21-41, MOO showed that when using a sample R_ω, ω∈S_ω of {tilde over (R)},

$F_{α} (R_{ω}^{T} x, W) = W + \frac{1}{α} \frac{1}{\langle S_{ω} \rangle} \sum_{ω} {(- {\tilde{R}}_{ω}^{T} x - W)}^{+}$

and, for this case, the portfolio optimization problem can be formulated as a linear program:

$\begin{matrix} \min W + \frac{1}{α} \frac{1}{\langle S_{ω} \rangle} \sum_{ω} v_{ω} = & {CVaR}_{α} \\ v_{ω} + R_{ω}^{T} x_{i} + W \geq 0, & v_{ω} \geq 0, \\ e^{T} x = & 1 \\ μ^{T} x \geq & {\overline{r}}_{P} \end{matrix}$

where {tilde over (r)}_pis a predefined desired value of portfolio expected return. At the optimal solution one obtains W*=VaR_α(R_ω^T,x*), the Value-at-Risk of the optimal portfolio. Note that VaR_α(R_ω^Tx*) is a sample average approximation of VaR_α({tilde over (R)}^Tx*) based on a return sample ω∈Sω_.

We proceed differently by using the factor model returns in the expression for CVaR_α({tilde over (R)}^Tx)=minus F_α({tilde over (R)}^Tx, W) to obtain:

${CVaR}_{α} ({\tilde{R}}^{T} x) = \min_{W} {W + \frac{1}{α} E {{(- {(F^{T} \tilde{V} + \tilde{ɛ})}^{T} x - W)}^{+}}$

which is a convex function in x. Then,

${CVaR}_{α} (R^{T} x) = \min_{W} {W + \frac{1}{α} E {{(- {(F^{T} \tilde{V})}^{T} x + σ (x) z - W)}^{+}}$

is a convex function in x. Using the semi-parametric factor model representation of asset returns in accordance with the present invention we calculate the expectation by multiple sums to obtain

${CVaR}_{α} (R_{t v}^{T} x) = \min_{W} {W + \frac{1}{α} \sum_{t} \sum_{v} {{(- {(F^{T} V_{t})}^{T} x + σ (x) z_{v} - W)}^{+}} p_{t} p_{v},$

which is convex in x. One may implement this function via a nonlinear programming formulation as:

$\min W + \frac{1}{α} \sum_{t} \sum_{υ} v_{t v} p_{t} p_{v}$

$s / t {(F^{T} V_{t})}^{T} x + σ (x) z_{v} + v_{t v} + W \geq 0, v_{t v} \geq 0, \forall t, v$

and minimize it subject to linear or convex portfolio constraints, thereby solving a convex program.

Changing the sign of the objective and maximizing an afflne transform of the objective with a positive coefficient leads to maximizing a concave functiOn subject to linear (or convex) constraints; a convex optimization problem. The following presents a new formulation of the mean-risk portfolio optimization probleth with CVaR as the risk measure for the factor model representation R^Tx=(F^TV_t+{tilde over (ε)})^Tx of asset returns as

$\max μ^{T} x - \frac{γ}{2} (W + \frac{1}{α} \sum_{t} \sum_{v} v_{t v} p_{t} p_{v})$

$\begin{matrix} {(F^{T} V_{t})}^{T} x & + σ (x) z_{v} + ν_{t v} + W & \geq 0, v_{t v} \geq 0, \forall t, v \\ Ax & = b, l \leq x \leq h \end{matrix}$

where σ(x)=√{square root over (x^TΣx)}. At the optimal solution,

$W^{*} + \frac{1}{α} \sum_{t} \sum_{v} v_{t v}^{*} p_{t} p_{v} = {CVaR}_{α} ({\overline{R}}^{T} x^{*}),$

the optimal Conditional Value-at-Risk value, and W*=VaR_α({tilde over (R)}^Tx*), the Value-at-Risk value of the returns of the optimal portfolio.

Note that by using the semi-parametric factor model representation of asset returns in accordance with the present invention a nonlinear program based on the nonlinear discrete formulation with Tm realizations is obtained. It is a convex problem as outlined above. We solve the problem by using gradient-based nonlinear programming techniques.

As in the case of mean-variance portfolio optimization, one would use

$\frac{1}{γ_{c}} μ_{c} + μ_{e}$

instead of μ, in order to facilitate a measured approach to active portfolio management. The equilibrium returns μ_ewould be calibrated using the mean-variance model. The same also applies to the following models entailing different risk measures.

Other Risk Measures

Other Dispersion Measures

The following describes the mean-absolute-deviation (MAD) and the mean-absolute-moment (MAM) measures as part of mean-risk optimization.

The mean-absolute-deviation measure,

Risk({tilde over (R)}^Tx)=MAD({tilde over (R)}^Tx)=E|{tilde over (R)}^Tx−E({tilde over (R)}^Tx)|,

introduced by H. Konno and H. Yamazaki, Mean-absolute deviation portfolio optimization model and its application to Tokyo stock market, Management Science, 37(5), pages 519-531. 1991, results in exactly the same optimal portfolio as mean-variance optimization if asset returns are multi-variate normally distributed. However, the portfolios are different to the extent that asset returns deviate froth the multi-variate normal distribution. When a sample of asset returns is used to represent the asset distribution, the mean-MAD model results in a linear program, and is, thus, easier to solve than the quadratic mean-variance problem. However, using factor models of asset returns, large-scale mean-variance models are routinely solved in a short time.

We now introduce a new formulation of the mean-MAD model for asset returns following the factor model R^Tx=(F^T{tilde over (V)}+{tilde over (ε)})^Tx using the semi-parametric approximation:

$\max μ^{T} x - \frac{γ}{2} \sum_{t} \sum_{v} (v_{t v}^{+} + ν_{t v}^{-}) p_{t} p_{v}$

$\begin{matrix} - {(F^{T} V_{0 t})}^{T} x & - σ (x) z_{v} + ν_{t v}^{+} - ν_{t v}^{-} & = 0, v_{t v}^{+}, ν_{t v}^{-} \geq 0, \forall t, v \\ Ax & = b, l \leq x \leq h \end{matrix}$

where σ(x)=√{square root over (x^TΣx)}. It is a convex nonlinear problem, and can be solved using a modern nonlinear programming solver.

The mean-absolute-moment measure is defined as

Risk({tilde over (R)}^Tx)=MAM_q({tilde over (R)}^Tx)=E|{tilde over (R)}^Tx−E({tilde over (R)}^Tx)|^q,q>1,

Using the semi-parametric factor model representation of asset returns, the mean-MAM model can be stated directly as a convex nonlinear program (for q>1):

$\max μ^{T} x - \frac{γ}{2} \sum_{t} \sum_{v} {[{({(F^{T} V_{0 t})}^{T} x + σ (x) z_{v})}^{2}]}^{q / 2} p_{t} p_{v}$

$Ax = b, l \leq x \leq h$

where σ(x)=√{square root over (x^TΣx)}, and where V_0tis the observed demeaned part of the factor return. In order to account for the absolute value, for odd q, we first square the term in the objective and then raise it to the power of q/2. The problem may be solved using a gradient-based nonlinear programming solver. Note that for q=1, MAM₁({tilde over (R)}^T=MAD({tilde over (R)}^Tx). Thus, the mean-MAM_Iportfolio optimization problem equals the mean-MAD model described above.

An equivalent formulation that does not rely on the “squaring and then raising to the power of q/2” method of calculating the mean-absolute-moment is:

$\max μ^{T} x - \frac{γ}{2} \sum_{t} \sum_{v} {(v_{t v}^{+} + ν_{t v}^{-})}^{q} p_{t} p_{v}$

$\begin{matrix} - {(F^{T} V_{0 t})}^{T} x & - σ (x) z_{v} + ν_{t v}^{+} - ν_{t v}^{-} & = 0, v_{t ν}^{+}, ν_{t ν}^{-} \geq 0, \forall t, v \\ Ax & = b, l \leq x \leq h \end{matrix}$

It is a convex problem, and can be solved using a gradient-based nonlinear programming solver.

Other Downside Measures

The following describes the semi-variance and the lower-partial-moment measures as part of mean-risk optimization.

The semi-variance is defined as,

Risk({tilde over (R)}^Tx)=σ_semi²({tilde over (R)}^Tz)=Emin({tilde over (R)}^Tx−E({tilde over (R)}^Tx),0)²,

where only return outcomes smaller than the expected return are considered in the standard deviation. Squaring makes the negative return values positive. Thus, the semi-variance, as defined above, is a risk measure in the sense that negative returns represent risk. Using the semi-parametric factor model representation of asset return, the mean-semi-variance model can be stated as:

$\max μ^{T} x - \frac{γ}{2} \sum_{t} \sum_{v} v_{t v}^{2} p_{t} p_{v}$

$\begin{matrix} {(F^{T} V_{0 t})}^{T} x + σ (x) z_{v} + ν_{t v} & = 0, v_{t ν} \geq 0, \forall t, v \\ Ax & = b, l \leq x \leq h \end{matrix}$

where σ(x)=√{square root over (x^TΣx)}, and where V_0tis the demeaned part of the factor return. It is a convex nonlinear program, and is solved using a gradient-based nonlinear programming solver.

Lower partial moments are defined as

Risk({tilde over (R)}^Tx)=LPM_qw({tilde over (R)}^Tx)=E(−min({tilde over (R)}^Tx−W,0))^q,q≥1

where W is a predefined value of return and risk is considered as the expected value of the negative of returns that are below the level W raised to the power of q. Using the semi-parametric factor model representation of asset returns, the mean-LPM model can be stated as:

$\max μ^{T} x - \frac{γ}{2} \sum_{t} \sum_{v} v_{t v}^{q} p_{t} p_{v}$

$\begin{matrix} {(F^{T} V_{t})}^{T} x + σ (x) z_{v} + ν_{t v} & \geq W, v_{t v} \geq 0, \forall t, v \\ Ax & = b, l \leq x \leq h \end{matrix}$

where σ(x)=√{square root over (x^TΣx)} and where W is a predefined constant. For q=1, risk is a linear measure, namely, the expected value of returns below the level W. In this case, the mean-LPM model becomes a linear program: For q>1 the mean-LPM model is a convex nonlinear program, and may be solved using a nonlinear programming solver.

Risk Constraints

Constraints on risk measures may be effectively used to control certain risk metrics in a mean-variance optimization or expected utility optimization. For example, one may want to explicitly control variance or tracking error in an expected utility problem, or one may want to control Value-at-Risk in a mean-variance problem. With the different risk measures discussed above, and considering mean-variance, expected utility, and mean-risk optimization problems, there are many combinations to analyze. For example, one combination is controlling Value-at-Risk in a mean-variance optimization problem by adding, a Conditional Value-at-Risk constraint. We use the semi-parametric representation of the factor model returns in the problem formulation. Let p be a given maximal Value-at-Risk level. One may formulate the CVaR constraint as part of the mean-variance optimization, as follows:

$\max μ^{T} x - \frac{γ}{2} x^{T} (F^{T} M_{\overline{V}} F + \sum) x$

$Ax = b, l \leq x \leq h$

$W + \frac{1}{α} \sum_{t} \sum_{v} u_{tv} p_{t} p_{v} \leq {ρ (F^{T} V_{t})}^{T} x + σ (x) z_{v} + u_{tv} + W \geq 0, u_{tv} \geq 0, \forall t, v$

If the CVaR constraint is binding in the optimal solution, the variable W* represents the VaR value and the expression

$W^{*} + \frac{1}{α} \sum_{t} \sum_{v} u_{tv}^{*} p_{t} p_{v}$

represents the CVaR value of the optimal solution x* of the problem. If the CVaR constraint is not binding, the obtained values of W* and

$W^{*} + \frac{1}{α} \sum_{t} \sum_{v} u_{tv}^{*} p_{t} p_{v}$

may be arbitrary, and one only knows that both VaR and CVaR at the optimal solution x* are strictly less than ρ. A minor modification of the problem, where one assigns an extra variable to CVaR and penalizes it ever so slightly in the objective, will also provide VaR and CVaR values, when at the optimal solution the CVaR constraint is not binding.

Since the CVaR constraint is a convex constraint, the problem is a convex nonlinear programming problem.

Moment Constraints

As part of a portfolio optimization problem, higher moments of the returns distribution, in particular, skewness and/or kurtosis of portfolio returns, may need to be constrained. With portfolio returns defined as {tilde over (R)}^Tx, for a given portfolio x, skewness is defined as

$Skew = \frac{{E ({\tilde{R}}^{T} x - E ({\tilde{R}}^{T} x))}^{3}}{{({E ({\tilde{R}}^{T} x - E ({\tilde{R}}^{T} x))}^{2})}^{\frac{3}{2}}}$

and kurtosis is defined as the fourth standardized moment,

$Kurt = \frac{{E ({\tilde{R}}^{T} x - E ({\tilde{R}}^{T} x))}^{4}}{{({E ({\tilde{R}}^{T} x - E ({\tilde{R}}^{T} x))}^{2})}^{2}}$

The term {tilde over (R)}^Tx−E({tilde over (R)}^Tx) in the above definitions represents the demeaned portfolio returns. The normal distribution has a skewness of 0 and a kurtosis of 3. Therefore, excess kurtosis (K−3) is often used instead of kurtosis. Leptokurtic distributions (with a peakedness higher than that of the normal distribution) tend to have long tails also. An investor may seek a returns distribution that is not too negatively skewed and has limited kurtosis, in order to limit heavy left tails. Neither skewness nor kurtosis are convex functions of x.

We use the semi-parametric representation of factor model returns and calculate skewness and kurtosis. Using a variable u_tvrepresenting the demeaned portfolio returns,

v
_tv=(F^TV_0t)^Tx+σ(x)z_v,

where σ(x)=√{square root over (x^TΣx)}, skewness is expressed as

$Skew = \frac{{Eu}_{tv}^{3}}{{({Eu}_{tv}^{2})}^{\frac{3}{2}}}$

and kurtosis as

$Kurt = \frac{{Eu}_{tv}^{4}}{{({Eu}_{tv}^{2})}^{2}} .$

Constraining skewness to be greater than or equal to a given lower bound Skews, we formulate

Ev
_tv
³−Skew_l(Ev_tv²)^3/2≥0,

and using summation for calculating the expectations based on the discrete formulation of the factor model returns, one obtains

$\sum_{t} \sum_{v} u_{tv}^{3} p_{t} p_{v} - {{Skew}_{l} (\sum_{t} \sum_{v} u_{tv}^{2} p_{t} p_{v})}^{\frac{3}{2}} \geq 0.$

The formulas simplify if one seeks to constrain skewness to be nonnegative (Skew≥0),

$\sum_{t} \sum_{v} u_{tv}^{3} p_{t} p_{v} \geq 0.$

Constraining kurtosis to be less than or equal to a given upper bound Kurt_hyields the following relation

Ev
_tv
⁴−Kurt_h(Ev_tv²)²≤0

and using summation for calculating the expectations based on the discrete formulation of the factor model returns one obtains

$\sum_{t} \sum_{v} u_{tv}^{4} p_{t} p_{v} - {{Kurt}_{h} (\sum_{t} \sum_{v} u_{tv}^{2} p_{t} p_{v})}^{2} \leq 0.$

Estimates based on samples of ratios of expectations and expectations taken to a power may be biased. Other formulations for sample skewness and sample kurtosis exist that include factors for bias correction.

As an example, a mean-variance model may be formulated where skewness is constrained to be nonnegative and kurtosis is constrained to be less than or equal to three:

$\max μ^{T} x - \frac{γ}{2} \sum_{t} \sum_{v} u_{tv}^{2} p_{t} p_{v} Ax = b, l \leq x \leq h - {(F^{T} V_{0 t})}^{T} x - σ (x) z_{v} + u_{tv} = 0$

$\sum_{t} \sum_{v} u_{tv}^{3} p_{t} p_{v} \geq 0$

$\sum_{t} \sum_{v} u_{tv}^{4} p_{t} p_{v} - 3 {(\sum_{t} \sum_{v} u_{tv}^{2} p_{t} p_{v})}^{2} \leq 0.$

where σ(x)=√{square root over (x^TΣx)}. It is a non-convex optimization problem, and multiple local optima may exist.

Derivative Securities

Derivative securities are securities whose price depends on the price of an underlying security. Derivative securities include, for example, options, fonvards, futures, swaps and others. Derivatives may be considered on an underlying, portfolio of assets, which includes options on an individual asset or options on an index. In order to obtain the return of a derivative security, its value is determined at the end of the investment period depending the value (return) of the underlying security. This may be shown using the example of options on a portfolio of assets (index). Let K be the strike price and p be the price of the option, expressed as a fraction of the price of the underlying security (index). The return of a call option is

$r^{call} = \frac{\max {0, p_{s 0} (1 + {\tilde{R}}_{U}) - p_{s 0} K}}{p_{s 0} p} - 1,$

where p_a0is the price of the underlying security at the beginning of the investment period and {tilde over (R)}_Uis its return. Dividing by p_s0one obtains:

$r^{call} = \frac{\max {0, (1 + {\tilde{R}}_{U}) - K}}{p} - 1,$

the return of a call option depending: on the return of the underlying security. Similarly, the return of a put option is

$r^{put} = \frac{\max {0, k - (1 + {\tilde{R}}_{U})}}{p} - 1$

depending on the return of the underlying security. Let custom-character be a specific option, put or call, with relative strike price K and relative price p. Then, one may define the return of an option on the underlying portfolio x_Uas

r
_l
=f
_l({tilde over (R)}^Tx_U)

where f_l(·) is the return generating function of option custom-character and {tilde over (R)}_U={tilde over (R)}^Tx_U. In general, one may define τ_l=f_l({tilde over (R)}^Tx_U) as the return generation function of derivative security , defining its return based on the return {tilde over (R)}^Tx_Uof the underlying portfolio of assets.

Let f be the n_d-vector of return generating functions of various derivative securities and let y be the n_d-vector of holdings of these derivative securities. In portfolio optimization, one needs to maximize the expectation of a function G of the portfolio return. Accordingly, with derivatives in the portfolio one obtains:

max EG({tilde over (R)}^Tx+f^T({tilde over (R)}^Tx_U)y)

subject to portfolio constraints on n+n_dassets. Substituting the factor model representation. we obtain:

max EG((F^TV_t+{tilde over (ε)})^Tx+f^T((F^TV_t+{tilde over (ε)})^Tx_U)y)

and using the semi-parametric reformulation described above,

max EG(R_Ft^Tx+σ(x)z₁+f^T(R_Ft^Tx_U+σ_Uz₂)y)

where z₁and z₂are unit normal random variables correlated with correlation coefficient

$c_{xU} = \frac{x^{T} \sum x_{u}}{σ (x) σ_{U}} = \frac{\sum x_{i} x_{Ui} σ_{i}^{2}}{\sqrt{\sum_{i}^{} σ_{i}^{2} x_{i}^{2}} σ_{U}}$

and where σ_U=√{square root over (x_U^TΣx_U)}=√{square root over (Σ_iσ_i²x_Ui²)}. For any given value of x and y one may integrate this function using two-dimensional numerical integration as

$\max \sum_{t} \int_{- \infty}^{+ \infty} \int_{- \infty}^{+ \infty} G (R_{Fi}^{T} x + σ (x) z_{1} + f^{T} (R_{Fi}^{T} x_{U} + σ_{U} z_{2}) y) p_{t} p (z_{1}, z_{2}) {dz}_{1} {dz}_{2} where$

$p (z_{1}, z_{2}) = \frac{1}{2 π \sqrt{1 - c_{xU}^{2}}} \exp (- \frac{1}{2 (1 - c_{xU}^{2})} [z_{1}^{2} + z_{2}^{2} - 2 c_{iB} z_{1} z_{2}]) .$

This was proposed in G. Infanger, U.S. Pat. No. 8,548,890 B2, issued Oct. 1, 2013, where the numerical integration was carried out using the trapezoidal method integrating between −5 and 5. Gradients were provided and the resulting problem was solved using a gradient-based nonlinear optimization algorithm. The approach worked well and led to short computation times.

We now extend the approach to using the discrete approximation of the unit normal variables. Using the Cholesky factorization described above, one obtains two dependent unit random variables with correlation coefficient c_xUas z₁and c_xUz₁+√{square root over (1−c_xU²)}z₂, where now z₁and z₂are two independent unit normal variables. Using the discrete approximations (z_v₁,p_v₁) and (z_v₂,p_v₂) for z₁and z₂, one obtains:

$\max \sum_{t} \sum_{v_{1}} \sum_{v_{2}} G (R_{Fi}^{T} x + σ (x) z_{v_{1}} + f^{T} (R_{Fi}^{T} x_{U} + σ_{U} (c_{xU} z_{v_{1}} + \sqrt{1 - c_{xU}^{2}} z_{v_{2}})) y) p_{t} p_{v_{1}} p_{v_{2}}$

subject to portfolio constraints. The portfolio optimization problem including derivatives on an underlying portfolio of assets (index) can now be carried out using multiple sums. Thus, it can be implemented in a modeling system. Note that a single asset is a portfolio with one position and, thus, derivatives on a single asset are covered by this approach in the same way. However, derivatives on each of multiple underlying assets cannot be handled by the approach, since this would lead to many multiple sums (two plus one for each underlying asset) and would be computationally prohibitive.

Calculating Performance Statistics

In any portfolio optimization context, it is often useful to provide forward-looking (or a ante) statistics, including (predicted values of) portfolio mean return, standard deviation, and Sharpe ratio. These can be calculated based on the covariance structure of asset returns. Forward-looking statistics involving downside measures, including downside standard deviation, or Value-at-Risk are more difficult to calculate, unless they are based on assuming normally distributed returns. Using the factor model returns for a given portfolio x, forward-looking statistics involving downside risk can be calculated, taking advantage of the semi-parametric formulation of the factor model returns. The following shows this for the Sortino (SoR) ratio and for Value-at-Risk (VaR).

Forward-Looking Downside Target Standard Deviation and Sortino Ratio

For a Given Portfolio x_P. We May Write the Factor Model Returns as

R
_Ptv
=R
_Ft
^T
x
_P+σ(x_P)z_v,

where R_Ptvare Tm forward-looking return realizations with corresponding probability p_tp_v, representing the forward-looking returns distribution of portfolio x_P.

The downside target standard deviation is defined as

√{square root over (E[min({tilde over (R)}^Tx_P−r_f,0)]²)}

and is the square root of the lower partial moment of order 2 of {tilde over (R)}^Tx_P, with the risk-free rate r_fas the target rate.

The Sortino Ratio is defined as

$SoR = \sqrt{n_{Y}} \frac{E ({\tilde{R}}^{T} x_{P} - r_{f})}{\sqrt{{E (\min ({\tilde{R}}^{T} x_{P} - r_{f}, 0)]}^{2}}}$

This is the scaled ratio of the excess returns (over the risk-free rate r_f) divided by the downside target standard deviation. It is typically expressed in annual terms. If returns are observed n_Ytimes per year, the Sortino ratio obtained from the observations is annualized by multiplying with √{square root over (n_Y)}. For example, when using monthly observations, one multiplies by √{square root over (12)}.

The forward-looking Sortino Ratio based on the semi-parametric factor model returns is

$SoR = \sqrt{n_{Y}} \frac{E_{t} R_{FPt} p_{t} - r_{f}}{\sqrt{\sum_{t} \sum_{v} {[\min (R_{{Pt}_{v}} - r_{f}, 0)]}^{2}} p_{t} p_{v}}$

where r_fis the forward-looking risk-free rate (the current one). The denominator is the downside target standard deviation based on the factor model returns. It is also scaled by √{square root over (n_Y)}. Note that for T=60 observations and m=51, the total number of return realizations is 3,060, enough points to expect an accurate representation of downside frisk.

Forward-Looking VaR and CVaR for Portfolio Analysis

Using the return realizations obtained from the factor model representation with corresponding probabilities,

R
_Ptv
=R
_FPt÷σ_Pz_vp_tv=p_tp_v

we may compute the Value-at-Risk (VaR_α) by sorting the outcomes of R_Ptvfrom the smallest to the largest value (maintaining the corresponding p_tv). Call the sorted outcomes r_j, j=1, . . . , Tm such that after sorting, (r₁, p₁) is the smallest outcome with corresponding probability. (r₂, p₂) is the second smallest outcome with corresponding probability, etc. One may then construct

P₁
P₂
P₃
P₄
. . .

p₁
p₂
p₃
p₄
. . .

r₁
r₂
r₃
r₄
. . .

where P_jare the cumulative probabilities, i.e.,

$P_{j} = \sum_{k = 1}^{j} p_{k}$

Now find the smallest index j* for which P_jequals or exceeds α.

If P_j*=a, then VaR_α=r_j*

If P_j*>α. then VaR_α=r_j*÷1.

Then, the conditional Value-at-Risk may be calculated as

${CVaR}_{α} = \frac{1}{α} \sum_{j = 1}^{j_{α}} r_{j} p_{j} where j_{α} = j | r (j) = {VaR}_{α} .$

With this calculation, one obtains a forward-looking Value-at-Risk and a forward-looking conditional Value-at-Risk that are not based on assuming normally distributed returns and, thus, reflect downside risk more accurately.

While the foregoing description has been with reference to particular examples of the present invention, it will be appreciated by those skilled in the art that changes to these examples may be made without departing from the principles and spirit of the invention. Accordingly, the scope of the present invention can only be ascertained with reference to the appended claims.

SEMI-PARAMETRIC APPROACH TO LARGE-SCALE PORTFOLIO OPTIMIZATION WITH FACTOR MODELS OF ASSET RETURNS

Information

Publication Number

Date Filed

Date Published

Inventors

CPC

International Classifications

Abstract

Description

Claims