Modern portfolio theory (MPT), or mean-variance analysis, is a mathematical framework for assembling a portfolio of assets such that the expected return is maximized for a given level of risk. It is a formalization and extension of diversification in investing, the idea that owning different kinds of financial assets is less risky than owning only one type. Its key insight is that an asset's risk and return should not be assessed by itself, but by how it contributes to a portfolio's overall risk and return. The variance of return (or its transformation, the standard deviation) is used as a measure of risk, because it is tractable when assets are combined into portfolios. Often, the historical variance and covariance of returns is used as a proxy for the forward-looking versions of these quantities,[1] but other, more sophisticated methods are available.[2]
Economist Harry Markowitz introduced MPT in a 1952 essay,[3] for which he was later awarded a Nobel Memorial Prize in Economic Sciences; see Markowitz model.
In 1940, Bruno de Finetti published[4] the mean-variance analysis method, in the context of proportional reinsurance, under a stronger assumption. The paper was obscure and only became known to economists of the English-speaking world in 2006.[5]
MPT assumes that investors are risk averse, meaning that given two portfolios that offer the same expected return, investors will prefer the less risky one. Thus, an investor will take on increased risk only if compensated by higher expected returns. Conversely, an investor who wants higher expected returns must accept more risk. The exact trade-off will not be the same for all investors. Different investors will evaluate the trade-off differently based on individual risk aversion characteristics. The implication is that a rational investor will not invest in a portfolio if a second portfolio exists with a more favorable risk vs expected return profile — i.e., if for that level of risk an alternative portfolio exists that has better expected returns.
Under the model:
\sigmap
In general:
- Expected return:
\operatorname{E}(Rp)=\sumiwi\operatorname{E}(Ri)
where
is the return on the portfolio,Rp
is the return on asset i andRi
is the weighting of component assetwi
(that is, the proportion of asset "i" in the portfolio, so thati
).\sumiwi=1
- Portfolio return variance:
,
2 \sigma p =\sumi
2 w i
2 \sigma i +\sumi\sumjwiwj\sigmai\sigmaj\rhoij
where
is the (sample) standard deviation of the periodic returns on an asset i, and\sigmai
is the correlation coefficient between the returns on assets i and j. Alternatively the expression can be written as:\rhoij
,
2 \sigma p =\sumi\sumjwiwj\sigmai\sigmaj\rhoij
where
for\rhoij=1
, ori=j
,
2 \sigma p =\sumi\sumjwiwj\sigmaij
where
is the (sample) covariance of the periodic returns on the two assets, or alternatively denoted as\sigmaij=\sigmai\sigmaj\rhoij
,\sigma(i,j)
orcovij
.cov(i,j)
- Portfolio return volatility (standard deviation):
\sigmap=\sqrt
2} {\sigma p For a two-asset portfolio:
- Portfolio expected return:
\operatorname{E}(Rp)=wA\operatorname{E}(RA)+ wB\operatorname{E}(RB)=wA\operatorname{E}(RA)+(1-wA)\operatorname{E}(RB).
- Portfolio variance:
2 \sigma p =
2 w A
2 \sigma A +
2 w B
2 \sigma B +2wAwB\sigmaA\sigmaB\rhoAB
For a three-asset portfolio:
- Portfolio expected return:
\operatorname{E}(Rp)=wA\operatorname{E}(RA)+wB\operatorname{E}(RB)+wC\operatorname{E}(RC)
- Portfolio variance:
2 \sigma p =
2 w A
2 \sigma A +
2 w B
2 \sigma B +
2 w C
2 \sigma C +2wAwB\sigmaA\sigmaB\rhoAB+2wAwC\sigmaA\sigmaC\rhoAC+2wBwC\sigmaB\sigmaC\rhoBC
The algebra can be much simplified by expressing the quantities involved in matrix notation.[6] Arrange the returns of N risky assets in an
vectorN x 1
, where the first element is the return of the first asset, the second element of the second asset, and so on. Arrange their expected returns in a column vectorR
, and their variances and covariances in a covariance matrix\mu
. Consider a portfolio of risky assets whose weights in each of the N risky assets is given by the corresponding element of the weight vector\Sigma
. Then:w
- Portfolio expected return:
andw'\mu
- Portfolio variance:
w'\Sigmaw
For the case where there is investment in a riskfree asset with return
, the weights of the weight vector do not sum to 1, and the portfolio expected return becomesRf
. The expression for the portfolio variance is unchanged.w'\mu+(1-w'1)Rf
An investor can reduce portfolio risk (especially
\sigmap
-1\le\rhoij<1
If all the asset pairs have correlations of 0—they are perfectly uncorrelated—the portfolio's return variance is the sum over all assets of the square of the fraction held in the asset times the asset's return variance (and the portfolio standard deviation is the square root of this sum).
If all the asset pairs have correlations of 1—they are perfectly positively correlated—then the portfolio return's standard deviation is the sum of the asset returns' standard deviations weighted by the fractions held in the portfolio. For given portfolio weights and given standard deviations of asset returns, the case of all correlations being 1 gives the highest possible standard deviation of portfolio return.
See main article: article and Efficient frontier.
See also: Portfolio optimization. The MPT is a mean-variance theory, and it compares the expected (mean) return of a portfolio with the standard deviation of the same portfolio. The image shows expected return on the vertical axis, and the standard deviation on the horizontal axis (volatility). Volatility is described by standard deviation and it serves as a measure of risk.[7] The return - standard deviation space is sometimes called the space of 'expected return vs risk'. Every possible combination of risky assets, can be plotted in this risk-expected return space, and the collection of all such possible portfolios defines a region in this space. The left boundary of this region is hyperbolic,[8] and the upper part of the hyperbolic boundary is the efficient frontier in the absence of a risk-free asset (sometimes called "the Markowitz bullet"). Combinations along this upper edge represent portfolios (including no holdings of the risk-free asset) for which there is lowest risk for a given level of expected return. Equivalently, a portfolio lying on the efficient frontier represents the combination offering the best possible expected return for given risk level. The tangent to the upper part of the hyperbolic boundary is the capital allocation line (CAL).
Matrices are preferred for calculations of the efficient frontier.
In matrix form, for a given "risk tolerance"
, the efficient frontier is found by minimizing the following expression:q\in[0,infty)
wherewT\Sigmaw-qRTw
is a vector of portfolio weights andw\inRN
(The weights can be negative);
N \sum i=1 wi=1.
is the covariance matrix for the returns on the assets in the portfolio;\Sigma\inRN x
is a "risk tolerance" factor, where 0 results in the portfolio with minimal risk andq\ge0
results in the portfolio infinitely far out on the frontier with both expected return and risk unbounded; andinfty
is a vector of expected returns.R\inRN
is the variance of portfolio return.wT\Sigmaw\inR
is the expected return on the portfolio.The above optimization finds the point on the frontier at which the inverse of the slope of the frontier would be q if portfolio return variance instead of standard deviation were plotted horizontally. The frontier in its entirety is parametric on q.RTw\inR
Harry Markowitz developed a specific procedure for solving the above problem, called the critical line algorithm,[9] that can handle additional linear constraints, upper and lower bounds on assets, and which is proved to work with a semi-positive definite covariance matrix. Examples of implementation of the critical line algorithm exist in Visual Basic for Applications,[10] in JavaScript[11] and in a few other languages.
Also, many software packages, including MATLAB, Microsoft Excel, Mathematica and R, provide generic optimization routines so that using these for solving the above problem is possible, with potential caveats (poor numerical accuracy, requirement of positive definiteness of the covariance matrix...).
An alternative approach to specifying the efficient frontier is to do so parametrically on the expected portfolio return
This version of the problem requires that we minimizeRTw.
wT\Sigmaw
subject to
RTw=\mu
and
N \sum i=1 wi=1
for parameter
. This problem is easily solved using a Lagrange multiplier which leads to the following linear system of equations:\mu
\begin{bmatrix}2\Sigma&-R&-{\bf1}\ RT&0&0\ {\bf1}T&0&0\end{bmatrix}\begin{bmatrix}w\\λ1\\λ2\end{bmatrix}=\begin{bmatrix}0\\\mu\ 1\end{bmatrix}
One key result of the above analysis is the two mutual fund theorem.[12] [13] This theorem states that any portfolio on the efficient frontier can be generated by holding a combination of any two given portfolios on the frontier; the latter two given portfolios are the "mutual funds" in the theorem's name. So in the absence of a risk-free asset, an investor can achieve any desired efficient portfolio even if all that is accessible is a pair of efficient mutual funds. If the location of the desired portfolio on the frontier is between the locations of the two mutual funds, both mutual funds will be held in positive quantities. If the desired portfolio is outside the range spanned by the two mutual funds, then one of the mutual funds must be sold short (held in negative quantity) while the size of the investment in the other mutual fund must be greater than the amount available for investment (the excess being funded by the borrowing from the other fund).
See main article: article and Capital allocation line.
The risk-free asset is the (hypothetical) asset that pays a risk-free rate. In practice, short-term government securities (such as US treasury bills) are used as a risk-free asset, because they pay a fixed rate of interest and have exceptionally low default risk. The risk-free asset has zero variance in returns (hence is risk-free); it is also uncorrelated with any other asset (by definition, since its variance is zero). As a result, when it is combined with any other asset or portfolio of assets, the change in return is linearly related to the change in risk as the proportions in the combination vary.
When a risk-free asset is introduced, the half-line shown in the figure is the new efficient frontier. It is tangent to the hyperbola at the pure risky portfolio with the highest Sharpe ratio. Its vertical intercept represents a portfolio with 100% of holdings in the risk-free asset; the tangency with the hyperbola represents a portfolio with no risk-free holdings and 100% of assets held in the portfolio occurring at the tangency point; points between those points are portfolios containing positive amounts of both the risky tangency portfolio and the risk-free asset; and points on the half-line beyond the tangency point are portfolios involving negative holdings of the risk-free asset and an amount invested in the tangency portfolio equal to more than 100% of the investor's initial capital. This efficient half-line is called the capital allocation line (CAL), and its formula can be shown to be
E(RC)=RF+\sigmaC
E(RP)-RF | |
\sigmaP |
.
In this formula P is the sub-portfolio of risky assets at the tangency with the Markowitz bullet, F is the risk-free asset, and C is a combination of portfolios P and F.
By the diagram, the introduction of the risk-free asset as a possible component of the portfolio has improved the range of risk-expected return combinations available, because everywhere except at the tangency portfolio the half-line gives a higher expected return than the hyperbola does at every possible risk level. The fact that all points on the linear efficient locus can be achieved by a combination of holdings of the risk-free asset and the tangency portfolio is known as the one mutual fund theorem,[12] where the mutual fund referred to is the tangency portfolio.
The efficient frontier can be pictured as a problem in quadratic curves. On the market, we have the assets
R1,R2,...,Rn
w1,w2,...,wn
\sumiwi=1
wTR=\sumiwiRi
Since we wish to maximize expected return while minimizing the standard deviation of the return, we are to solve a quadratic optimization problem:Portfolios are points in the Euclidean space
\Rn
\sumiwi=1
wTE[R]=\mu
\sumijwi\rhoijwj
\rhoij
\sumiwi=1
\{w:wTE[R]=\muand\sumiwi=1\}
As we vary
\mu
Let the line be parameterized as
\{w+w't:t\in\R\}
(\sigma,\mu)
\mu
\sigma>0
\muMVP
\mu
\mumid
The tangency portfolio exists if and only if
\muRF<\muMVP
In particular, if the risk-free return is greater or equal to
\muMVP
It is usually assumed that the risk-free return is less than the return of the global MVP, in order that the tangency portfolio exists. However, even in this case, as
\muRF
\muMVP
If the covariance matrix is not invertible, then there exists some nonzero vector
v
vTR
Suppose
\sumivi=0
vTR=0
Suppose
\sumivi=0
vTR ≠ 0
Suppose
\sumivi ≠ 0
\sumivi=1
vTR
The above analysis describes optimal behavior of an individual investor. Asset pricing theory builds on this analysis, allowing MPT to derive the required expected return for a correctly priced asset in this context.
Intuitively (in a perfect market with rational investors), if a security was expensive relative to others - i.e. too much risk for the price - demand would fall and its price would drop correspondingly; if cheap, demand and price would increase likewise. This would continue until all such adjustments had ceased - a state of "market equilibrium".In this equilibrium, relative supplies will equal relative demands:given the relationship of price with supply and demand, since the risk-to-reward ratio is "identical" across all securities, proportions of each security in any fully-diversified portfolio would correspondingly be the same as in the overall market.
More formally, then, since everyone holds the risky assets in identical proportions to each other — namely in the proportions given by the tangency portfolio — in market equilibrium the risky assets' prices, and therefore their expected returns, will adjust so that the ratios in the tangency portfolio are the same as the ratios in which the risky assets are supplied to the market. The result for expected return then follows, as below.
Specific risk is the risk associated with individual assets - within a portfolio these risks can be reduced through diversification (specific risks "cancel out"). Specific risk is also called diversifiable, unique, unsystematic, or idiosyncratic risk. Systematic risk (a.k.a. portfolio risk or market risk) refers to the risk common to all securities—except for selling short as noted below, systematic risk cannot be diversified away (within one market). Within the market portfolio, asset specific risk will be diversified away to the extent possible. Systematic risk is therefore equated with the risk (standard deviation) of the market portfolio.
Since a security will be purchased only if it improves the risk-expected return characteristics of the market portfolio, the relevant measure of the risk of a security is the risk it adds to the market portfolio, and not its risk in isolation.In this context, the volatility of the asset, and its correlation with the market portfolio, are historically observed and are therefore given. (There are several approaches to asset pricing that attempt to price assets by modelling the stochastic properties of the moments of assets' returns - these are broadly referred to as conditional asset pricing models.)
Systematic risks within one market can be managed through a strategy of using both long and short positions within one portfolio, creating a "market neutral" portfolio. Market neutral portfolios, therefore, will be uncorrelated with broader market indices.
See main article: article and Capital asset pricing model.
The asset return depends on the amount paid for the asset today. The price paid must ensure that the market portfolio's risk / return characteristics improve when the asset is added to it. The CAPM is a model that derives the theoretical required expected return (i.e., discount rate) for an asset in a market, given the risk-free rate available to investors and the risk of the market as a whole. The CAPM is usually expressed:
\operatorname{E}(Ri)=Rf+\betai(\operatorname{E}(Rm)-Rf)
(\operatorname{E}(Rm)-Rf)
A derivation [14] is as follows:
(1) The incremental impact on risk and expected return when an additional risky asset, a, is added to the market portfolio, m, follows from the formulae for a two-asset portfolio. These results are used to derive the asset-appropriate discount rate.
- Updated portfolio risk =
2 (w m
2 \sigma m +[
2 w a
2 \sigma a +2wmwa\rhoam\sigmaa\sigmam])
Hence, risk added to portfolio =
[
2 w a
2 \sigma a +2wmwa\rhoam\sigmaa\sigmam]
but since the weight of the asset will be very low re. the overall market,
2 w a ≈ 0
i.e. additional risk =
[2wmwa\rhoam\sigmaa\sigmam]
- Updated expected return =
(wm\operatorname{E}(Rm)+[wa\operatorname{E}(Ra)])
Hence additional expected return =
[wa\operatorname{E}(Ra)]
(2) If an asset, a, is correctly priced, the improvement for an investor in her risk-to-expected return ratio achieved by adding it to the market portfolio, m, will at least (in equilibrium, exactly) match the gains of spending that money on an increased stake in the market portfolio. The assumption is that the investor will purchase the asset with funds borrowed at the risk-free rate,
; this is rational ifRf
.\operatorname{E}(Ra)>Rf
Thus:
[wa(\operatorname{E}(Ra)-Rf)]/[2wmwa\rhoam\sigmaa\sigmam]=[wa(\operatorname{E}(Rm)-Rf)]/[2wmwa\sigmam\sigmam]
i.e.:
[\operatorname{E}(Ra)]=Rf+[\operatorname{E}(Rm)-Rf]*[\rhoam\sigmaa\sigmam]/[\sigmam\sigmam]
i.e.:
[\operatorname{E}(Ra)]=Rf+[\operatorname{E}(Rm)-Rf]*[\sigmaam]/[\sigmamm]
is the "beta",[\sigmaam]/[\sigmamm]
return mentioned — the covariance between the asset's return and the market's return divided by the variance of the market return — i.e. the sensitivity of the asset price to movement in the market portfolio's value (see also).\beta
This equation can be estimated statistically using the following regression equation:
SCL:Ri,t-Rf=\alphai+\betai(RM,t-Rf)+\epsiloni,t
where αi is called the asset's alpha, βi is the asset's beta coefficient and SCL is the security characteristic line.
Once an asset's expected return,
E(Ri)
Despite its theoretical importance, critics of MPT question whether it is an ideal investment tool, because its model of financial markets does not match the real world in many ways.[15]
The risk, return, and correlation measures used by MPT are based on expected values, which means that they are statistical statements about the future (the expected value of returns is explicit in the above equations, and implicit in the definitions of variance and covariance). Such measures often cannot capture the true statistical features of the risk and return which often follow highly skewed distributions (e.g. the log-normal distribution) and can give rise to, besides reduced volatility, also inflated growth of return.[16] In practice, investors must substitute predictions based on historical measurements of asset return and volatility for these values in the equations. Very often such expected values fail to take account of new circumstances that did not exist when the historical data were generated.[17] An optimal approach to capturing trends, which differs from Markowitz optimization by utilizing invariance properties, is also derived from physics. Instead of transforming the normalized expectations using the inverse of the correlation matrix, the invariant portfolio employs the inverse of the square root of the correlation matrix.[18] The optimization problem is solved under the assumption that expected values are uncertain and correlated.[19] The Markowitz solution corresponds only to the case where the correlation between expected returns is similar to the correlation between returns.
More fundamentally, investors are stuck with estimating key parameters from past market data because MPT attempts to model risk in terms of the likelihood of losses, but says nothing about why those losses might occur. The risk measurements used are probabilistic in nature, not structural. This is a major difference as compared to many engineering approaches to risk management.
Mathematical risk measurements are also useful only to the degree that they reflect investors' true concerns—there is no point minimizing a variable that nobody cares about in practice. In particular, variance is a symmetric measure that counts abnormally high returns as just as risky as abnormally low returns. The psychological phenomenon of loss aversion is the idea that investors are more concerned about losses than gains, meaning that our intuitive concept of risk is fundamentally asymmetric in nature. There many other risk measures (like coherent risk measures) might better reflect investors' true preferences.
Modern portfolio theory has also been criticized because it assumes that returns follow a Gaussian distribution. Already in the 1960s, Benoit Mandelbrot and Eugene Fama showed the inadequacy of this assumption and proposed the use of more general stable distributions instead. Stefan Mittnik and Svetlozar Rachev presented strategies for deriving optimal portfolios in such settings.[20] [21] [22] More recently, Nassim Nicholas Taleb has also criticized modern portfolio theory on this ground, writing:
Contrarian investors and value investors typically do not subscribe to Modern Portfolio Theory.[23] One objection is that the MPT relies on the efficient-market hypothesis and uses fluctuations in share price as a substitute for risk. Sir John Templeton believed in diversification as a concept, but also felt the theoretical foundations of MPT were questionable, and concluded (as described by a biographer): "the notion that building portfolios on the basis of unreliable and irrelevant statistical inputs, such as historical volatility, was doomed to failure."[24]
A few studies have argued that "naive diversification", splitting capital equally among available investment options, might have advantages over MPT in some situations.[25]
When applied to certain universes of assets, the Markowitz model has been identified by academics to be inadequate due to its susceptibility to model instability which may arise, for example, among a universe of highly correlated assets.[26]
Since MPT's introduction in 1952, many attempts have been made to improve the model, especially by using more realistic assumptions.
Post-modern portfolio theory extends MPT by adopting non-normally distributed, asymmetric, and fat-tailed measures of risk.[27] This helps with some of these problems, but not others.
Black–Litterman model optimization is an extension of unconstrained Markowitz optimization that incorporates relative and absolute 'views' on inputs of risk and returns from.
The model is also extended by assuming that expected returns are uncertain, and the correlation matrix in this case can differ from the correlation matrix between returns.[18] [19]
Modern portfolio theory is inconsistent with main axioms of rational choice theory, most notably with monotonicity axiom, stating that, if investing into portfolio X will, with probability one, return more money than investing into portfolio Y, then a rational investor should prefer X to Y. In contrast, modern portfolio theory is based on a different axiom, called variance aversion,[28] and may recommend to invest into Y on the basis that it has lower variance. Maccheroni et al.[29] described choice theory which is the closest possible to the modern portfolio theory, while satisfying monotonicity axiom. Alternatively, mean-deviation analysis[30] is a rational choice theory resulting from replacing variance by an appropriate deviation risk measure.
In the 1970s, concepts from MPT found their way into the field of regional science. In a series of seminal works, Michael Conroy modeled the labor force in the economy using portfolio-theoretic methods to examine growth and variability in the labor force. This was followed by a long literature on the relationship between economic growth and volatility.[31]
More recently, modern portfolio theory has been used to model the self-concept in social psychology. When the self attributes comprising the self-concept constitute a well-diversified portfolio, then psychological outcomes at the level of the individual such as mood and self-esteem should be more stable than when the self-concept is undiversified. This prediction has been confirmed in studies involving human subjects.[32]
Recently, modern portfolio theory has been applied to modelling the uncertainty and correlation between documents in information retrieval. Given a query, the aim is to maximize the overall relevance of a ranked list of documents and at the same time minimize the overall uncertainty of the ranked list.[33]
Some experts apply MPT to portfolios of projects and other assets besides financial instruments.[34] [35] When MPT is applied outside of traditional financial portfolios, some distinctions between the different types of portfolios must be considered.
Neither of these necessarily eliminate the possibility of using MPT and such portfolios. They simply indicate the need to run the optimization with an additional set of mathematically expressed constraints that would not normally apply to financial portfolios.
Furthermore, some of the simplest elements of Modern Portfolio Theory are applicable to virtually any kind of portfolio. The concept of capturing the risk tolerance of an investor by documenting how much risk is acceptable for a given return may be applied to a variety of decision analysis problems. MPT uses historical variance as a measure of risk, but portfolios of assets like major projects do not have a well-defined "historical variance". In this case, the MPT investment boundary can be expressed in more general terms like "chance of an ROI less than cost of capital" or "chance of losing more than half of the investment". When risk is put in terms of uncertainty about forecasts and possible losses then the concept is transferable to various types of investment.