Modern portfolio theory (MPT), or mean-variance analysis, is a mathematical framework for assembling a portfolio of assets such that the expected return is maximized for a given level of risk. It is a formalization and extension of diversification in investing, the idea that owning different kinds of financial assets is less risky than owning only one type. Its key insight is that an asset's risk and return should not be assessed by itself, but by how it contributes to a portfolio's overall risk and return. The variance of return (or its transformation, the standard deviation) is used as a measure of risk, because it is tractable when assets are combined into portfolios.[1] Often, the historical variance and covariance of returns is used as a proxy for the forward-looking versions of these quantities,[2] but other, more sophisticated methods are available.[3]
Economist Harry Markowitz introduced MPT in a 1952 essay,[1] for which he was later awarded a Nobel Memorial Prize in Economic Sciences; see Markowitz model.
In 1940, Bruno de Finetti published[4] the mean-variance analysis method, in the context of proportional reinsurance, under a stronger assumption. The paper was obscure and only became known to economists of the English-speaking world in 2006.[5]
MPT assumes that investors are risk averse, meaning that given two portfolios that offer the same expected return, investors will prefer the less risky one. Thus, an investor will take on increased risk only if compensated by higher expected returns. Conversely, an investor who wants higher expected returns must accept more risk. The exact trade-off will not be the same for all investors. Different investors will evaluate the trade-off differently based on individual risk aversion characteristics. The implication is that a rational investor will not invest in a portfolio if a second portfolio exists with a more favorable risk vs expected return profile — i.e., if for that level of risk an alternative portfolio exists that has better expected returns.
Under the model:
In general:
- Expected return:
- where is the return on the portfolio, is the return on asset i and is the weighting of component asset (that is, the proportion of asset "i" in the portfolio, so that ).
- Portfolio return variance:
- ,
- where is the (sample) standard deviation of the periodic returns on an asset i, and is the correlation coefficient between the returns on assets i and j. Alternatively the expression can be written as:
- ,
- where for , or
- ,
- where is the (sample) covariance of the periodic returns on the two assets, or alternatively denoted as , or .
- Portfolio return volatility (standard deviation):
For a two-asset portfolio:
- Portfolio expected return:
- Portfolio variance:
For a three-asset portfolio:
- Portfolio expected return:
- Portfolio variance:
The algebra can be much simplified by expressing the quantities involved in matrix notation.[6] Arrange the returns of N risky assets in an vector , where the first element is the return of the first asset, the second element of the second asset, and so on. Arrange their expected returns in a column vector , and their variances and covariances in a covariance matrix . Consider a portfolio of risky assets whose weights in each of the N risky assets is given by the corresponding element of the weight vector . Then:
- Portfolio expected return:
and
- Portfolio variance:
For the case where there is investment in a riskfree asset with return , the weights of the weight vector do not sum to 1, and the portfolio expected return becomes . The expression for the portfolio variance is unchanged.
An investor can reduce portfolio risk (especially ) simply by holding combinations of instruments that are not perfectly positively correlated (correlation coefficient ). In other words, investors can reduce their exposure to individual asset risk by holding a diversified portfolio of assets. Diversification may allow for the same portfolio expected return with reduced risk. The mean-variance framework for constructing optimal investment portfolios was first posited by Markowitz and has since been reinforced and improved by other economists and mathematicians who went on to account for the limitations of the framework.
If all the asset pairs have correlations of 0—they are perfectly uncorrelated—the portfolio's return variance is the sum over all assets of the square of the fraction held in the asset times the asset's return variance (and the portfolio standard deviation is the square root of this sum).
If all the asset pairs have correlations of 1—they are perfectly positively correlated—then the portfolio return's standard deviation is the sum of the asset returns' standard deviations weighted by the fractions held in the portfolio. For given portfolio weights and given standard deviations of asset returns, the case of all correlations being 1 gives the highest possible standard deviation of portfolio return.
The MPT is a mean-variance theory, and it compares the expected (mean) return of a portfolio with the standard deviation of the same portfolio. The image shows expected return on the vertical axis, and the standard deviation on the horizontal axis (volatility). Volatility is described by standard deviation and it serves as a measure of risk.[7] The return - standard deviation space is sometimes called the space of 'expected return vs risk'. Every possible combination of risky assets, can be plotted in this risk-expected return space, and the collection of all such possible portfolios defines a region in this space. The left boundary of this region is hyperbolic,[8] and the upper part of the hyperbolic boundary is the efficient frontier in the absence of a risk-free asset (sometimes called "the Markowitz bullet"). Combinations along this upper edge represent portfolios (including no holdings of the risk-free asset) for which there is lowest risk for a given level of expected return. Equivalently, a portfolio lying on the efficient frontier represents the combination offering the best possible expected return for given risk level. The tangent to the upper part of the hyperbolic boundary is the capital allocation line (CAL).
Matrices are preferred for calculations of the efficient frontier.
In matrix form, for a given "risk tolerance" , the efficient frontier is found by minimizing the following expression:
where
- is a vector of portfolio weights and (The weights can be negative);
- is the covariance matrix for the returns on the assets in the portfolio;
- is a "risk tolerance" factor, where 0 results in the portfolio with minimal risk and results in the portfolio infinitely far out on the frontier with both expected return and risk unbounded; and
- is a vector of expected returns.
- is the variance of portfolio return.
- is the expected return on the portfolio.
The above optimization finds the point on the frontier at which the inverse of the slope of the frontier would be q if portfolio return variance instead of standard deviation were plotted horizontally. The frontier in its entirety is parametric on q.
Harry Markowitz developed a specific procedure for solving the above problem, called the critical line algorithm,[9] that can handle additional linear constraints, upper and lower bounds on assets, and which is proved to work with a semi-positive definite covariance matrix. Examples of implementation of the critical line algorithm exist in Visual Basic for Applications,[10] in JavaScript[11] and in a few other languages.
Also, many software packages, including MATLAB, Microsoft Excel, Mathematica and R, provide generic optimization routines so that using these for solving the above problem is possible, with potential caveats (poor numerical accuracy, requirement of positive definiteness of the covariance matrix...).
An alternative approach to specifying the efficient frontier is to do so parametrically on the expected portfolio return This version of the problem requires that we minimize
subject to
and
for parameter . This problem is easily solved using a Lagrange multiplier which leads to the following linear system of equations:
One key result of the above analysis is the two mutual fund theorem.[12][13] This theorem states that any portfolio on the efficient frontier can be generated by holding a combination of any two given portfolios on the frontier; the latter two given portfolios are the "mutual funds" in the theorem's name. So in the absence of a risk-free asset, an investor can achieve any desired efficient portfolio even if all that is accessible is a pair of efficient mutual funds. If the location of the desired portfolio on the frontier is between the locations of the two mutual funds, both mutual funds will be held in positive quantities. If the desired portfolio is outside the range spanned by the two mutual funds, then one of the mutual funds must be sold short (held in negative quantity) while the size of the investment in the other mutual fund must be greater than the amount available for investment (the excess being funded by the borrowing from the other fund).
The risk-free asset is the (hypothetical) asset that pays a risk-free rate. In practice, short-term government securities (such as US treasury bills) are used as a risk-free asset, because they pay a fixed rate of interest and have exceptionally low default risk. The risk-free asset has zero variance in returns if held to maturity (hence is risk-free); it is also uncorrelated with any other asset (by definition, since its variance is zero). As a result, when it is combined with any other asset or portfolio of assets, the change in return is linearly related to the change in risk as the proportions in the combination vary.
When a risk-free asset is introduced, the half-line shown in the figure is the new efficient frontier. It is tangent to the hyperbola at the pure risky portfolio with the highest Sharpe ratio. Its vertical intercept represents a portfolio with 100% of holdings in the risk-free asset; the tangency with the hyperbola represents a portfolio with no risk-free holdings and 100% of assets held in the portfolio occurring at the tangency point; points between those points are portfolios containing positive amounts of both the risky tangency portfolio and the risk-free asset; and points on the half-line beyond the tangency point are portfolios involving negative holdings of the risk-free asset and an amount invested in the tangency portfolio equal to more than 100% of the investor's initial capital. This efficient half-line is called the capital allocation line (CAL), and its formula can be shown to be
In this formula P is the sub-portfolio of risky assets at the tangency with the Markowitz bullet, F is the risk-free asset, and C is a combination of portfolios P and F.
By the diagram, the introduction of the risk-free asset as a possible component of the portfolio has improved the range of risk-expected return combinations available, because everywhere except at the tangency portfolio the half-line gives a higher expected return than the hyperbola does at every possible risk level. The fact that all points on the linear efficient locus can be achieved by a combination of holdings of the risk-free asset and the tangency portfolio is known as the one mutual fund theorem,[12] where the mutual fund referred to is the tangency portfolio.
The efficient frontier can be pictured as a problem in quadratic curves.[12] On the market, we have the assets . We have some funds, and a portfolio is a way to divide our funds into the assets. Each portfolio can be represented as a vector , such that , and we hold the assets according to .
Since we wish to maximize expected return while minimizing the standard deviation of the return, we are to solve a quadratic optimization problem: Portfolios are points in the Euclidean space . The third equation states that the portfolio should fall on a plane defined by . The first equation states that the portfolio should fall on a plane defined by . The second condition states that the portfolio should fall on the contour surface for that is as close to the origin as possible. Since the equation is quadratic, each such contour surface is an ellipsoid (assuming that the covariance matrix is invertible). Therefore, we can solve the quadratic optimization graphically by drawing ellipsoidal contours on the plane , then intersect the contours with the plane . As the ellipsoidal contours shrink, eventually one of them would become exactly tangent to the plane, before the contours become completely disjoint from the plane. The tangent point is the optimal portfolio at this level of expected return.
As we vary , the tangent point varies as well, but always falling on a single line (this is the two mutual funds theorem).
Let the line be parameterized as . We find that along the line, giving a hyperbola in the plane. The hyperbola has two branches, symmetric with respect to the axis. However, only the branch with is meaningful. By symmetry, the two asymptotes of the hyperbola intersect at a point on the axis. The point is the height of the leftmost point of the hyperbola, and can be interpreted as the expected return of the global minimum-variance portfolio (global MVP).
The tangency portfolio exists if and only if .
In particular, if the risk-free return is greater or equal to , then the tangent portfolio does not exist. The capital market line (CML) becomes parallel to the upper asymptote line of the hyperbola. Points on the CML become impossible to achieve, though they can be approached from below.
It is usually assumed that the risk-free return is less than the return of the global MVP, in order that the tangency portfolio exists. However, even in this case, as approaches from below, the tangency portfolio diverges to a portfolio with infinite return and variance. Since there are only finitely many assets in the market, such a portfolio must be shorting some assets heavily while longing some other assets heavily. In practice, such a tangency portfolio would be impossible to achieve, because one cannot short an asset too much due to short sale constraints, and also because of price impact, that is, longing a large amount of an asset would push up its price, breaking the assumption that the asset prices do not depend on the portfolio.
If the covariance matrix is not invertible, then there exists some nonzero vector , such that is a random variable with zero variance—that is, it is not random at all.
Suppose and , then that means one of the assets can be exactly replicated using the other assets at the same price and the same return. Therefore, there is never a reason to buy that asset, and we can remove it from the market.
Suppose and , then that means there is free money, breaking the no arbitrage assumption.
Suppose , then we can scale the vector to . This means that we have constructed a risk-free asset with return . We can remove each such asset from the market, constructing one risk-free asset for each such asset removed. By the no arbitrage assumption, all their return rates are equal. For the assets that still remain in the market, their covariance matrix is invertible.
The above analysis describes optimal behavior of an individual investor. Asset pricing theory builds on this analysis, allowing MPT to derive the required expected return for a correctly priced asset in this context.
Intuitively (in a perfect market with rational investors), if a security was expensive relative to others - i.e. too much risk for the price - demand would fall and its price would drop correspondingly; if cheap, demand and price would increase likewise. This would continue until all such adjustments had ceased - a state of "market equilibrium". In this equilibrium, relative supplies will equal relative demands: given the relationship of price with supply and demand, since the risk-to-reward ratio is "identical" across all securities, proportions of each security in any fully-diversified portfolio would correspondingly be the same as in the overall market.
More formally, then, since everyone holds the risky assets in identical proportions to each other — namely in the proportions given by the tangency portfolio — in market equilibrium the risky assets' prices, and therefore their expected returns, will adjust so that the ratios in the tangency portfolio are the same as the ratios in which the risky assets are supplied to the market.[14] The result for expected return then follows, as below.
Specific risk is the risk associated with individual assets - within a portfolio these risks can be reduced through diversification (specific risks "cancel out"). Specific risk is also called diversifiable, unique, unsystematic, or idiosyncratic risk. Systematic risk (a.k.a. portfolio risk or market risk) refers to the risk common to all securities—except for selling short as noted below, systematic risk cannot be diversified away (within one market). Within the market portfolio, asset specific risk will be diversified away to the extent possible. Systematic risk is therefore equated with the risk (standard deviation) of the market portfolio.
Since a security will be purchased only if it improves the risk-expected return characteristics of the market portfolio, the relevant measure of the risk of a security is the risk it adds to the market portfolio, and not its risk in isolation. In this context, the volatility of the asset, and its correlation with the market portfolio, are historically observed and are therefore given. (There are several approaches to asset pricing that attempt to price assets by modelling the stochastic properties of the moments of assets' returns - these are broadly referred to as conditional asset pricing models.)
Systematic risks within one market can be managed through a strategy of using both long and short positions within one portfolio, creating a "market neutral" portfolio. Market neutral portfolios, therefore, will be uncorrelated with broader market indices.
The asset return depends on the amount paid for the asset today. The price paid must ensure that the market portfolio's risk / return characteristics improve when the asset is added to it. The CAPM is a model that derives the theoretical required expected return (i.e., discount rate) for an asset in a market, given the risk-free rate available to investors and the risk of the market as a whole. The CAPM is usually expressed:
A derivation [14] is as follows:
(1) The incremental impact on risk and expected return when an additional risky asset, a, is added to the market portfolio, m, follows from the formulae for a two-asset portfolio. These results are used to derive the asset-appropriate discount rate.
- Updated portfolio risk =
- Hence, risk added to portfolio =
- but since the weight of the asset will be very low re. the overall market,
- i.e. additional risk =
- Updated expected return =
- Hence additional expected return =
(2) If an asset, a, is correctly priced, the improvement for an investor in her risk-to-expected return ratio achieved by adding it to the market portfolio, m, will at least (in equilibrium, exactly) match the gains of spending that money on an increased stake in the market portfolio. The assumption is that the investor will purchase the asset with funds borrowed at the risk-free rate, ; this is rational if .
- Thus:
- i.e.:
- i.e.: (since )
- is the "beta", return mentioned — the covariance between the asset's return and the market's return divided by the variance of the market return — i.e. the sensitivity of the asset price to movement in the market portfolio's value (see also Beta (finance) § Adding an asset to the market portfolio).
This equation can be estimated statistically using the following regression equation:
where αi is called the asset's alpha, βi is the asset's beta coefficient and SCL is the security characteristic line.
Once an asset's expected return, , is calculated using CAPM, the future cash flows of the asset can be discounted to their present value using this rate to establish the correct price for the asset. A riskier stock will have a higher beta and will be discounted at a higher rate; less sensitive stocks will have lower betas and be discounted at a lower rate. In theory, an asset is correctly priced when its observed price is the same as its value calculated using the CAPM derived discount rate. If the observed price is higher than the valuation, then the asset is overvalued; it is undervalued for a too low price.
Despite its theoretical importance, critics of MPT question whether it is an ideal investment tool, because its model of financial markets does not match the real world in many ways.[15][2]
The risk, return, and correlation measures used by MPT are based on expected values, which means that they are statistical statements about the future (the expected value of returns is explicit in the above equations, and implicit in the definitions of variance and covariance). Such measures often cannot capture the true statistical features of the risk and return which often follow highly skewed distributions (e.g. the log-normal distribution) and can give rise to, besides reduced volatility, also inflated growth of return.[16] In practice, investors must substitute predictions based on historical measurements of asset return and volatility for these values in the equations. Very often such expected values fail to take account of new circumstances that did not exist when the historical data were generated.[17] An optimal approach to capturing trends, which differs from Markowitz optimization by utilizing invariance properties, is also derived from physics. Instead of transforming the normalized expectations using the inverse of the correlation matrix, the invariant portfolio employs the inverse of the square root of the correlation matrix.[18] The optimization problem is solved under the assumption that expected values are uncertain and correlated.[19] The Markowitz solution corresponds only to the case where the correlation between expected returns is similar to the correlation between returns.
More fundamentally, investors are stuck with estimating key parameters from past market data because MPT attempts to model risk in terms of the likelihood of losses, but says nothing about why those losses might occur. The risk measurements used are probabilistic in nature, not structural. This is a major difference as compared to many engineering approaches to risk management.
Options theory and MPT have at least one important conceptual difference from the probabilistic risk assessment done by nuclear power [plants]. A PRA is what economists would call a structural model. The components of a system and their relationships are modeled in Monte Carlo simulations. If valve X fails, it causes a loss of back pressure on pump Y, causing a drop in flow to vessel Z, and so on.
But in the Black–Scholes equation and MPT, there is no attempt to explain an underlying structure to price changes. Various outcomes are simply given probabilities. And, unlike the PRA, if there is no history of a particular system-level event like a liquidity crisis, there is no way to compute the odds of it. If nuclear engineers ran risk management this way, they would never be able to compute the odds of a meltdown at a particular plant until several similar events occurred in the same reactor design.
— Douglas W. Hubbard, The Failure of Risk Management, p. 67, John Wiley & Sons, 2009. ISBN 978-0-470-38795-5
Mathematical risk measurements are also useful only to the degree that they reflect investors' true concerns—there is no point minimizing a variable that nobody cares about in practice. In particular, variance is a symmetric measure that counts abnormally high returns as just as risky as abnormally low returns. The psychological phenomenon of loss aversion is the idea that investors are more concerned about losses than gains, meaning that our intuitive concept of risk is fundamentally asymmetric in nature. There many other risk measures (like coherent risk measures) might better reflect investors' true preferences.
Modern portfolio theory has also been criticized because it assumes that returns follow a Gaussian distribution. Already in the 1960s, Benoit Mandelbrot and Eugene Fama showed the inadequacy of this assumption and proposed the use of more general stable distributions instead. Stefan Mittnik and Svetlozar Rachev presented strategies for deriving optimal portfolios in such settings.[20][21][22] More recently, Nassim Nicholas Taleb has also criticized modern portfolio theory on this ground, writing:
After the stock market crash (in 1987), they rewarded two theoreticians, Harry Markowitz and William Sharpe, who built beautifully Platonic models on a Gaussian base, contributing to what is called Modern Portfolio Theory. Simply, if you remove their Gaussian assumptions and treat prices as scalable, you are left with hot air. The Nobel Committee could have tested the Sharpe and Markowitz models—they work like quack remedies sold on the Internet—but nobody in Stockholm seems to have thought about it.
— Nassim N. Taleb, The Black Swan: The Impact of the Highly Improbable, p. 277, Random House, 2007. ISBN 978-1-4000-6351-2
Contrarian investors and value investors typically do not subscribe to Modern Portfolio Theory.[23] One objection is that the MPT relies on the efficient-market hypothesis and uses fluctuations in share price as a substitute for risk. Sir John Templeton believed in diversification as a concept, but also felt the theoretical foundations of MPT were questionable, and concluded (as described by a biographer): "the notion that building portfolios on the basis of unreliable and irrelevant statistical inputs, such as historical volatility, was doomed to failure."[24]
A few studies have argued that "naive diversification", splitting capital equally among available investment options, might have advantages over MPT in some situations.[25]
When applied to certain universes of assets, the Markowitz model has been identified by academics to be inadequate due to its susceptibility to model instability which may arise, for example, among a universe of highly correlated assets.[26]
Since MPT's introduction in 1952, many attempts have been made to improve the model, especially by using more realistic assumptions.
Post-modern portfolio theory extends MPT by adopting non-normally distributed, asymmetric, and fat-tailed measures of risk.[27] This helps with some of these problems, but not others.
Black–Litterman model optimization is an extension of unconstrained Markowitz optimization that incorporates relative and absolute 'views' on inputs of risk and returns from.
The model is also extended by assuming that expected returns are uncertain, and the correlation matrix in this case can differ from the correlation matrix between returns.[18][19]
Modern portfolio theory is inconsistent with main axioms of rational choice theory, most notably with monotonicity axiom, stating that, if investing into portfolio X will, with probability one, return more money than investing into portfolio Y, then a rational investor should prefer X to Y. In contrast, modern portfolio theory is based on a different axiom, called variance aversion,[28] and may recommend to invest into Y on the basis that it has lower variance. Maccheroni et al.[29] described choice theory which is the closest possible to the modern portfolio theory, while satisfying monotonicity axiom. Alternatively, mean-deviation analysis[30] is a rational choice theory resulting from replacing variance by an appropriate deviation risk measure.
In the 1970s, concepts from MPT found their way into the field of regional science. In a series of seminal works, Michael Conroy[citation needed] modeled the labor force in the economy using portfolio-theoretic methods to examine growth and variability in the labor force. This was followed by a long literature on the relationship between economic growth and volatility.[31]
More recently, modern portfolio theory has been used to model the self-concept in social psychology. When the self attributes comprising the self-concept constitute a well-diversified portfolio, then psychological outcomes at the level of the individual such as mood and self-esteem should be more stable than when the self-concept is undiversified. This prediction has been confirmed in studies involving human subjects.[32]
Recently, modern portfolio theory has been applied to modelling the uncertainty and correlation between documents in information retrieval. Given a query, the aim is to maximize the overall relevance of a ranked list of documents and at the same time minimize the overall uncertainty of the ranked list.[33]
Some experts apply MPT to portfolios of projects and other assets besides financial instruments.[34][35] When MPT is applied outside of traditional financial portfolios, some distinctions between the different types of portfolios must be considered.
Neither of these necessarily eliminate the possibility of using MPT and such portfolios. They simply indicate the need to run the optimization with an additional set of mathematically expressed constraints that would not normally apply to financial portfolios.
Furthermore, some of the simplest elements of Modern Portfolio Theory are applicable to virtually any kind of portfolio. The concept of capturing the risk tolerance of an investor by documenting how much risk is acceptable for a given return may be applied to a variety of decision analysis problems. MPT uses historical variance as a measure of risk, but portfolios of assets like major projects do not have a well-defined "historical variance". In this case, the MPT investment boundary can be expressed in more general terms like "chance of an ROI less than cost of capital" or "chance of losing more than half of the investment". When risk is put in terms of uncertainty about forecasts and possible losses then the concept is transferable to various types of investment.[34]
{{cite journal}}
: CS1 maint: multiple names: authors list (link)
{{cite web}}
: CS1 maint: numeric names: authors list (link)