Creditflux Newsletter

Analysis

Modelling default distributions
Thursday, December 1, 2011. Joe Pimbley

Pricing and risk analysis for debt securities backed by portfolios of assets such as receivables, bonds and loans require the assumption of a default distribution for future defaults by the underlying obligors [1]. It is challenging enough to estimate the expected (i.e., mean) number of defaults over time using historical performance of similar assets or plausible scoring models. But today’s structured finance analyst has less guidance – and correspondingly less confidence – in projecting a default distribution.

Definition of the default distribution

The default distribution is simply the tabulation of all possible default outcomes and their respective probabilities. For example, if a portfolio has N obligors, then it is possible that none of these obligors will default during the life of the transaction. It’s also possible that every obligor will default or that any number n with 0 < n < N will default. For the two endpoints (0 and N) and all values of n in between, specifying the probability function P(n) defines the default distribution. [2]

There are surprisingly few good methods for assuming (or guessing) an analytical default distribution. The most basic beginning point requires all obligors to be independent of each other and have a common default probability p. With these restrictions, the probability function becomes the binomial distribution:

 

In addition to the independence and common default probability restrictions, equation (1) is unwieldy for large values of n and N. Even with N as low as 20, N! is greater than 2 × 1018 [3]. For portfolios where N is large (greater than 40 or so), an improvement to equation (1) is the corresponding Gaussian distribution [4]:

Here we use x as the fraction n/N of defaulting obligors and treat this x(0 ≤ x ≤ 1) as a continuous variable, which is both convenient and plausible for large N. Instead of being a simple probability, the function f(x) is a probability density function (PDF).

An additional, extremely useful mathematical representation of a default distribution for the continuous default variable x is the cumulative distribution function (CDF), which we write as F(x). F(x) is the probability that the actual default fraction is less than x. Hence, F(0) is zero, since the fraction of the portfolio in default must be greater than or equal to zero. Also, F(1) is one, since the default fraction will certainly be less than or equal to 100% [5]. The PDF f(x) is the (calculus) derivative of the CDF F(x): f(x) = F'(x). In deriving or analysing a default distribution, it is often more convenient to work with the PDF (or CDF) and then derive the CDF (or PDF) with the relationship f(x) = F'(x). The CDF for the PDF of equation (2) is, in terms of the standard normal distribution Φ( ),

 

Vasicek allows for correlation

Of the several shortcomings of equations (1) to (3), the most severe is the assumption that the large number N of obligors default independently of one another. This is the zero correlation assumption, and it is unwarranted. In the financial world there is not much that we know about correlation – but we know that it’s not zero!

Oldrich Vasicek devised a default distribution with the assumption that all obligors have mutual asset correlation ρ with one another. The CDF for the Vasicek distribution is, in terms of the standard normal distribution Φ( ) and its inverse Φ-1( ) [6],

 

This result maintains the restrictive assumption that all obligors have the same default probability p and adds the new – and also restrictive – assumption that the number of obligors N becomes infinite. Hence, a common name for this Vasicek distribution is the large pool approximation (LPA). The PDF of the Vasicek LPA is, in terms of the standard normal density Φ( ) and standard normal distribution inverse Φ-1( ),

 

Despite its limitations, the LPA is a vast improvement over binomial or Gaussian representations. One can see directly the influence of correlation ρ in equation (5). The density function is fat-tailed as is evident in Figure 1, which compares a typical asset correlation of 20% with a near-zero value of 1% (with obligor default probability p of 10%).

 

Extension to the Vasicek large pool approximation

Equations (4) and (5) show the impact on the default distribution of the imposition of a uniform asset correlation ρ on the obligor portfolio. It is useful to have this Vasicek closed-form analytical solution even if the assumptions do not fit real-world situations. In recent work, we have created an extended LPA (XLPA) that relaxes the assumptions of infinite pool size and common default probability [7]. The Gaussian function of equation (2) embeds the finite pool size N. In essence, we combine the Gaussian with the LPA to get the XLPA CDF:

In equation (6), F( )and f( ) are the Vasicek CDF and PDF of equations (4) and (5), respectively. H( ) is the Heaviside function [9]. One computes the parameter &gamma; in equation (6) from the specific default probabilities of the subject portfolio. When one creates a loss distribution rather than a default distribution in this framework, the value of γ also includes variation in the obligor concentrations and loss severities.

 

The XLPA CDF of equation (6) is more difficult to read than the Vasicek LPA CDF of equation (4). The purpose of the XLPA is not so much to replace the LPA as it is to provide a demonstration of where the LPA remains accurate. As an example, industry practitioners apply the LPA to real portfolios that, of course, have finite pool size N and nonuniform obligor default probabilities. Figure 2 compares the Vasicek LPA PDF with that of the XLPA for pool sizes of 100 and 1,000 obligors. With mean default probability p of 10% and asset correlation of 20%, we see that a pool of 1,000 obligors is sufficiently large such that the LPA provides a good approximation. When the pool size falls to 100, though, it is better to have the XLPA correction.

 

Analytical solutions and simple correlation prescriptions

Expert practitioners must apply caution and judgment in the analysis of structured transactions whether the goal be pricing, risk assessment, or credit rating determination. Even if one considers the numerous assumptions for the LPA or XLPA of equations (4), (5), and (6) to be valid, the input variables of mean default probability p and asset correlation ρ are merely estimates. Hence, the resulting default distribution is, at best, a plausible estimate that will be least reliable precisely where it is most needed – in one of the tails of the distribution.

Industry best practice does not rely solely on analytical solutions. Some alternative approaches combine numerical solution techniques, such as Monte Carlo simulation, with greater refinement in correlation specification or with default clustering that is not expressible in terms of pair-wise correlation. Other approaches are of the stress test variety, which subject each portfolio asset to specific stress scenarios and/or apply a global default stress to all assets.

A full analysis should combine both the refined model solution and the stress tests, since neither provides a complete answer. The refined model default distribution will still be unreliable in the tails due to uncertainty in the inputs. The stress tests provide no guidance on the probabilistic default distribution since, by their nature, the likelihood of each scenario is unknown.

The absence of a fully satisfactory solution to the problem of structured finance analysis has two immediate ramifications. First, the application of analytical solutions such as the LPA and XLPA to simplified problems remain important because they help analysts assess the results of more complicated models and procedures. Second, pricing and risk assessment in structured finance will never be fully model driven. Out-of-model considerations – such as illiquidity and supply/demand, input uncertainty and potential downside events (involving bank counterparties, the sovereign or the currency) – all require the addition of expert judgment.

 

Joe Pimbley is the principal of Maxwell Consulting (maxwell-consulting.com). His expertise includes enterprise risk management, structured products, derivatives and quantitative modelling.

 

Notes

[1] We restrict ourselves here to structured finance bonds that are “cash flow deals” rather than “market value deals”.  With this restriction, the analysis treats all assets as either making contractual payments to the issuing special purpose vehicle (SPV) or as being in default.  The number and likelihood of assets being in default (“default distribution”), then, is of central importance to the performance of the structured finance liabilities.

[2] Note that probability values must be non-negative and the sum of all values must be 1.

[3] The notation “N!” – read as “N factorial” – stands for the product of N with all lesser integers down to 1.

[4] See Pimbley (2011) for this derivation:  “Portfolio Loss Analysis:  Extending the Large Pool Approximation”, Risk Professional, 14-22, August 2011 at http://www.garp.org/media/672363/quantperspectives_portfoliolossanalysis.pdf .

[5] A mathematical purist could correctly note that we should mention the possibility of discontinuities at the x values of 0 and 1.  The discussion of the text remains valid after minor modification in this circumstance.

[6] See Vasicek (1987):  O. Vasicek, “Probability of Loss on Loan Portfolio”, KMV Working Paper, 1987.  Or see Schönbucher (2003):  P. J. Schönbucher, Credit derivatives pricing models, John Wiley & Sons Ltd., 2003.  Or see Pimbley (2011).

[7]See Pimbley (2011) for details.

[8] A more descriptive name for the discontinuous Heaviside function is the “step function”.   is 1 for x > 0, 0 for x < 0, and one-half for x = 0.

Recent bond & loan issuance

>>More information from the Issuer Tracker

CFlux secondary 
CLO index levels:

Index
21 May
CFlux USD AAA  ↑ 96.2
CFlux USD AA  ↑

88.3

CFlux USD A  ↓ 84.1
CFlux USD BBB  ↓ 75.3
CFlux USD BB  ↓

74.1

CFlux USD EQ  ↑ 77.5

 

>> More information & historical data