1. Random
  2. 4. Special Distributions
  3. Stable Distributions

Stable Distributions

This section discusses a theoretical topic that you may want to skip if you are a new student of probability.

Basic Theory

Stable distributions are an important general class of probability distributions on R that are defined in terms of location-scale transformations. Stable distributions occur as limits (in distribution) of scaled and centered sums of independent, identically distributed variables. Such limits generalize the central limit theorem, and so stable distributions generalize the normal distribution in a sense. The pioneering work on stable distributions was done by Paul Lévy.

Definition

In this section, we consider real-valued random variables whose distributions are not degenerate (that is, not concentrated at a single value). After all, a random variable with a degenerate distribution is not really random, and so is not of much interest.

Random variable X has a stable distribution if the following condition holds: If nN+ and (X1,X2,,Xn) is a sequence of independent copies of X, then X1+X2++Xn has the same distribution as an+bnX for some anR and bn(0,). If an=0 for nN+ then the distribution of X is strictly stable.

  1. The parameters an for nN+ are the centering parameters.
  2. The parameters bn for nN+ are the norming parameters.
Details:

Since the distribution of X is not point mass at 0, note that if the distribution of a+bX is the same as the distribution of c+dX for some a,cR and b,d(0,), then a=c and b=d. Thus, the centering parameters an and the norming parameters bn are uniquely defined for nN+.

Recall that two distributions on R that are related by a location-scale transformation are said to be of the same type, and that being of the same type defines an equivalence relation on the class of distributions on R. With this terminology, the definition of stability has a more elegant expression: X has a stable distribution if the sum of a finite number of independent copies of X is of the same type as X. As we will see, the norming parameters are more important than the centering parameters, and in fact, only certain norming parameters can occur.

Basic Properties

We start with some very simple results that follow easily from the definition, before moving on to the deeper results.

Suppose that X has a stable distribution with mean μ and finite variance. Then the norming parameters are n and the centering parameters are (nn)μ for nN+.

Details:

As usual, let an and bn denote the centering and norming parameters of X for nN+, and let σ2 denote the (finite) variance of X. Suppose that nN+ and that (X1,X2,,Xn) is a sequence of independent copies of X. Then X1+X2++Xn has the same distribution as an+bnX. Taking variances gives nσ2=bn2σ2 and hence bn=n. Taking expected values now gives nμ=an+nμ.

It will turn out that the only stable distribution with finite variance is the normal distribution, but the result above is useful as an intermediate step. Next, it seems fairly clear from the definition that the family of stable distributions is itself a location-scale family.

Suppose that the distribution of X is stable, with centering parameters anR and norming parameters bn(0,)for nN+. If cR and d(0,), then the distribution of Y=c+dX is also stable, with centering parameters dan+(nbn)c and norming parameters bn for nN+.

Details:

Suppose that nN+ and that (Y1,Y2,,Yn) is a sequence of independent copies of Y. Then Y1+Y2++Yn has the same distribution nc+d(X1+X2++Xn) where (X1,X2,) is a sequence of independent copies of X. By stability, X1+X2++Xn has the same distribution as an+bnX. Hence Y1+Y2++Yn has the same distribution as (nc+dan)+dbnX, which in turn has the same distribution as [dan+(nbn)c]+bnY.

An important point is the the norming parameters are unchanged under a location-scale transformation.

Suppose that the distribution of X is stable, with centering parameters anR and norming parameters bn(0,) for nN+. Then the distribution of X is stable, with centering parameters an and norming parameters bn for nN+.

Details:

If nN+ and (X1,X2,,Xn) is a sequence of independent copies of X then (X1,X2,,Xn) is a sequence of independent copies of X. By stability, i=1nXi has the same distribution as (an+bnX)=an+bn(X).

From [3] and [4], if X has a stable distribution, then so does c+dX, with the same norming parameters, for every c,dR with d0. Stable distributions are also closed under convolution (corresponding to sums of independent variables) if the norming parameters are the same.

Suppose that X and Y are independent variables. Assume also that X has a stable distribution with centering parameters anR and norming parameters bn(0,) for nN+, and that Y has a stable distribution with centering parameters cnR and the same norming parameters bn for nN+. Then Z=X+Y has a stable distribution with centering paraemters an+cn and norming parameters bn for nN+.

Details:

Suppose that nN+ and that (Z1,X2,,Zn) is a sequence of independent copies of Z. Then i=1nZi has the same distribution as i=1nXi+i=1nYi where X=(X1,X2,,Xn) is a sequence of independent copies of X, and Y=(Y1,Y2,,Yn) is a sequence of independent copies of Y, and where X and Y are independent. By stability, this is the same as the distribution of (an+bnX)+(cn+bnY)=(an+cn)+bn(X+Y).

We can now give another characterization of stability that just involves two independent copies of X.

Random variable X has a stable distribution if and only if the following condition holds: If X1,X2 are independent copies of X and d1,d2(0,) then d1X1+d2X2 has the same distribution as a+bX for some aR and b(0,).

Details:

Clearly the condition in definition [1] implies the condition here. Coversely, suppose that the condition here holds. We will show by induction that the condition in definition [1] holds. For n=2, definition [1] is a special case of the condition in this theorem, with d1=d2=1. Suppose that condition [1] holds for a given nN+. Suppose that (X1,X2,,Xn,Xn+1) is a sequence of independent copies of X. By the induction hypothesis, Yn=X1+X2++Xn has the same distribution as an+bnX for some anR and bn(0,). By independence, Yn+1=X1+X2++Xn+Xn+1 has the same distribution as an+bnX1+Xn+1. By another application of the condition above, bnX1+Xn+1 has the same distribution as c+bn+1X for some cR and bn+1(0,). But then Yn+1 has the same distribution as (an+c)+bn+1X.

As a corollary of [5] and [6], we have the following:

Suppose that X and Y are independent with the same stable distribution. Then the distribution of XY is stable, with the same norming parameters.

Note that the distribution of XY is symmetric (about 0). The last result is useful because it allows us to get rid of the centering parameters when proving facts about the norming parameters. Here is the most important of those facts:

Suppose that X has a stable distribution. Then the norming parameters have the form bn=n1/α for nN+, for some α(0,2]. The parameter α is known as the index or characteristic exponent of the distribution.

Details:

The proof is in several steps, and is based on the proof in An Introduction to Probability Theory and Its Applications, Volume II, by William Feller. The proof uses the basic trick of writing a sum of independent copies of X in different ways in order to obtain relationships between the norming constants bn.

First we can assume from [7] that the distribution of X is symmetric and strictly stable. Let (X1,X2,) be a sequence of independent copies of X. Let Yn=i=1nXi for nN+. Now let n,mN+ and consider Ymn. Directly from stability, Ymn has the same distribution as bmnX. On the other hand, Ymn can be thought of as a sum of m blocks, where each block is a sum of n independent copies of X. Each block has the same distribution as bnX, and since the blocks are independent, it follows that Ymn has the same distribution as bnX1+bnX2++bnXm=bn(X1+X2++Xm) But by another application of stability, the random variable on the right has the same distribution as bnbmX. It then follows that bmn=bmbn for all m,nN+ which in turn leads to bnk=bnk for all n,kN+.

We use the same trick again, this time with a sum. Let m,nN+ and consider Ym+n. Directly from stability, Ym+n has the same distribution as bm+nX. On the other hand, Ym+n can be thought of as the sum of two blocks. The first is the sum of m independent copies of X and hence has the same distribution as bmX, while the second is the sum of n independent copies of X and hence has the same distribution as bnX. Since the blocks are independent, it follows that bm+nX has the same distribution as bmX1+bnX2, or equivalently, X has the same distribution as U=bmbm+nX1+bnbm+nX2 Next note that for x>0, {X10,X2>bm+nbnx}{U>x} and so by independence, P(U>x)P(X10,X2>bm+nbnx)=P(X10)P(X2>bm+nbnx) But by symmetry, P(X10)12. Also X2 and U have the same distribution as X, so we conclude that P(X>x)12P(X>bm+nbnx),x>0 It follows that the ratios bn/bm+n are bounded for m,nN+. If that were not the case, we could find a sequence of integers m,n with bm+n/bn0, in which case the displayed equation above would give the contradiction P(X>x)14 for all x>0. Restating, the ratios bk/bn are bounded for k,nN+ with k<n.

Fix rN+. There exists a unique α(0,) with br=r1/α. It then follows from step 1 above that bn=n1/α for every n=rj with jN+. Similarly, if sN+, there exists β(0,) with bs=s1/β and then bm=m1/β for every m=sk with kN+. For our next step, we show that α=β and it then follows that bn=n1/α for every nN+. Towards that end, note that if m=sk with kN+ there exists n=rj with jN+ with nmrn. Hence bm=m1/β(rn)1/β=r1/βbnα/β Therefore bmbnr1/βbnα/β1 Since the coefficients bn are unbounded in nN+, but the ratios bn/bm are bounded for m,nN+ with m>n, the last inequality implies that βα. Reversing the roles of m and n then gives αβ and hence α=β.

All that remains to show is that α2. We will do this by showing that if α>2, then X must have finite variance, in which case the finite variance property in [2] leads to the contradiction α=2. Since X2 is nonnegative, E(X2)=0P(X2>x)dx=0P(|X|>x)dx=k=12k12kP(|X|>x)dx So the idea is to find bounds on the integrals on the right so that the sum converges. Towards that end, note that for t>0 and nN+ P(|Yn|>tbn)=P(bn|X|>tbn)=P(|X|>t) Hence we can choose t so that P(|Yn|>tbn)14. On the other hand, using a special inequality for symmetric distributions, 12(1exp[nP(|X|>tbn)])P(|Yn|>tbn) This implies that nP(|X|>tbn) is bounded in n or otherwise the two inequalities together would lead to 1214. Substituting x=tbn=tn1/α leads to P(|X|>x)Mxα for some M>0. It then follows that 2k12kP(|X|>x)dxM2k(1α/2) If α>2, the series with the terms on the right converges and we have E(X2)<.

Every stable distribution is continuous.

Details:

As in the proof of [8], suppose that X has a symmetric stable distribution with norming parameters bn for nN+. As a special case of the last proof, for nN+, X has the same distribution as 1bn+1X1+bnbn+1X2 where X1 and X2 are independent and also have this distribution. Suppose now that P(X=x)=p for some x0 where p>0. Then P(X=1+bnb1+nx)P(X1=x)P(X2=x)=p2>0 If the index α1, the points (1+bn)b1+nx=1+n1/α(1+n)1/αx,nN+ are distinct, which gives us infinitely many atoms, each with probability at least p2—clearly a contradiction.

Next, suppose that the only atom is x=0 and that P(X=0)=p where p(0,1). Then X1+X2 has the same distribution as b2X. But P(X1+X2=0)=P(X1=0)P(X2=0)=p2 while P(b2X=0)=P(X=0)=p, another contradiction.

The next result is a precise statement of the limit theorem alluded to in the introductory paragraph.

Suppose that (X1,X2,) is a sequence of independent, identically distributed random variables, and let Yn=i=1nXi for nN+. If there exist constants anR and bn(0,) for nN+ such that (Ynan)/bn has a (non-degenerate) limiting distribution as n, then the limiting distribution is stable.

The following theorem completely characterizes stable distributions in terms of the characteristic function.

Suppose that X has a stable distribution. The characteristic function of X has the following form, for some α(0,2], β[1,1], cR, and d(0,) χ(t)=E(eitX)=exp(itcdα|t|α[1+iβsgn(t)uα(t)]),tR where sgn is the usual sign function, and where uα(t)={tan(πα2),α12πln(|t|),α=1

  1. The parameter α is the index, as before.
  2. The parameter β is the skewness parameter.
  3. The parameter c is the location parameter.
  4. The parameter d is the scale parameter.

Thus, the family of stable distributions is a 4 parameter family. The index parameter α and and the skewness parameter β can be considered shape parameters. When the location parameter c=0 and the scale parameter d=1, we get the standard form of the stable distributions, with characteristic function χ(t)=E(eitX)=exp(|t|α[1+iβsgn(t)uα(t)]),tR

The characteristic function gives another proof that stable distributions are closed under convolution (corresponding to sums of independent variables), if the index is fixed.

Suppose that X1 and X2 are independent random variables, and that X1 and X2 have the stable distribution with common index α(0,2], skewness parameter βk[1,1], location parameter ckR, and scale parameter dk(0,). Then X1+X2 has the stable distribution with index α, location parameter c=c1+c2 , scale parameter d=(d1α+d2α)1/α, and skewness parameter β=β1d1α+β2d2αd1α+d2α

Details:

Let χk denote the characteristic function of Xk for k{1,2}. Then X1+X2 has characteristic function χ=χ1χ2. The result follows from using the form of the characteristic function in [11] and some algebra.

Special Cases

Three special parametric families of distributions studied in this chapter are stable. In the proofs in this subsection, we use the definition of stability and various important properties of the distributions. These properties, in turn, are verified in the sections devoted to the distributions. We also give proofs based on the characteristic function, which allows us to identify the skewness parameter.

The normal distribution is stable with index α=2. There is no skewness parameter.

Details:

Suppose that Z has the standard normal distribution. If nN+ and (Z1,Z2,,Zn) is a sequence of independent copies of A, then Z1+Z2++Zn has the normal distribution with mean 0 and variance n. But this is also the distribution of nZ. Hence the standard normal distribution is strictly stable, with index α=2. The normal distribution with mean μR and standard deviation σ(0,) is the distribution of μ+σZ. From our basic properties above, this distribution is stable with index α=2 and centering parameters (nn)μ for nN.

In terms of the characteristic function, note that if α=2 then uα(t)=tan(π)=0 so the skewness parameter β drops out completely. The characteristic function in standard form χ(t)=et2 for tR, which is the characteristic function of the normal distribution with mean 0 and variance 2.

Of course, the normal distribution has finite variance, so once we know that it is stable, it follows from the finite variance property [2] that the index must be 2. Moreover, the characteristic function shows that the normal distribution is the only stable distribution with index 2, and hence the only stable distribution with finite variance.

Open the special distribution simulator and select the normal distribution. Vary the parameters and note the shape and location of the probability density function. For various values of the parameters, run the simulation 1000 times and compare the empirical density function to the probability density function.

The Cauchy distribution is stable with index α=1 and skewness parameter β=0.

Details:

Suppose that Z has the standard Cauchy distribiution. If nN+ and (Z1,Z2,,Zn) is a sequence of independent copies of Z, then Z1+Z2++Zn has the Cauchy distribution scale parameter n. By definition this is the same as the distribution of nZ. Hence the standard Cauchy distribution is strictly stable, with index α=1. The Cauchy distribution with location parameter aR and scale parameter b(0,) is the distribution of a+bZ. From our basic properties above, this distribution is strictly stable with index α=1.

When α=1 and β=0 the characteristic function in standard form is χ(t)=exp(|t|) for tR, which is the characteristic function of the standard Cauchy distribution.

Open the special distribution simulator and select the Cauchy distribution. Vary the parameters and note the shape and location of the probability density function. For various values of the parameters, run the simulation 1000 times and compare the empirical density function to the probability density function.

The Lévy distribution is stable with index α=12 and skewness parameter β=1.

Details:

If nN+ and (Z1,Z2,,Zn) is a sequence of independent variables, each with the standard Lévy distribution, then Z1+Z2++Zn has the Lévy distribution scale parameter n2. By definition this is the same as the distribution of n2Z where Z has the standard Lévy distribution. Hence the standard Lévy distribution is strictly stable, with index α=12. The Lévy distribution with location parameter aR and scale parameter b(0,) is the distribution of a+bZ. From our basic properties above, this distribution is stable with index α=12 and centering parameters (nn2)a for nN+.

When α=12 note that uα(t)=tan(π4)=1. So the characteristic function in standard form with α=12 and β=1 is χ(t)=exp(|t|1/2[1+isgn(t)]) which is the characteristic function of the standard Lévy distribution.

Open the special distribution simulator and select the Lévy distribution. Vary the parameters and note the shape and location of the probability density function. For various values of the parameters, run the simulation 1000 times and compare the empirical density function to the probability density function.

The normal, Cauchy, and Lévy distributions are the only stable distributions for which the probability density function is known in closed form.