5. Thinning and Superposition

Recall that a Poisson process with rate \( r \in (0, \infty) \) defines three interrelated stochastic processes:

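  1. The sequence of inter-arrival times \( \boldsymbol{X} = (X_1, X_2, \ldots) \). These are independent random variables, and each has the exponential distribution on \( [0, \infty) \) with rate \( r \).
  2. The sequence of arrival times \( \boldsymbol{T} = (T_0, T_1, \ldots) \). This is the partial sum sequence associated with \( \boldsymbol{X} \), so that \( T_n \) has the gamma distribution on \( [0, \infty) \) for \( n \in \mathbb{N}_+ \) with shape parameter \( n \) and rate parameter \( r \).
  3. The counting process \( \boldsymbol{N} = (N_t: t \ge 0) \) where \( N_t \) is the number of arrivals in \( (0, t] \) for \( t \in [0, \infty) \) and has the Poisson distribution on \( \mathbb{N} \) with parameter \( r t \). This process has an inverse relationship with \( \boldsymbol{T} \) so that \( N_t \ge n \) if and only if \( T_n \le t \) for \( n \in \mathbb{N} \) and \( t \in [0, \infty) \).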
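
The relationship between the three processes can be illustrated with a small simulation. The following is a minimal sketch, not part of the original exposition: the function name `simulate_poisson_process`, the seed, and the parameter values are all illustrative assumptions. It builds the arrival times as partial sums of exponential inter-arrival times and checks that the average count in \( (0, t] \) is close to \( r t \).

```python
import random

def simulate_poisson_process(rate, horizon, rng):
    """Return the arrival times in (0, horizon] of a simulated Poisson process."""
    arrivals = []
    t = rng.expovariate(rate)          # first inter-arrival time X_1
    while t <= horizon:
        arrivals.append(t)             # T_n = X_1 + ... + X_n
        t += rng.expovariate(rate)     # add the next inter-arrival time
    return arrivals

# Illustrative parameters (assumptions, not taken from the text).
rng = random.Random(2024)
rate, horizon, runs = 2.0, 3.0, 20000

# N_t is simply the number of arrival times that fall in (0, t].
counts = [len(simulate_poisson_process(rate, horizon, rng)) for _ in range(runs)]
mean_count = sum(counts) / runs
print(f"empirical mean of N_t: {mean_count:.3f}, theoretical r t: {rate * horizon:.3f}")
```

Counting the arrival times in \( (0, t] \) is exactly the inverse relationship between the arrival time sequence and the counting process.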

Thinning

Thinning or splitting a Poisson process refers to classifying each random point, independently, into one of a finite number of different types. The random points of a given type also form Poisson processes, and these processes are independent. Our exposition will concentrate on the case of just two types, but this case has all of the essential ideas.

The Two-Type Process

Consider a Poisson process with rate \( r \in (0, \infty) \). Suppose that each arrival, independently of the others, is one of two types: type 1 with probability \( p \) and type 0 with probability \( 1 - p \), where \( p \in (0, 1) \) is a parameter. Here are some common examples:

  1. The arrivals are radioactive emissions and each emitted particle is either detected (type 1) or missed (type 0) by a counter.
  2. The arrivals are customers at a service station and each customer is classified as either male (type 1) or female (type 0).

We want to consider the type 1 and type 0 random points separately. For this reason, the new random process is usually referred to as thinning or splitting the original Poisson process. In some applications, the type 1 points are accepted while the type 0 points are rejected. The main result of this section is that the type 1 and type 0 points form separate Poisson processes, with rates \( r p \) and \( r (1 - p) \) respectively, and are independent. We will explore this important result from several points of view.

In the previous sections, we have explored the analogy between the Bernoulli trials process and the Poisson process. Both have the strong renewal property that at each fixed time and at each arrival time, the process stochastically restarts, independently of the past. The difference, of course, is that time is discrete in the Bernoulli trials process and continuous in the Poisson process. In this section, we have both processes simultaneously, and given our previous explorations, it's perhaps not surprising that this leads to some interesting mathematics.

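Consider the two-type Poisson process in [2]. In addition to the processes \( \boldsymbol{X} \), \( \boldsymbol{T} \), and \( \boldsymbol{N} \) in [1], we have a sequence of Bernoulli trials \( \boldsymbol{I} = (I_1, I_2, \ldots) \) with success parameter \( p \). Indicator variable \( I_j \) specifies the type of the \( j \)th arrival. Moreover, because of our assumptions, \( \boldsymbol{I} \) is independent of \( \boldsymbol{X} \), \( \boldsymbol{T} \), and \( \boldsymbol{N} \).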

Recall that \( V_k \), the trial number of the \( k \)th success, has the negative binomial distribution with parameters \( k \) and \( p \) for \( k \in \mathbb{N}_+ \). We take \( V_0 = 0 \) by convention. Also, \( U_k \), the number of trials needed to go from the \( (k - 1) \)st success to the \( k \)th success, has the geometric distribution with success parameter \( p \) for \( k \in \mathbb{N}_+ \). Moreover, \( \boldsymbol{U} = (U_1, U_2, \ldots) \) is a sequence of independent variables and \( \boldsymbol{V} = (V_0, V_1, \ldots) \) is the partial sum process associated with \( \boldsymbol{U} \):
\[ V_k = \sum_{i=1}^k U_i, \; k \in \mathbb{N}; \qquad U_k = V_k - V_{k-1}, \; k \in \mathbb{N}_+ \]
As noted above, the Bernoulli trials process can be thought of as random points in discrete time, namely the trial numbers of the successes. With this understanding, \( \boldsymbol{U} \) is the sequence of inter-arrival times and \( \boldsymbol{V} \) is the sequence of arrival times.

The next result shows that the type 1 points form a Poisson process with rate \( p r \).

For the two-type process in [3], consider just the type 1 points. The time between the arrival of the \( (k - 1) \)st and \( k \)th type 1 point is
\[ Y_k = \sum_{i = V_{k-1} + 1}^{V_k} X_i, \quad k \in \mathbb{N}_+ \]
\( \boldsymbol{Y} = (Y_1, Y_2, \ldots) \) is a sequence of independent variables and each has the exponential distribution with rate parameter \( p r \).

Details:

Note that \( Y_k \) has \( U_k \) terms. From the renewal properties of the Poisson process and the Bernoulli trials process, the inter-arrival times are independent and identically distributed. Each inter-arrival time is the sum of a random number of independent terms; each term has the exponential distribution with rate \( r \), and the number of terms has the geometric distribution on \( \mathbb{N}_+ \) with parameter \( p \). Moreover, the number of terms is independent of the terms themselves. We showed in the section on the exponential distribution that a random sum of this form has the exponential distribution with parameter \( r p \).
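
The random-sum argument can be checked empirically. The sketch below is an illustration under assumed parameter values (the name `type1_interarrival` and the seed are hypothetical): it adds exponential terms with rate \( r \) until the first type 1 mark occurs, so the number of terms is geometric with parameter \( p \), and then compares the sample mean and one tail probability with those of the exponential distribution with rate \( r p \).

```python
import math
import random

def type1_interarrival(rate, p, rng):
    """Sum exponential(rate) terms until the first type 1 arrival.

    The number of terms added is geometric on {1, 2, ...} with parameter p.
    """
    total = 0.0
    while True:
        total += rng.expovariate(rate)   # next inter-arrival time X_i
        if rng.random() < p:             # indicator I_i = 1: arrival is type 1
            return total

# Illustrative parameters (assumptions, not taken from the text).
rng = random.Random(11)
rate, p, n = 2.0, 0.7, 50000

sample = [type1_interarrival(rate, p, rng) for _ in range(n)]
sample_mean = sum(sample) / n
tail_at_1 = sum(y > 1.0 for y in sample) / n   # empirical P(Y > 1)

print(f"sample mean {sample_mean:.4f} vs 1/(r p) = {1 / (rate * p):.4f}")
print(f"empirical P(Y > 1) {tail_at_1:.4f} vs exp(-r p) = {math.exp(-rate * p):.4f}")
```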

Similarly, if \( \boldsymbol{Z} = (Z_1, Z_2, \ldots) \) is the sequence of inter-arrival times for the type 0 points, then \( \boldsymbol{Z} \) is a sequence of independent variables, and each has the exponential distribution with rate \( (1 - p) r \). Moreover, \( \boldsymbol{Y} \) and \( \boldsymbol{Z} \) are independent.

For \( t \in [0, \infty) \), let \( M_t \) denote the number of type 1 arrivals in \( (0, t] \) and \( W_t \) the number of type 0 arrivals in \( (0, t] \). So \( \boldsymbol{M} = (M_t: t \ge 0) \) and \( \boldsymbol{W} = (W_t: t \ge 0) \) are the counting processes for the type 1 arrivals and for the type 0 arrivals.

The next result follows from the previous results, but a direct proof is interesting.

For \( t \ge 0 \), \( M_t \) has the Poisson distribution with parameter \( p r t \), \( W_t \) has the Poisson distribution with parameter \( (1 - p) r t \), and \( M_t \) and \( W_t \) are independent.

Details:

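The important observation is that the conditional distribution of \( M_t \) given \( N_t = n \) is binomial with parameters \( n \) and \( p \). Thus for \( j \in \mathbb{N} \) and \( k \in \mathbb{N} \),
\begin{align*}
\mathbb{P}(M_t = j, W_t = k) &= \mathbb{P}(M_t = j, N_t = j + k) = \mathbb{P}(N_t = j + k) \, \mathbb{P}(M_t = j \mid N_t = j + k) \\
&= e^{-r t} \frac{(r t)^{j + k}}{(j + k)!} \, \frac{(j + k)!}{j! \, k!} p^j (1 - p)^k \\
&= e^{-p r t} \frac{(p r t)^j}{j!} \, e^{-(1 - p) r t} \frac{[(1 - p) r t]^k}{k!}
\end{align*}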
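
The factorization can also be verified numerically. The following sketch (helper names and parameter values are illustrative assumptions) computes the joint probability by conditioning on \( N_t = j + k \) and compares it with the product of the Poisson probabilities with parameters \( p r t \) and \( (1 - p) r t \) over a small grid of values.

```python
import math

def poisson_pmf(k, mean):
    """Probability that a Poisson variable with the given mean equals k."""
    return math.exp(-mean) * mean ** k / math.factorial(k)

def binomial_pmf(j, n, p):
    """Probability of j successes in n Bernoulli trials with parameter p."""
    return math.comb(n, j) * p ** j * (1 - p) ** (n - j)

def joint_pmf(j, k, r, p, t):
    """P(M_t = j, W_t = k), obtained by conditioning on N_t = j + k."""
    return poisson_pmf(j + k, r * t) * binomial_pmf(j, j + k, p)

# Illustrative parameters (assumptions, not taken from the text).
r, p, t = 2.0, 0.7, 3.0

# Largest gap between the conditioned form and the product form on a grid.
max_gap = max(
    abs(joint_pmf(j, k, r, p, t)
        - poisson_pmf(j, p * r * t) * poisson_pmf(k, (1 - p) * r * t))
    for j in range(15)
    for k in range(15)
)
print(f"largest discrepancy on the grid: {max_gap:.2e}")
```

Any discrepancy should be at the level of floating point rounding, since the two expressions are algebraically identical.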

In the two-type Poisson experiment, vary \( r \), \( p \), and \( t \) with the scrollbars and note the shape of the probability density functions. For various values of the parameters, run the experiment 1000 times and compare the relative frequency functions to the probability density functions.

Estimating the Number of Arrivals

Suppose that the type 1 arrivals are observable, but not the type 0 arrivals. This setting is natural, for example, if the arrivals are radioactive emissions, and the type 1 arrivals are emissions that are detected by a counter, while the type 0 arrivals are emissions that are missed. Suppose that for a given \( t \gt 0 \), we would like to estimate the total number of arrivals \( N_t \) after observing the number of type 1 arrivals \( M_t \).

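The conditional distribution of \( N_t \) given \( M_t = k \) is the same as the distribution of \( k + W_t \):
\[ \mathbb{P}(N_t = n \mid M_t = k) = e^{-(1 - p) r t} \frac{[(1 - p) r t]^{n - k}}{(n - k)!}, \quad n \in \{k, k + 1, \ldots\} \]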

Details:

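Recall from [6] that \( M_t \) and \( W_t \) are independent. Thus, for \( n \in \mathbb{N} \),
\[ \mathbb{P}(N_t = n \mid M_t = k) = \frac{\mathbb{P}(N_t = n, M_t = k)}{\mathbb{P}(M_t = k)} = \frac{\mathbb{P}(M_t = k, W_t = n - k)}{\mathbb{P}(M_t = k)} = \frac{\mathbb{P}(M_t = k) \, \mathbb{P}(W_t = n - k)}{\mathbb{P}(M_t = k)} = \mathbb{P}(W_t = n - k) \]
The form of the probability density function follows since \( W_t \) has the Poisson distribution with parameter \( (1 - p) r t \).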

\( \mathbb{E}(N_t \mid M_t = k) = k + (1 - p) r t \).

Details:

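This follows easily from the conditional distribution of \( N_t \) given \( M_t = k \), since \( \mathbb{E}(N_t \mid M_t = k) = \mathbb{E}(k + W_t) = k + (1 - p) r t \).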
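
As a numerical cross-check of the conditional mean \( k + (1 - p) r t \), the sketch below computes \( \mathbb{P}(N_t = n \mid M_t = k) \) directly from Bayes' rule, using the binomial distribution of \( M_t \) given \( N_t = n \), and then sums \( n \, \mathbb{P}(N_t = n \mid M_t = k) \) over a truncated range. The function names, the truncation point, and the parameter values are illustrative assumptions.

```python
import math

def poisson_pmf(k, mean):
    """Probability that a Poisson variable with the given mean equals k."""
    return math.exp(-mean) * mean ** k / math.factorial(k)

def conditional_pmf(n, k, r, p, t):
    """P(N_t = n | M_t = k) by Bayes' rule; M_t given N_t = n is binomial(n, p)."""
    if n < k:
        return 0.0
    joint = poisson_pmf(n, r * t) * math.comb(n, k) * p ** k * (1 - p) ** (n - k)
    return joint / poisson_pmf(k, p * r * t)

# Illustrative parameters (assumptions, not taken from the text).
r, p, t, k = 3.0, 0.8, 4.0, 9

# Truncate the sum well beyond the bulk of the Poisson((1 - p) r t) distribution.
cond_mean = sum(n * conditional_pmf(n, k, r, p, t) for n in range(k, k + 60))
formula = k + (1 - p) * r * t
print(f"numerical conditional mean: {cond_mean:.6f}, formula: {formula:.6f}")
```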

So if the overall rate \( r \) of the process and the probability \( p \) that an arrival is type 1 are known, then it follows from the general theory of conditional expectation that the best estimator of \( N_t \) based on \( M_t \), in the least squares sense, is
\[ \mathbb{E}(N_t \mid M_t) = M_t + (1 - p) r t \]

The mean square error is \( \mathbb{E}\left([N_t - \mathbb{E}(N_t \mid M_t)]^2\right) = (1 - p) r t \).

Details:

Note that \( N_t - \mathbb{E}(N_t \mid M_t) = W_t - (1 - p) r t \). Thus the mean square error is just \( \text{var}(W_t) = (1 - p) r t \).

As you might guess, the results in this subsection generalize from 2 types to \( k \) types for general \( k \in \mathbb{N}_+ \). Once again, we start with a Poisson process with rate \( r \gt 0 \). Suppose that each arrival, independently of the others, is type \( i \) with probability \( p_i \) for \( i \in \{0, 1, \ldots, k - 1\} \). Of course we must have \( p_i \ge 0 \) for each \( i \) and \( \sum_{i=0}^{k-1} p_i = 1 \). Then for each \( i \), the type \( i \) points form a Poisson process with rate \( p_i r \), and these processes are independent.
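
The multi-type case can be explored with the same kind of simulation. This is a rough sketch with assumed values (three types with probabilities 0.5, 0.3, 0.2; the name `split_counts` is hypothetical): each arrival is assigned a type independently, and the average count of type \( i \) points in \( (0, t] \) is compared with \( p_i r t \).

```python
import random

def split_counts(rate, probs, horizon, rng):
    """Count the arrivals of each type in (0, horizon] for one simulated run."""
    counts = [0] * len(probs)
    t = rng.expovariate(rate)
    while t <= horizon:
        # Assign a type to this arrival, independently of everything else.
        kind = rng.choices(range(len(probs)), weights=probs)[0]
        counts[kind] += 1
        t += rng.expovariate(rate)
    return counts

# Illustrative parameters (assumptions, not taken from the text).
rng = random.Random(5)
rate, probs, horizon, runs = 4.0, [0.5, 0.3, 0.2], 2.0, 10000

totals = [0] * len(probs)
for _ in range(runs):
    for i, c in enumerate(split_counts(rate, probs, horizon, rng)):
        totals[i] += c

means = [x / runs for x in totals]
for i, (m, q) in enumerate(zip(means, probs)):
    print(f"type {i}: empirical mean {m:.3f}, theoretical p_i r t = {q * rate * horizon:.3f}")
```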

Superposition

Complementary to splitting or thinning a Poisson process is superposition: if we combine the random points in time from independent Poisson processes, then we have a new Poisson process. The rate of the new process is the sum of the rates of the processes that were combined. Once again, our exposition will concentrate on the superposition of two processes. This case contains all of the essential ideas.

Two Processes

Suppose that we have two independent Poisson processes. We will denote the sequence of inter-arrival times, the sequence of arrival times, and the counting variables for process \( i \in \{1, 2\} \) by \( \boldsymbol{X}^i = (X_1^i, X_2^i, \ldots) \), \( \boldsymbol{T}^i = (T_1^i, T_2^i, \ldots) \), and \( \boldsymbol{N}^i = (N_t^i: t \in [0, \infty)) \), and we assume that process \( i \) has rate \( r_i \in (0, \infty) \). The new process that we want to consider is obtained by simply combining the random points. That is, the new random points are \( \{T_n^1: n \in \mathbb{N}_+\} \cup \{T_n^2: n \in \mathbb{N}_+\} \), but of course then ordered in time. We will denote the sequence of inter-arrival times, the sequence of arrival times, and the counting variables for the new process by \( \boldsymbol{X} = (X_1, X_2, \ldots) \), \( \boldsymbol{T} = (T_1, T_2, \ldots) \), and \( \boldsymbol{N} = (N_t: t \in [0, \infty)) \).

Clearly if \( A \) is an interval in \( [0, \infty) \) then
\[ N(A) = N^1(A) + N^2(A) \]
That is, the number of combined points in \( A \) is simply the sum of the numbers of points in \( A \) for processes 1 and 2. It's also worth noting that
\[ X_1 = \min\{X_1^1, X_1^2\} \]
That is, the first arrival time for the combined process is the smaller of the first arrival times for processes 1 and 2. The other inter-arrival times, and hence also the arrival times, for the combined process are harder to state.

The combined process is a Poisson process with rate \( r_1 + r_2 \).

Details:

As noted above, if \( A \) is a subinterval of \( [0, \infty) \) then \( N(A) = N^1(A) + N^2(A) \). The first term has the Poisson distribution with parameter \( r_1 \lambda(A) \), where \( \lambda(A) \) denotes the length of \( A \), the second term has the Poisson distribution with parameter \( r_2 \lambda(A) \), and the terms are independent. Hence \( N(A) \) has the Poisson distribution with parameter \( r_1 \lambda(A) + r_2 \lambda(A) = (r_1 + r_2) \lambda(A) \). Thus the counting process has stationary, Poisson distributed increments. Next, if \( (A_1, A_2, \ldots, A_n) \) is a sequence of disjoint subintervals of \( [0, \infty) \) then
\[ \left(N(A_1), N(A_2), \ldots, N(A_n)\right) = \left(N^1(A_1) + N^2(A_1), N^1(A_2) + N^2(A_2), \ldots, N^1(A_n) + N^2(A_n)\right) \]
is an independent sequence, so the counting process has independent increments.
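
Superposition is easy to explore by simulation. The sketch below is illustrative only (the rates, the time horizon, the seed, and the name `arrival_times` are assumptions): it merges the arrival times of two independent simulated processes, sorts them, and compares the inter-arrival times of the merged process with the exponential distribution with rate \( r_1 + r_2 \).

```python
import math
import random

def arrival_times(rate, horizon, rng):
    """Return the arrival times in (0, horizon] of a simulated Poisson process."""
    times = []
    t = rng.expovariate(rate)
    while t <= horizon:
        times.append(t)
        t += rng.expovariate(rate)
    return times

# Illustrative parameters (assumptions, not taken from the text).
rng = random.Random(3)
r1, r2, horizon = 1.5, 2.5, 5000.0

# Combine the random points of the two processes and order them in time.
merged = sorted(arrival_times(r1, horizon, rng) + arrival_times(r2, horizon, rng))

# Inter-arrival times of the combined process, measuring the first from time 0.
gaps = [b - a for a, b in zip([0.0] + merged, merged)]
mean_gap = sum(gaps) / len(gaps)
tail = sum(g > 0.25 for g in gaps) / len(gaps)   # empirical P(X > 0.25)

print(f"mean gap {mean_gap:.4f} vs 1/(r1 + r2) = {1 / (r1 + r2):.4f}")
print(f"empirical P(X > 0.25) {tail:.4f} vs exp(-(r1 + r2) 0.25) = {math.exp(-(r1 + r2) * 0.25):.4f}")
```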

Computational Exercises

In the two-type Poisson experiment, set \( r = 2 \), \( t = 3 \), and \( p = 0.7 \). Run the experiment 1000 times. Compute the appropriate relative frequency functions and investigate empirically the independence of the number of type 1 points and the number of type 0 points.
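
For readers without access to the interactive experiment, a simulation along the following lines could stand in for it. This is only a sketch: the function name `two_type_counts` and the seed are assumptions, and the sample covariance is used as one simple (and incomplete) indicator of independence, since independent counts have covariance 0.

```python
import random

def two_type_counts(rate, p, horizon, rng):
    """Return (M_t, W_t) for one run of the two-type process on (0, horizon]."""
    m = w = 0
    t = rng.expovariate(rate)
    while t <= horizon:
        if rng.random() < p:
            m += 1      # type 1 arrival
        else:
            w += 1      # type 0 arrival
        t += rng.expovariate(rate)
    return m, w

rng = random.Random(42)          # seed chosen arbitrarily for reproducibility
rate, p, horizon, runs = 2.0, 0.7, 3.0, 1000

data = [two_type_counts(rate, p, horizon, rng) for _ in range(runs)]
mean_m = sum(m for m, _ in data) / runs
mean_w = sum(w for _, w in data) / runs
cov = sum((m - mean_m) * (w - mean_w) for m, w in data) / (runs - 1)

print(f"mean of M_t: {mean_m:.3f} (theory {p * rate * horizon:.3f})")
print(f"mean of W_t: {mean_w:.3f} (theory {(1 - p) * rate * horizon:.3f})")
print(f"sample covariance of M_t and W_t: {cov:.3f} (theory 0)")
```

A fuller check would compare the joint relative frequencies with the products of the marginal relative frequencies.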

Suppose that customers arrive at a service station according to the Poisson model, with rate r=20 per hour. Moreover, each customer, independently, is female with probability 0.6 and male with probability 0.4. Find the probability that in a 2 hour period, there will be at least 20 women and at least 15 men.

Details:

0.5814

In the two-type Poisson experiment, set \( r = 3 \), \( t = 4 \), and \( p = 0.8 \). Run the experiment 100 times.

  1. Compute the estimate of \( N_t \) based on \( M_t \) for each run.
  2. Over the 100 runs, compute the average of the squares of the errors.
  3. Compare the result in part 2 with [14].
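
A simulated version of this exercise might look like the sketch below, which assumes the estimator \( M_t + (1 - p) r t \) and the mean square error \( (1 - p) r t \) derived earlier. The function name `run_once` and the seed are illustrative assumptions; with only 100 runs, the empirical value can differ noticeably from the theoretical one.

```python
import random

def run_once(rate, p, horizon, rng):
    """Return (N_t, M_t): total arrivals and type 1 arrivals in (0, horizon]."""
    n = m = 0
    t = rng.expovariate(rate)
    while t <= horizon:
        n += 1
        if rng.random() < p:
            m += 1
        t += rng.expovariate(rate)
    return n, m

rng = random.Random(1)           # seed chosen arbitrarily for reproducibility
rate, p, horizon, runs = 3.0, 0.8, 4.0, 100

squared_errors = []
for _ in range(runs):
    n, m = run_once(rate, p, horizon, rng)
    estimate = m + (1 - p) * rate * horizon      # estimate of N_t based on M_t
    squared_errors.append((n - estimate) ** 2)

empirical_mse = sum(squared_errors) / runs
theoretical_mse = (1 - p) * rate * horizon
print(f"empirical mean square error: {empirical_mse:.3f}")
print(f"theoretical mean square error: {theoretical_mse:.3f}")
```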

Suppose that a piece of radioactive material emits particles according to the Poisson model at a rate of r=100 per second. Moreover, assume that a counter detects each emitted particle, independently, with probability 0.9. Suppose that the number of detected particles in a 5 second period is 465.

  1. Estimate the number of particles emitted.
  2. Compute the mean square error of the estimate.

Details:

  1. 515
  2. 50
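
The arithmetic behind these answers can be written out as a short computation. The sketch assumes the estimator \( M_t + (1 - p) r t \) and mean square error \( (1 - p) r t \) from the estimation subsection; the function name `estimate_total` is hypothetical.

```python
def estimate_total(m_observed, rate, p, horizon):
    """Least squares estimate of N_t given M_t = m_observed, and its mean square error."""
    missed_mean = (1 - p) * rate * horizon   # E(W_t), which also equals var(W_t)
    return m_observed + missed_mean, missed_mean

# Values from the exercise: r = 100 per second, p = 0.9, t = 5 seconds, 465 detected.
estimate, mse = estimate_total(465, rate=100, p=0.9, horizon=5)
print(round(estimate), round(mse))   # prints: 515 50
```

The rounding only removes floating point noise from \( 1 - 0.9 \); the exact values are 515 and 50.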