The exponential distribution is often concerned with the amount of time until some specific event occurs. For example, the amount of time (beginning now) until an earthquake occurs has an exponential distribution. Other examples include the length of time, in minutes, of long distance business telephone calls, and the amount of time, in months, a car battery lasts. It can be shown, too, that the value of the change that you have in your pocket or purse approximately follows an exponential distribution.
Values for an exponential random variable occur in the following way. There are fewer large values and more small values. For example, marketing studies have shown that the amount of money customers spend in one trip to the supermarket follows an exponential distribution. There are more people who spend small amounts of money and fewer people who spend large amounts of money.
Exponential distributions are commonly used in calculations of product reliability, or the length of time a product lasts.
The random variable for the exponential distribution is continuous and often measures a passage of time, although it can be used in other applications. Typical questions may be, “what is the probability that some event will occur within the next hours or days, or what is the probability that some event will occur between hours and hours, or what is the probability that the event will take more than hours to perform?” In short, the random variable X equals (a) the time between events or (b) the passage of time to complete an action, e.g. wait on a customer. The probability density function is given by:
where μ is the historical average waiting time.
and has a mean and standard deviation of 1/μ.
An alternative form of the exponential distribution formula recognizes what is often called the decay factor. The decay factor simply measures how rapidly the probability of an event declines as the random variable X increases. When the notation using the decay parameter m is used, the probability density function is presented as:
In order to calculate probabilities for specific probability density functions, the cumulative density function is used. The cumulative density function (cdf) is simply the integral of the pdf and is:
Let X = amount of time (in minutes) a postal clerk spends with a customer. The time is known from historical data to have an average amount of time equal to four minutes.
It is given that μ = 4 minutes, that is, the average time the clerk spends with a customer is 4 minutes. Remember that we are still doing probability and thus we have to be told the population parameters such as the mean. To do any calculations, we need to know the mean of the distribution: the historical time to provide a service, for example. Knowing the historical mean allows the calculation of the decay parameter, m.
. Therefore, .
When the notation used the decay parameter, m, the probability density function is presented as , which is simply the original formula with m substituted for , or .
To calculate probabilities for an exponential probability density function, we need to use the cumulative density function. As shown below, the curve for the cumulative density function is:
f(x) = 0.25e–0.25x where x is at least zero and m = 0.25.
For example, f(5) = 0.25e(-0.25)(5) = 0.072. In other words, the function has a value of .072 when x = 5.
The graph is as follows:
Notice the graph is a declining curve. When x = 0,
f(x) = 0.25e(−0.25)(0) = (0.25)(1) = 0.25 = m. The maximum value on the y-axis is always m, one divided by the mean.
Try It 5.3
The amount of time spouses shop for anniversary cards can be modeled by an exponential distribution with the average amount of time equal to eight minutes. Write the distribution, state the probability density function, and graph the distribution.
a. Using the information in Example 5.3, find the probability that a clerk spends four to five minutes with a randomly selected customer.
a. Find P(4 < x < 5).
The cumulative distribution function (CDF) gives the area to the left.
P(x < x) = 1 – e–mx
P(x < 5) = 1 – e(–0.25)(5) = 0.7135 and P(x < 4) = 1 – e(–0.25)(4) = 0.6321
P(4 < x < 5)= 0.7135 – 0.6321 = 0.0814
Try It 5.4
The number of days ahead travelers purchase their airline tickets can be modeled by an exponential distribution with the average amount of time equal to 15 days. Find the probability that a traveler will purchase a ticket fewer than ten days in advance. How many days do half of all travelers wait?
On the average, a certain computer part lasts ten years. The length of time the computer part lasts is exponentially distributed.
a. What is the probability that a computer part lasts more than 7 years?
a. Let x = the amount of time (in years) a computer part lasts.
μ = 10 so
Find P(x > 7). Draw the graph.
P(x > 7) = 1 – P(x < 7).
Since P(X < x) = 1 – e–mx then P(X > x) = 1 – ( 1 –e–mx) = e–mx
P(x > 7) = e(–0.1)(7) = 0.4966. The probability that a computer part lasts more than seven years is 0.4966.
b. On the average, how long would five computer parts last if they are used one after another?
b. On the average, one computer part lasts ten years. Therefore, five computer parts, if they are used one right after the other would last, on the average, (5)(10) = 50 years.
d. What is the probability that a computer part lasts between nine and 11 years?
d. Find P(9 < x < 11). Draw the graph.
P(9 < x < 11) = P(x < 11) – P(x < 9) = (1 – e(–0.1)(11)) – (1 – e(–0.1)(9)) = 0.6671 – 0.5934 = 0.0737. The probability that a computer part lasts between nine and 11 years is 0.0737.
Try It 5.5
On average, a pair of running shoes can last 18 months if used every day. The length of time running shoes last is exponentially distributed. What is the probability that a pair of running shoes last more than 15 months? On average, how long would six pairs of running shoes last if they are used one after the other? Eighty percent of running shoes last at most how long if used every day?
Suppose that the length of a phone call, in minutes, is an exponential random variable with decay parameter . The decay p[parameter is another way to view 1/λ. If another person arrives at a public telephone just before you, find the probability that you will have to wait more than five minutes. Let X = the length of a phone call, in minutes.
What is m, μ, and σ? The probability that you must wait more than five minutes is _______ .
- m =
- μ = 12
- σ = 12
P(x > 5) = 0.6592
The time spent waiting between events is often modeled using the exponential distribution. For example, suppose that an average of 30 customers per hour arrive at a store and the time between arrivals is exponentially distributed.
- On average, how many minutes elapse between two successive arrivals?
- When the store first opens, how long on average does it take for three customers to arrive?
- After a customer arrives, find the probability that it takes less than one minute for the next customer to arrive.
- After a customer arrives, find the probability that it takes more than five minutes for the next customer to arrive.
- Is an exponential distribution reasonable for this situation?
- Since we expect 30 customers to arrive per hour (60 minutes), we expect on average one customer to arrive every two minutes on average.
- Since one customer arrives every two minutes on average, it will take six minutes on average for three customers to arrive.
- Let X = the time between arrivals, in minutes. By part a, μ = 2, so m = = 0.5.
The cumulative distribution function is P(X < x) = 1 – e(-0.5)(x)
Therefore P(X < 1) = 1 – e(–0.5)(1) = 0.3935.
- P(X > 5) = 1 – P(X < 5) = 1 – (1 – e(-0.5)(5)) = e–2.5 ≈ 0.0821.
- This model assumes that a single customer arrives at a time, which may not be reasonable since people might shop in groups, leading to several customers arriving at the same time. It also assumes that the flow of customers does not change throughout the day, which is not valid if some times of the day are busier than others.
Memorylessness of the Exponential Distribution
Recall that the amount of time between customers for the postal clerk discussed earlier is exponentially distributed with a mean of two minutes. Suppose that five minutes have elapsed since the last customer arrived. Since an unusually long amount of time has now elapsed, it would seem to be more likely for a customer to arrive within the next minute. With the exponential distribution, this is not the case–the additional time spent waiting for the next customer does not depend on how much time has already elapsed since the last customer. This is referred to as the memoryless property. The exponential and geometric probability density functions are the only probability functions that have the memoryless property. Specifically, the memoryless property says that
P (X > r + t | X > r) = P (X > t) for all r ≥ 0 and t ≥ 0
For example, if five minutes have elapsed since the last customer arrived, then the probability that more than one minute will elapse before the next customer arrives is computed by using r = 5 and t = 1 in the foregoing equation.
P(X > 5 + 1 | X > 5) = P(X > 1) = = 0.6065.
This is the same probability as that of waiting more than one minute for a customer to arrive after the previous arrival.
The exponential distribution is often used to model the longevity of an electrical or mechanical device. In Example 5.5, the lifetime of a certain computer part has the exponential distribution with a mean of ten years. The memoryless property says that knowledge of what has occurred in the past has no effect on future probabilities. In this case it means that an old part is not any more likely to break down at any particular time than a brand new part. In other words, the part stays as good as new until it suddenly breaks. For example, if the part has already lasted ten years, then the probability that it lasts another seven years is P(X > 17|X > 10) = P(X > 7) = 0.4966, where the vertical line is read as "given".
Refer back to the postal clerk again where the time a postal clerk spends with his or her customer has an exponential distribution with a mean of four minutes. Suppose a customer has spent four minutes with a postal clerk. What is the probability that he or she will spend at least an additional three minutes with the postal clerk?
The decay parameter of X is m = = 0.25, so X ∼ Exp(0.25).
The cumulative distribution function is P(X < x) = 1 – e–0.25x.
We want to find P(X > 7|X > 4). The memoryless property says that P(X > 7|X > 4) = P (X > 3), so we just need to find the probability that a customer spends more than three minutes with a postal clerk.
This is P(X > 3) = 1 – P (X < 3) = 1 – (1 – e–0.25⋅3) = e–0.75 ≈ 0.4724.
Relationship between the Poisson and the Exponential Distribution
There is an interesting relationship between the exponential distribution and the Poisson distribution. Suppose that the time that elapses between two successive events follows the exponential distribution with a mean of μ units of time. Also assume that these times are independent, meaning that the time between events is not affected by the times between previous events. If these assumptions hold, then the number of events per unit time follows a Poisson distribution with mean μ. Recall that if X has the Poisson distribution with mean μ, then .
The formula for the exponential distribution: Where m = the rate parameter, or μ = average time between occurrences.
We see that the exponential is the cousin of the Poisson distribution and they are linked through this formula. There are important differences that make each distribution relevant for different types of probability problems.
First, the Poisson has a discrete random variable, x, where time; a continuous variable is artificially broken into discrete pieces. We saw that the number of occurrences of an event in a given time interval, x, follows the Poisson distribution.
For example, the number of times the telephone rings per hour. By contrast, the time between occurrences follows the exponential distribution. For example. The telephone just rang, how long will it be until it rings again? We are measuring length of time of the interval, a continuous random variable, exponential, not events during an interval, Poisson.
The Exponential Distribution v. the Poisson Distribution
A visual way to show both the similarities and differences between these two distributions is with a time line.
The random variable for the Poisson distribution is discrete and thus counts events during a given time period, t1 to t2 on Figure 5.20, and calculates the probability of that number occurring. The number of events, four in the graph, is measured in counting numbers; therefore, the random variable of the Poisson is a discrete random variable.
The exponential probability distribution calculates probabilities of the passage of time, a continuous random variable. In Figure 5.20 this is shown as the bracket from t1 to the next occurrence of the event marked with a triangle.
Classic Poisson distribution questions are "how many people will arrive at my checkout window in the next hour?".
Classic exponential distribution questions are "how long it will be until the next person arrives," or a variant, "how long will the person remain here once they have arrived?".
Again, the formula for the exponential distribution is:
We see immediately the similarity between the exponential formula and the Poisson formula.
Both probability density functions are based upon the relationship between time and exponential growth or decay. The “e” in the formula is a constant with the approximate value of 2.71828 and is the base of the natural logarithmic exponential growth formula. When people say that something has grown exponentially this is what they are talking about.
An example of the exponential and the Poisson will make clear the differences been the two. It will also show the interesting applications they have.
Poisson DistributionSuppose that historically 10 customers arrive at the checkout lines each hour. Remember that this is still probability so we have to be told these historical values. We see this is a Poisson probability problem.
We can put this information into the Poisson probability density function and get a general formula that will calculate the probability of any specific number of customers arriving in the next hour.
The formula is for any value of the random variable we chose, and so the x is put into the formula. This is the formula:
As an example, the probability of 15 people arriving at the checkout counter in the next hour would be
Here we have inserted x = 15 and calculated the probability that in the next hour 15 people will arrive is .061.
Exponential DistributionIf we keep the same historical facts that 10 customers arrive each hour, but we now are interested in the service time a person spends at the counter, then we would use the exponential distribution. The exponential probability function for any value of x, the random variable, for this particular checkout counter historical data is:
To calculate µ, the historical average service time, we simply divide the number of people that arrive per hour, 10 , into the time period, one hour, and have µ = 0.1. Historically, people spend 0.1 of an hour at the checkout counter, or 6 minutes. This explains the .1 in the formula.
There is a natural confusion with µ in both the Poisson and exponential formulas. They have different meanings, although they have the same symbol. The mean of the exponential is one divided by the mean of the Poisson. If you are given the historical number of arrivals you have the mean of the Poisson. If you are given an historical length of time between events you have the mean of an exponential.
Continuing with our example at the checkout clerk; if we wanted to know the probability that a person would spend 9 minutes or less checking out, then we use this formula. First, we convert to the same time units which are parts of one hour. Nine minutes is 0.15 of one hour. Next we note that we are asking for a range of values. This is always the case for a continuous random variable. We write the probability question as:
We can now put the numbers into the formula and we have our result.
The probability that a customer will spend 9 minutes or less checking out is 0.7769.
We see that we have a high probability of getting out in less than nine minutes and a tiny probability of having 15 customers arriving in the next hour.