Barbara Illowsky; Susan Dean

Establishing the type of distribution, sample size, and known or unknown standard deviation can help you figure out how to go about a hypothesis test. However, there are several other factors you should consider when working out a hypothesis test.

Rare Events

The thinking process in hypothesis testing can be summarized as follows: You want to test whether or not a particular property of the population is true. You make an assumption about the true population mean for numerical data or the true population proportion for categorical data. This assumption is the null hypothesis. Then you gather sample data that is representative of the population. From this sample data you compute the sample mean (or the sample proportion). If the value that you observe is very unlikely to occur (a rare event) if the null hypothesis is true, then you wonder why this is happening. A plausible explanation is that the null hypothesis is false.

For example, Didi and Ali are at a birthday party of a very wealthy friend. They hurry to be first in line to grab a prize from a tall basket that they cannot see inside because they will be blindfolded. There are 200 plastic bubbles in the basket, and Didi and Ali have been told that there is only one with a $100 bill. Didi is the first person to reach into the basket and pull out a bubble. Her bubble contains a $100 bill. The probability of this happening is $\frac{1}{200}$ = 0.005. Because this is so unlikely, Ali is hoping that what the two of them were told is wrong and there are more $100 bills in the basket. A rare event has occurred (Didi getting the $100 bill) so Ali doubts the assumption about only one $100 bill being in the basket.

Using the Sample to Test the Null Hypothesis

After you collect data and obtain the test statistic (the sample mean, sample proportion, or other test statistic), you can determine the probability of obtaining that test statistic when the null hypothesis is true. This probability is called the p-value.

When the p-value is very small, it means that the observed test statistic is very unlikely to happen if the null hypothesis is true. This gives significant evidence to suggest that the null hypothesis is false, and to reject it in favor of the alternative hypothesis. In practice, to reject the null hypothesis we want the p-value to be smaller than 0.05 (5 percent) or sometimes even smaller than 0.01 (1 percent).

Example 9.9

Suppose a baker claims that his bread height is more than 15 cm, on average. Several of his customers do not believe him. To persuade his customers that he is right, the baker decides to do a hypothesis test. He bakes 10 loaves of bread. The mean height of the sample loaves is 17 cm. The baker knows from baking hundreds of loaves of bread that the standard deviation for the height is 0.5 cm and the distribution of heights is normal.

The null hypothesis could be H₀: μ ≤ 15. The alternate hypothesis is H_a: μ > 15.

The words is more than translates as a ">" so "μ > 15" goes into the alternate hypothesis. The null hypothesis must contradict the alternate hypothesis.

Since σ is known (σ = 0.5 cm), the distribution for the population is known to be normal with mean μ = 15 and standard deviation $\frac{σ}{\sqrt{n}} = \frac{0.5}{\sqrt{10}} = 0.16$ .

Suppose the null hypothesis is true (which is that the mean height of the loaves is no more than 15 cm). Then is the mean height (17 cm) calculated from the sample unexpectedly large? The hypothesis test works by asking the question how unlikely the sample mean would be if the null hypothesis were true. The graph shows how far out the sample mean is on the normal curve. The p-value is the probability that, if we were to take other samples, any other sample mean would fall at least as far out as 17 cm.

The p-value, then, is the probability that a sample mean is the same or greater than 17 cm when the population mean is, in fact, 15 cm. We can calculate this probability using the normal distribution for means. In Figure 9.2, the p-value is the area under the normal curve to the right of 17. Using a normal distribution table or a calculator, we can compute that this probability is practically zero.

Normal distribution curve on average bread heights with values 15, as the population mean, and 17, as the point to determine the p-value, on the x-axis.

Figure 9.2

p-value = P( $\bar{x}$ > 17), which is approximately zero.

Because the p-value is almost 0, we conclude that obtaining a sample height of 17 cm or higher from 10 loaves of bread is very unlikely if the true mean height is 15 cm. We reject the null hypothesis and conclude that there is sufficient evidence to claim that the true population mean height of the baker’s loaves of bread is higher than 15 cm.

Try It 9.9

A normal distribution has a standard deviation of 1. We want to verify a claim that the mean is greater than 12. A sample of 36 is taken with a sample mean of 12.5.

H₀: μ ≤ 12
H_a: μ > 12
The p-value is 0.0013.
Draw a graph that shows the p-value.

Decision and Conclusion

A systematic way to make a decision of whether to reject or not reject the null hypothesis is to compare the p-value and a preset or preconceived α, also called the level of significance of the test. A preset α is the probability of a Type I error (rejecting the null hypothesis when the null hypothesis is true). It may or may not be given to you at the beginning of the problem.

When you make a decision to reject or not reject H₀, do as follows:

If p-value $< α$ , reject H₀. The results of the sample data are significant. There is sufficient evidence to conclude that H₀ is an incorrect belief and that the alternative hypothesis, H_a, may be correct.
If p-value $\geq α$ , do not reject H₀. The results of the sample data are not significant.There is not sufficient evidence to conclude that the alternative hypothesis, H_a, may be correct.
When you do not reject H₀, it does not mean that you should believe that H₀ is true. It simply means that the sample data have failed to provide sufficient evidence to cast serious doubt about the truthfulness of H₀.

Conclusion: After you make your decision, write a thoughtful conclusion about the hypotheses in terms of the given problem.

Example 9.10

When using the p-value to evaluate a hypothesis test, you might find it useful to use the following mnemonic device:

If the p-value is low, the null must go.

If the p-value is high, the null must fly.

This memory aid relates a p-value less than the established alpha (the p is low) as rejecting the null hypothesis and, likewise, relates a p-value higher than the established alpha (the p is high) as not rejecting the null hypothesis.

Problem

Fill in the blanks.

Reject the null hypothesis when ______________________________________.

The results of the sample data _____________________________________.

Do not reject the null hypothesis when __________________________________________.

The results of the sample data ____________________________________________.

Solution

Reject the null hypothesis when the p-value is less than the established alpha value. The results of the sample data support the alternative hypothesis.

Do not reject the null hypothesis when the p-value is greater or equal to the established alpha value. The results of the sample data do not support the alternative hypothesis.

Try It 9.10

It’s a Boy Genetics Labs, a genetics company, claims their procedures improve the chances of a boy being born. The results for a test of a single population proportion are as follows:

H₀: p = 0.50, H_a: p > 0.50

α = 0.01

p-value = 0.025

Interpret the results and state a conclusion in simple, nontechnical terms.

9.4 Rare Events, the Sample, and the Decision and Conclusion

Rare Events

Using the Sample to Test the Null Hypothesis

Decision and Conclusion

Problem

Solution