### Stats Lab

#### Lab 1: Chi-Square Goodness-of-Fit

- The student will evaluate data collected to determine if they fit either the uniform or exponential distributions.

Collect the DataGo to your local supermarket. Ask 30 people as they leave for the total amount on their grocery receipts. Or, ask 3 cashiers for the last 10 amounts. Be sure to include the express lane, if it is open.

### Note

- Record the values.
__________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ __________ - Construct a histogram of the data. Make five to six intervals. Sketch the graph using a ruler and pencil. Scale the axes.
- Calculate the following:
- $\overline{x}$ = ________
*s*= ________*s*^{2}= ________

Uniform Distribution Test to see if grocery receipts follow the uniform distribution.

- Using your lowest and highest values,
*X*~*U*(_______, _______). - Divide the distribution into fifths.
- Calculate the following:
- lowest value = _________
- 20
^{th}percentile = _________ - 40
^{th}percentile = _________ - 60
^{th}percentile = _________ - 80
^{th}percentile = _________ - highest value = _________

- For each fifth, count the observed number of receipts and record it. Then determine the expected number of receipts and record that.
Fifth Observed Expected 1 ^{st}2 ^{nd}3 ^{rd}4 ^{th}5 ^{th} *H*: _________{0}*H*: _________{a}- What distribution should you use for a hypothesis test?
- Why did you choose this distribution?
- Calculate the test statistic.
- Find the
*p*-value. - Sketch a graph of the situation. Label and scale the
*x*-axis. Shade the area corresponding to the*p*-value. - State your decision.
- State your conclusion in a complete sentence.

Exponential Distribution Test to see if grocery receipts follow the exponential distribution with decay parameter $\frac{1}{\overline{x}}$.

- Using $\frac{1}{\overline{x}}$ as the decay parameter,
*X*~*Exp*(_________). - Calculate the following:
- lowest value = ________
- first quartile = ________
- 37
^{th}percentile = ________ - median = ________
- 63
^{rd}percentile = ________ - 3
^{rd}quartile = ________ - highest value = ________

- For each cell, count the observed number of receipts and record it. Then determine the expected number of receipts and record that.
Cell Observed Expected 1 ^{st}2 ^{nd}3 ^{rd}4 ^{th}5 ^{th}6 ^{th} *H*: _________{0}*H*: _________{a}- What distribution should you use for a hypothesis test?
- Why did you choose this distribution?
- Calculate the test statistic.
- Find the
*p*-value. - Sketch a graph of the situation. Label and scale the
*x*-axis. Shade the area corresponding to the*p*-value. - State your decision.
- State your conclusion in a complete sentence.

- Did your data fit either distribution? If so, which?
- In general, do you think it’s likely that data could fit more than one distribution? In complete sentences, explain why or why not.