Barbara Illowsky; Susan Dean

Santa Clara County, CA, has approximately 27,873 Japanese-Americans. Their ages are as follows:

Age Group	Percent of Community
0–17	18.9
18–24	8.0
25–34	22.8
35–44	15.0
45–54	13.1
55–64	11.9
65+	10.3

Table 2.77

Construct a histogram of the Japanese-American community in Santa Clara County, CA. The bars will not be the same width for this example. Why not? What impact does this have on the reliability of the graph?
What percentage of the community is under age 35?
Which box plot most resembles the information above?

Three box plots with values between 0 and 100. Plot i has Q1 at 24, M at 34, and Q3 at 53; Plot ii has Q1 at 18, M at 34, and Q3 at 45; Plot iii has Q1 at 24, M at 25, and Q3 at 54.

Figure 2.47

109.

Javier and Ercilia are supervisors at a shopping mall. Each was given the task of estimating the mean distance that shoppers live from the mall. They each randomly surveyed 100 shoppers. The samples yielded the following information.

	Javier	Ercilia
$\bar{x}$	6.0 miles	6.0 miles
$s$	4.0 miles	7.0 miles

Table 2.78

How can you determine which survey was correct ?
Explain what the difference in the results of the surveys implies about the data.
If the two histograms depict the distribution of values for each supervisor, which one depicts Ercilia's sample? How do you know?

Figure 2.48
If the two box plots depict the distribution of values for each supervisor, which one depicts Ercilia’s sample? How do you know?

Figure 2.49

Use the following information to answer the next three exercises: We are interested in the number of years students in a particular elementary statistics class have lived in California. The information in the following table is from the entire section.

Number of years	Frequency	Number of years	Frequency
7	1	22	1
14	3	23	1
15	1	26	1
18	1	40	2
19	4	42	2
20	3
			Total = 20

Table 2.79

110.

What is the IQR?

8
11
15
35

111.

What is the mode?

19
19.5
14 and 20
22.65

112.

Is this a sample or the entire population?

sample
entire population
neither

113.

Twenty-five randomly selected students were asked the number of movies they watched the previous week. The results are as follows:

# of movies	Frequency
0	5
1	9
2	6
3	4
4	1

Table 2.80

Find the sample mean $\bar{x}$ .
Find the approximate sample standard deviation, s.

114.

Forty randomly selected students were asked the number of pairs of sneakers they owned. Let X = the number of pairs of sneakers owned. The results are as follows:

X	Frequency
1	2
2	5
3	8
4	12
5	12
6	0
7	1

Table 2.81

Find the sample mean $\bar{x}$
Find the sample standard deviation, s
Construct a histogram of the data.
Complete the columns of the chart.
Find the first quartile.
Find the median.
Find the third quartile.
Construct a box plot of the data.
What percent of the students owned at least five pairs?
Find the 40^th percentile.
Find the 90^th percentile.
Construct a line graph of the data
Construct a stemplot of the data

115.

Following are the published weights (in pounds) of all of the team members of the San Francisco 49ers from a previous year.

177; 205; 210; 210; 232; 205; 185; 185; 178; 210; 206; 212; 184; 174; 185; 242; 188; 212; 215; 247; 241; 223; 220; 260; 245; 259; 278; 270; 280; 295; 275; 285; 290; 272; 273; 280; 285; 286; 200; 215; 185; 230; 250; 241; 190; 260; 250; 302; 265; 290; 276; 228; 265

Organize the data from smallest to largest value.
Find the median.
Find the first quartile.
Find the third quartile.
Construct a box plot of the data.
The middle 50% of the weights are from _______ to _______.
If our population were all professional football players, would the above data be a sample of weights or the population of weights? Why?
Assume the population was the San Francisco 49ers. Find:
1. the population mean, μ.
2. the population standard deviation, σ.
3. the weight that is two standard deviations below the mean.
4. When Steve Young, quarterback, played football, he weighed 205 pounds. How many standard deviations above or below the mean was he?
That same year, the mean weight for the Dallas Cowboys was 240.08 pounds with a standard deviation of 44.38 pounds. Emmit Smith weighed in at 209 pounds. With respect to his team, who was lighter, Smith or Young? How did you determine your answer?

116.

One hundred teachers attended a seminar on mathematical problem solving. The attitudes of a representative sample of 12 of the teachers were measured before and after the seminar. A positive number for change in attitude indicates that a teacher's attitude toward math became more positive. The 12 change scores are as follows:

3; 8; –1; 2; 0; 5; –3; 1; –1; 6; 5; –2

What is the mean change score?
What is the standard deviation for this population?
What is the median change score?
Find the change score that is 2.2 standard deviations below the mean.

117.

Refer to Figure 2.50 determine which of the following are true and which are false. Explain your solution to each part in complete sentences.

This shows three graphs. The first is a histogram with a mode of 3 and fairly symmetrical distribution between 1 (minimum value) and 5 (maximum value). The second graph is a histogram with peaks at 1 (minimum value) and 5 (maximum value) with 3 having the lowest frequency. The third graph is a box plot. The first whisker extends from 0 to 1. The box begins at the firs quartile, 1, and ends at the third quartile,6. A vertical, dashed line marks the median at 3. The second whisker extends from 6 on.

Figure 2.50

The medians for all three graphs are the same.
We cannot determine if any of the means for the three graphs is different.
The standard deviation for graph b is larger than the standard deviation for graph a.
We cannot determine if any of the third quartiles for the three graphs is different.

118.

In a recent issue of the IEEE Spectrum, 84 engineering conferences were announced. Four conferences lasted two days. Thirty-six lasted three days. Eighteen lasted four days. Nineteen lasted five days. Four lasted six days. One lasted seven days. One lasted eight days. One lasted nine days. Let X = the length (in days) of an engineering conference.

Organize the data in a chart.
Find the median, the first quartile, and the third quartile.
Find the 65^th percentile.
Find the 10^th percentile.
Construct a box plot of the data.
The middle 50% of the conferences last from _______ days to _______ days.
Calculate the sample mean of days of engineering conferences.
Calculate the sample standard deviation of days of engineering conferences.
Find the mode.
If you were planning an engineering conference, which would you choose as the length of the conference: mean; median; or mode? Explain why you made that choice.
Give two reasons why you think that three to five days seem to be popular lengths of engineering conferences.

119.

A survey of enrollment at 35 community colleges across the United States yielded the following figures:

6414; 1550; 2109; 9350; 21828; 4300; 5944; 5722; 2825; 2044; 5481; 5200; 5853; 2750; 10012; 6357; 27000; 9414; 7681; 3200; 17500; 9200; 7380; 18314; 6557; 13713; 17768; 7493; 2771; 2861; 1263; 7285; 28165; 5080; 11622

Organize the data into a chart with five intervals of equal width. Label the two columns "Enrollment" and "Frequency."
Construct a histogram of the data.
If you were to build a new community college, which piece of information would be more valuable: the mode or the mean?
Calculate the sample mean.
Calculate the sample standard deviation.
A school with an enrollment of 8000 would be how many standard deviations away from the mean?

Use the following information to answer the next two exercises. X = the number of days per week that 100 clients use a particular exercise facility.

x	Frequency
0	3
1	12
2	33
3	28
4	11
5	9
6	4

Table 2.82

120.

The 80^th percentile is _____

5
80
3
4

121.

The number that is 1.5 standard deviations BELOW the mean is approximately _____

0.7
4.8
–2.8
Cannot be determined

122.

Suppose that a publisher conducted a survey asking adult consumers the number of fiction paperback books they had purchased in the previous month. The results are summarized in the Table 2.83.

# of books	Freq.	Rel. Freq.
0	18
1	24
2	24
3	22
4	15
5	10
7	5
9	1

Table 2.83

Are there any outliers in the data? Use an appropriate numerical test involving the IQR to identify outliers, if any, and clearly state your conclusion.
If a data value is identified as an outlier, what should be done about it?
Are any data values further than two standard deviations away from the mean? In some situations, statisticians may use this criteria to identify data values that are unusual, compared to the other data values. (Note that this criteria is most appropriate to use for data that is mound-shaped and symmetric, rather than for skewed data.)
Do parts a and c of this problem give the same answer?
Examine the shape of the data. Which part, a or c, of this question gives a more appropriate result for this data?
Based on the shape of the data which is the most appropriate measure of center for this data: mean, median or mode?

Bringing It Together: Homework